yextend can actually parse PDFs to scan individual components, which is awesome! Unfortunately, this relies on pdftotext, a program not available in Lambda. So when BinaryAlert scans a PDF, yextend returns an empty string and the result is a JSONDecodeError
Desired Change
Add error handling around yextend - if it fails for any reason, we should still continue with the regular analysis
Bundle pdftotext in the Lambda dependencies (this may not happen in v1.1)
Problems like this will be mitigated in the future once yextend supports portable installation
Background
yextend
can actually parse PDFs to scan individual components, which is awesome! Unfortunately, this relies onpdftotext
, a program not available in Lambda. So when BinaryAlert scans a PDF,yextend
returns an empty string and the result is aJSONDecodeError
Desired Change
yextend
- if it fails for any reason, we should still continue with the regular analysispdftotext
in the Lambda dependencies (this may not happen in v1.1)yextend
supports portable installation