mwaldstein / edgarWebR

R package for interacting with the SEC's EDGAR filing search and retrieval system
https://mwaldstein.github.io/edgarWebR/

Bug: parse_text_filing does not split when empty lines have spaces in them #4

Closed mwaldstein closed 6 years ago

mwaldstein commented 6 years ago

Bug report from Mohan:

Filing: https://www.sec.gov/Archives/edgar/data/104938/0000950131-94-000440.txt

Fix: in parse_text_filing, collapse lines that contain only spaces into empty lines before splitting:

filing_doc <- gsub("\\n +\\n", "\n\n", filing_doc)
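
One caveat with the one-line fix: because regex matches cannot overlap, two consecutive space-only lines ("\n \n \n") leave the second one untouched, and tab-only lines are not matched at all. A minimal sketch of a more tolerant variant follows. The helper name `chomp_blank_lines` and the sample text are hypothetical and not part of edgarWebR; it assumes plain "\n" line endings and is not necessarily what the package ended up using.

```r
# Hypothetical helper (not part of edgarWebR): empty out whitespace-only lines
# so that later splitting on blank lines behaves as expected.
chomp_blank_lines <- function(filing_doc) {
  # (?m) lets ^ and $ match at every line boundary (PCRE multiline mode);
  # [ \t]+ covers lines made of spaces and/or tabs.
  gsub("(?m)^[ \\t]+$", "", filing_doc, perl = TRUE)
}

# Made-up sample with two consecutive whitespace-only lines
sample_doc <- "<PAGE>\n   \n \t \nItem 1.\n\nItem 2."
cleaned <- chomp_blank_lines(sample_doc)
```

Anchoring on line boundaries instead of consuming the surrounding newlines means each whitespace-only line is handled independently, so runs of them are all cleaned in a single pass.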