nlpaueb / edgar-crawler

The only open-source toolkit that can download EDGAR financial reports and extract textual data from specific item sections into nice and clean JSON files.
GNU General Public License v3.0
274 stars, 76 forks

Serious issue with 10-Q extraction: Item 1 (Financial Statements) and Item 2 (MD&A) are merged together #22

Closed: hahoangnhan closed this issue 2 months ago

hahoangnhan commented 2 months ago

Dear team,

I am checking the parsed output for 10-Q filings and see that the Item 2 (MD&A) text is always merged into Item 1. This happens because a 10-Q contains two sections numbered Item 2:

  1. Item 2 (MD&A) in Part I
  2. Item 2 (Unregistered Sales of Equity Securities) in Part II

As a result, Items 1 and 2 of Part I are merged for every 10-Q, so the MD&A ends up inside the text of item_1 (Financial Statements). Could you look into this? I am on a deadline for my research project, so any update would be highly appreciated.
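
To illustrate the ambiguity, here is a minimal Python sketch of one way to keep the two Item 2 sections apart: split the filing on its Part headings first, then split each part on its Item headings. This is not edgar-crawler's actual code. The sample text, the regular expressions, and the names `split_by_headings` and `extract_items` are all assumptions made for illustration, and real filings are far messier than this sample.

```python
import re

# Hypothetical, heavily simplified 10-Q text (an assumption for illustration).
SAMPLE_10Q = """\
PART I - FINANCIAL INFORMATION
Item 1. Financial Statements
Balance sheet text.
Item 2. Management's Discussion and Analysis
MD&A text.
PART II - OTHER INFORMATION
Item 1. Legal Proceedings
None.
Item 2. Unregistered Sales of Equity Securities
None.
"""

# Heading patterns are assumptions; they only cover the sample layout above.
PART_RE = re.compile(r"^PART\s+(I{1,2})\b.*$", re.IGNORECASE | re.MULTILINE)
ITEM_RE = re.compile(r"^Item\s+(\d+[A-Z]?)\..*$", re.IGNORECASE | re.MULTILINE)


def split_by_headings(text, pattern):
    """Return (label, body) pairs, one per heading line matched by `pattern`."""
    matches = list(pattern.finditer(text))
    sections = []
    for i, m in enumerate(matches):
        # A section runs from the end of its heading line to the next heading.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections.append((m.group(1).upper(), text[m.end():end].strip()))
    return sections


def extract_items(text):
    """Map keys such as 'part_1_item_2' to the text of that section."""
    part_numbers = {"I": 1, "II": 2}
    items = {}
    for part_label, part_body in split_by_headings(text, PART_RE):
        for item_label, item_body in split_by_headings(part_body, ITEM_RE):
            key = f"part_{part_numbers[part_label]}_item_{item_label}"
            items[key] = item_body
    return items


items = extract_items(SAMPLE_10Q)
print(sorted(items))
```

Because the keys carry the part number, the MD&A lands under `part_1_item_2` and no longer collides with Item 2 of Part II.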

Best, HHN

eloukas commented 2 months ago

@hahoangnhan The software does not currently support 10-Q extraction, as mentioned in the README.

hahoangnhan commented 2 months ago

> @hahoangnhan The software does not currently support 10-Q extraction, as mentioned in the README.

Understood. So this is not a real issue, but I hope to see improvements in the future. Thank you so much! HHN