john-friedman / datamule-python

A package to work with SEC data. Incorporates datamule endpoints.
MIT License
66 stars 7 forks source link

10-K parser missing text from links #6

Closed john-friedman closed 1 week ago

john-friedman commented 1 week ago

Issue with load_file_content. Fixed.

john-friedman commented 1 week ago

Also: structure of output dict is unintuitive, will improve soon.

john-friedman commented 1 week ago

Considering implementing this file structure

{
   "id": "000000695524000046_epac-20240831",
   "document": {
       "parti": {
           "item1": "Item  1. Business\n\nGeneral\n\nEnerpac Tool Group Corp...",
           "item1a": "Item 1A. Risk Factors\n\nThe following risks...",
           "item1b": "Item 1B. Unresolved Staff Comments\n\nNone.",
           "item1c": "Item 1C. Cybersecurity\n\nOur cybersecurity program..."
       },
       "partii": {
           "item7": "Item 7. Management's Discussion and Analysis...",
           "item7a": "Item 7A. Quantitative and Qualitative Disclosures..."
       }
   }
}