Closed by john-friedman 1 week ago.
Also: the structure of the output dict is unintuitive; will improve soon.
Considering implementing this file structure:
{
  "id": "000000695524000046_epac-20240831",
  "document": {
    "parti": {
      "item1": "Item 1. Business\n\nGeneral\n\nEnerpac Tool Group Corp...",
      "item1a": "Item 1A. Risk Factors\n\nThe following risks...",
      "item1b": "Item 1B. Unresolved Staff Comments\n\nNone.",
      "item1c": "Item 1C. Cybersecurity\n\nOur cybersecurity program..."
    },
    "partii": {
      "item7": "Item 7. Management's Discussion and Analysis...",
      "item7a": "Item 7A. Quantitative and Qualitative Disclosures..."
    }
  }
}
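For illustration, here is a minimal sketch of how a dict in the proposed layout could be read. The helper names `get_item` and `list_items` are hypothetical and not part of the library, and the sample dict is a shortened stand-in that assumes the layout above is adopted as written.

```python
from typing import List, Optional, Tuple

# Hypothetical sample in the proposed layout (only one item kept, for brevity).
parsed = {
    "id": "000000695524000046_epac-20240831",
    "document": {
        "parti": {
            "item1b": "Item 1B. Unresolved Staff Comments\n\nNone.",
        },
        "partii": {},
    },
}


def get_item(filing: dict, part: str, item: str) -> Optional[str]:
    """Return the text of one item, or None if the part or item is absent."""
    return filing.get("document", {}).get(part, {}).get(item)


def list_items(filing: dict) -> List[Tuple[str, str]]:
    """List every (part, item) pair present in the filing."""
    document = filing.get("document", {})
    return [(part, item) for part, items in document.items() for item in items]


text = get_item(parsed, "parti", "item1b")
missing = get_item(parsed, "partiii", "item10")  # absent part, so None
available = list_items(parsed)
```

Nesting items under their part keeps lookups to two keys and makes a missing section a `None` rather than a `KeyError`.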
Issue with `load_file_content`. Fixed.