ryansmccoy / py-sec-edgar

Python application used to download, parse, and extract structured/unstructured data from filings in the SEC Edgar Database (including 10-K, 10-Q, 13-D, S-1, 8-K, etc.)
Other
101 stars 17 forks source link

Running version from master fails with AttributeError: module 'pandas' has no attribute 'np' #14

Open dredozubov opened 1 month ago

dredozubov commented 1 month ago
...

2024-10-15 12:50:36,231 INFO py_sec_edgar.process: Initalizing FilingProcessor...
('CIK                                                        104169\n'
 'Company Name                                         Walmart Inc.\n'
 'Form Type                                                    10-Q\n'
 'Date Filed                                             2024-08-30\n'
 'Filename               edgar/data/104169/0000104169-24-000141.txt\n'
 'published                                              2024-08-30\n'
 'url             https://www.sec.gov/Archives/edgar/data/104169...\n'
 'Name: 74687, dtype: object')
2024-10-15 12:50:36,232 INFO py_sec_edgar.utilities:

    Downloading:    https://www.sec.gov/Archives/edgar/data/104169/0000104169-24-000141.txt

2024-10-15 12:50:36,981 INFO py_sec_edgar.utilities:

    Saving to:  /Users/dr/workspace/py-sec-edgar/py-sec-edgar/sec_gov/Archives/edgar/data/104169/0000104169-24-000141.txt

2024-10-15 12:50:37,285 INFO py_sec_edgar.utilities:

    Success!    Saved to filepath:  /Users/dr/workspace/py-sec-edgar/py-sec-edgar/sec_gov/Archives/edgar/data/104169/0000104169-24-000141.txt

2024-10-15 12:50:37,285 INFO py_sec_edgar.extract:

    Extracting Filing Documents:

2024-10-15 12:50:37,285 INFO py_sec_edgar.extract: extracting documents to /Users/dr/workspace/py-sec-edgar/py-sec-edgar/sec_gov/Archives/edgar/data/104169/000010416924000141
    20240830165041

2024-10-15 12:50:36,231 INFO py_sec_edgar.process: Initalizing FilingProcessor...
('CIK                                                        104169\n'
 'Company Name                                         Walmart Inc.\n'
 'Form Type                                                    10-Q\n'
 'Date Filed                                             2024-08-30\n'
 'Filename               edgar/data/104169/0000104169-24-000141.txt\n'
 'published                                              2024-08-30\n'
 'url             https://www.sec.gov/Archives/edgar/data/104169...\n'
 'Name: 74687, dtype: object')
2024-10-15 12:50:36,232 INFO py_sec_edgar.utilities:

    Downloading:    https://www.sec.gov/Archives/edgar/data/104169/0000104169-24-000141.txt

2024-10-15 12:50:36,981 INFO py_sec_edgar.utilities:

    Saving to:  /Users/dr/workspace/py-sec-edgar/py-sec-edgar/sec_gov/Archives/edgar/data/104169/0000104169-24-000141.txt

2024-10-15 12:50:37,285 INFO py_sec_edgar.utilities:

    Success!    Saved to filepath:  /Users/dr/workspace/py-sec-edgar/py-sec-edgar/sec_gov/Archives/edgar/data/104169/0000104169-24-000141.txt

2024-10-15 12:50:37,285 INFO py_sec_edgar.extract:

    Extracting Filing Documents:

2024-10-15 12:50:37,285 INFO py_sec_edgar.extract: extracting documents to /Users/dr/workspace/py-sec-edgar/py-sec-edgar/sec_gov/Archives/edgar/data/104169/000010416924000141
    20240830165041

Traceback (most recent call last):
  File "/Users/dr/miniconda3/envs/py-sec-edgar/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/Users/dr/miniconda3/envs/py-sec-edgar/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/Users/dr/workspace/py-sec-edgar/py-sec-edgar/py_sec_edgar/__main__.py", line 82, in <module>
    main()
  File "/Users/dr/miniconda3/envs/py-sec-edgar/lib/python3.8/site-packages/click/core.py", line 1157, in __call__
    return self.main(*args, **kwargs)
  File "/Users/dr/miniconda3/envs/py-sec-edgar/lib/python3.8/site-packages/click/core.py", line 1078, in main
    rv = self.invoke(ctx)
  File "/Users/dr/miniconda3/envs/py-sec-edgar/lib/python3.8/site-packages/click/core.py", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/Users/dr/miniconda3/envs/py-sec-edgar/lib/python3.8/site-packages/click/core.py", line 783, in invoke
    return __callback(*args, **kwargs)
  File "/Users/dr/workspace/py-sec-edgar/py-sec-edgar/py_sec_edgar/__main__.py", line 76, in main
    filing_broker.process(sec_filing)
  File "/Users/dr/workspace/py-sec-edgar/py-sec-edgar/py_sec_edgar/process.py", line 51, in process
    filing_content = self.extract(filing_filepaths)
  File "/Users/dr/workspace/py-sec-edgar/py-sec-edgar/py_sec_edgar/extract.py", line 28, in extract
    filing_contents = extract_complete_submission_filing(filing_json['filing_filepath'], output_directory=filing_json['extracted_filing_directory'])
  File "/Users/dr/workspace/py-sec-edgar/py-sec-edgar/py_sec_edgar/extract.py", line 73, in extract_complete_submission_filing
    filing_header = header_parser(raw_text)
  File "/Users/dr/workspace/py-sec-edgar/py-sec-edgar/py_sec_edgar/parse/header.py", line 45, in header_parser
    header_dict = header_dict.replace('', pd.np.nan)
AttributeError: module 'pandas' has no attribute 'np'