Closed IgBell closed 4 years ago
Hey @IgBell, can you give me a specific example like a URL and/or a screenshot of what you are speaking of? Also, which form type are you trying to download?
Hello Jad,
I am talking about the 13F report here. I use
dl.get("13F-HR", "1067983")
and
dl.get("13F-HR", "1541617")
and get different results. Check out the attachment: the structure inside the files is different.
-----BEGIN PRIVACY-ENHANCED MESSAGE----- Proc-Type: 2001,MIC-CLEAR Originator-Name: webmaster@www.sec.gov Originator-Key-Asymmetric: MFgwCgYEVQgBAQICAf8DSgAwRwJAW2sNKK9AVtBzYZmr6aGjlWyK3XmZv3dTINen TWSM7vrzLADbmYQaionwg5sDW3P6oaM5D3tdezXMm7z1T+B+twIDAQAB MIC-Info: RSA-MD5,RSA, KkaUTPdJoxbkMJDFv3jen6/3Pd0rbEE5WTl9qtSSwKiAicP9GYOJwDD78cG0LoIY wI2efcqlM63gl//XZbGKyw==
-----END PRIVACY-ENHANCED MESSAGE-----
Hey @IgBell, sadly this is something that is out of my control and must be a matter of the 13F format and SEC requirements changing over the years. Notice how the Berkshire 13F is from 2009 but the Altimeter Capital filing is from 2012.
If I do the following, I get very similar formats of the submission files (an XML-like file):
env ❯ python3
Python 3.7.3 (v3.7.3:ef4ec6ed12, Mar 25 2019, 16:52:21)
[Clang 6.0 (clang-600.0.57)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from sec_edgar_downloader import Downloader
>>> dl = Downloader()
>>> dl.get("13F-HR", "1067983", 1)
1
>>> dl.get("13F-HR", "1541617", 1)
1
To give you a sense of what this tool is doing, it downloads the complete submission text file that is linked from a filing's index page, for example: https://www.sec.gov/Archives/edgar/data/1067983/000095012320002466/0000950123-20-002466-index.htm.
SEC formats change over the years and are kind of a wild west since there are very lenient formatting requirements for filings. I am planning to add support for downloading XML/HTML files in the future (https://github.com/jadchaar/sec-edgar-downloader/issues/32), so this could help alleviate the issues you are seeing. Stay tuned.
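In the meantime, here is a rough sketch of how one might tell the two layouts apart after downloading. It assumes the complete submission text file wraps each attachment in `<DOCUMENT>` ... `</DOCUMENT>` with a `<TYPE>` line, and that newer filings put their XML inside `<XML>` ... `</XML>`. The `SAMPLE` string and the helper names `split_documents` and `has_xml_payload` are made up for illustration and are not part of sec-edgar-downloader.

```python
import re

# Hypothetical, heavily trimmed stand-in for a complete submission text file.
SAMPLE = """<SEC-DOCUMENT>
<DOCUMENT>
<TYPE>13F-HR
<SEQUENCE>1
<TEXT>
<XML>
<edgarSubmission>...</edgarSubmission>
</XML>
</TEXT>
</DOCUMENT>
<DOCUMENT>
<TYPE>INFORMATION TABLE
<SEQUENCE>2
<TEXT>
<XML>
<informationTable>...</informationTable>
</XML>
</TEXT>
</DOCUMENT>
</SEC-DOCUMENT>
"""

DOCUMENT_RE = re.compile(r"<DOCUMENT>(.*?)</DOCUMENT>", re.DOTALL)
TYPE_RE = re.compile(r"<TYPE>([^\n<]+)")
XML_RE = re.compile(r"<XML>(.*?)</XML>", re.DOTALL)


def split_documents(submission_text):
    """Return a list of (doc_type, xml_payload_or_None), one per <DOCUMENT>."""
    documents = []
    for match in DOCUMENT_RE.finditer(submission_text):
        body = match.group(1)
        type_match = TYPE_RE.search(body)
        doc_type = type_match.group(1).strip() if type_match else ""
        xml_match = XML_RE.search(body)
        # Assumption: older filings carry plain text here, so there is no <XML> block.
        payload = xml_match.group(1).strip() if xml_match else None
        documents.append((doc_type, payload))
    return documents


def has_xml_payload(submission_text):
    """True if at least one attachment in the submission carries an <XML> block."""
    return any(payload is not None for _, payload in split_documents(submission_text))


for doc_type, payload in split_documents(SAMPLE):
    print(doc_type, "-> XML" if payload is not None else "-> plain text")
```

To try it on a real download, read the saved txt file into a string and pass it to `split_documents`; the exact path depends on where the downloader saved the filing.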
Got it. It would be much easier if it were possible to download 13fInfoTable.html; there are some nice ready-made solutions for working with it. The complete submission text file is harder to parse into a DataFrame for comparison (well, maybe just for me as a rookie). My idea was to merge your code with this one: https://briancaffey.github.io/2018/01/30/reading-13f-sec-filings-with-python.html Thank you, Igor
The post is titled "Reading 13F SEC filings with python". In case you can't open the link, copy and paste it directly into a browser.
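For the XML-style files, a minimal sketch of turning an information table into rows could look like the following. It uses only the standard library. The element names (`infoTable`, `nameOfIssuer`, `cusip`, `value`, `sshPrnamt`) and the namespace are assumptions based on how 13F information tables commonly look, and the sample holdings are made up; `parse_info_table` is a hypothetical helper, not something the downloader provides.

```python
import xml.etree.ElementTree as ET

# Made-up sample; element names and namespace are assumptions, values are fake.
SAMPLE_XML = """<informationTable xmlns="http://www.sec.gov/edgar/document/thirteenf/informationtable">
  <infoTable>
    <nameOfIssuer>EXAMPLE CORP</nameOfIssuer>
    <titleOfClass>COM</titleOfClass>
    <cusip>000000000</cusip>
    <value>1234</value>
    <shrsOrPrnAmt>
      <sshPrnamt>100</sshPrnamt>
      <sshPrnamtType>SH</sshPrnamtType>
    </shrsOrPrnAmt>
  </infoTable>
  <infoTable>
    <nameOfIssuer>SAMPLE INC</nameOfIssuer>
    <titleOfClass>COM</titleOfClass>
    <cusip>111111111</cusip>
    <value>5678</value>
    <shrsOrPrnAmt>
      <sshPrnamt>250</sshPrnamt>
      <sshPrnamtType>SH</sshPrnamtType>
    </shrsOrPrnAmt>
  </infoTable>
</informationTable>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def parse_info_table(xml_text):
    """Return one dict per <infoTable> entry, keyed by leaf element name."""
    root = ET.fromstring(xml_text)
    rows = []
    for entry in root.iter():
        if local_name(entry.tag) != "infoTable":
            continue
        row = {}
        for node in entry.iter():
            text = (node.text or "").strip()
            # Container elements only hold whitespace, so they are skipped here.
            if text:
                row[local_name(node.tag)] = text
        rows.append(row)
    return rows


for row in parse_info_table(SAMPLE_XML):
    print(row["nameOfIssuer"], row["cusip"], row["value"])
```

The resulting list of dicts can be handed to `pandas.DataFrame(rows)` to compare holdings between filings. All values come back as strings, so numeric columns would still need converting.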
This should hopefully be fixed by https://github.com/jadchaar/sec-edgar-downloader/issues/32, so I am closing this issue for now.
For Berkshire, the program gives a nice txt file with tables in it; for Altimeter Capital, it also gives a txt file, but one that looks like XML when opened. Check out CIK 1067983 vs. 1541617.