bws0013 / read_13f

Read 13f filings and present them in a useful form such as a database.
https://www.sec.gov/edgar/searchedgar/companysearch.html
Apache License 2.0

read_13f

This is a Java program I built to retrieve information about companies' holdings as listed on their 13F filings. These are essentially the long positions held by investment companies. The filings are reported to the SEC within about 45 days of the end of the quarter, so an investment firm's filings for the 4th quarter of 2017 will be reported around February 15th, 2018.
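To make the 45-day window concrete, here is a minimal sketch of the date arithmetic. The `FilingDeadline` class and its method names are hypothetical and are not part of read_13f. It uses `java.time`, so it assumes Java 8 or newer, and it ignores any adjustment for weekends or holidays.

```java
import java.time.LocalDate;
import java.time.YearMonth;

// Hypothetical helper for illustration only; not part of read_13f.
class FilingDeadline {

    // Last calendar day of the given quarter (1 through 4).
    static LocalDate quarterEnd(int year, int quarter) {
        if (quarter < 1 || quarter > 4) {
            throw new IllegalArgumentException("quarter must be 1-4, got " + quarter);
        }
        // Quarters end in months 3, 6, 9 and 12.
        return YearMonth.of(year, quarter * 3).atEndOfMonth();
    }

    // Approximate due date: 45 calendar days after the quarter ends.
    // Assumption: no weekend or holiday adjustment is applied.
    static LocalDate approximateDeadline(int year, int quarter) {
        return quarterEnd(year, quarter).plusDays(45);
    }

    public static void main(String[] args) {
        // 4th quarter of 2017 ends on 2017-12-31.
        System.out.println(approximateDeadline(2017, 4)); // prints 2018-02-14
    }
}
```

For the 4th quarter of 2017 this gives February 14th, 2018, which lines up with the "around February 15th" estimate above.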

Reminder: This program is meant for learning purposes. I do not and cannot recommend using it to make investment decisions.

Second Reminder: This program is still under development, and some features, such as option 5 on the prompt, do not work.

Also keep in mind that you will probably need at least Java 7 to run this program.

You may run the program and get a message that says something like Issue with this doc. If you get this, it means the program read as many documents as it could before it ran into an issue with one of the files. This is probably a formatting issue. The more old docs I look through, the more weird formatting I see. I will figure out a way to address this. There will still be an output of all of the files the program was able to process.

If you do not care about the inner workings of the program, you can clone/download the repo and follow the instructions in the folder called Just_Do_It_Already. Everything after this sentence is just me rambling about the program and its development, and has nothing to do with running it.

That said, I built a program to test the hypothesis that the combined portfolios of several different profitable firms could be used to outperform the index. This is a remake of that program that is much easier to use. That I am releasing anything should tell you whether or not my tests succeeded in proving my hypothesis.

Simplified definitions for the sake of explanation

Questions you may have

What does the program do?

How do I run it?

What is the legality of scraping this data?

Why is such a program necessary and why is this program not shorter?

Why does it take so long to run?

What can you use this for?

What is the difference between a 13F-HR and a 13F-HR/A? Which does this program use?

What is the difference between a cusip and an excel_cusip?

Will this program get me to the moon?

Your maven is messed up...

Contribution

I guess if you would like to contribute, you can just fork the repo and make a pull request. I generally only use GitHub for personal projects and Bitbucket for work, so I am not as familiar with the interface on here.

Development related stuff

If you have any questions/comments/concerns, feel free to create a GitHub issue.

If you find a format that my program is unable to process, please provide the link to the filing on the SEC website and I will get right on adding it.

I have added some documentation if you want to see what the code is actually doing.