Open JamesAlfonse opened 2 months ago
To add onto this, there are also extraction scripts available from @apes-on-parade here
Issuers are required to disclose the number of record holders each year in their 10K filing, but there is no strict requirement for the language used
Multiple phrase and sentence structures that refer to record holders would need to be identified and used.
Another challenge would be to identify issuers with multiple classes of stocks (Class A, Class B, etc.) and to be able to separate them accordingly. For now it may be useful to only extract record holder information of companies that have just one class of stock. Another issue can be created for a more refined extraction of multiple classes of stocks.
Ideally, this would be an automated script that runs daily and turns the data into a json file. Integrating that data into the database would be the final step; the .db file has CIK as the primary key and Ticker as the secondary key, so either of those columns should be used for merging purposes.
2600_yay brings attention to a tool that can be used to pull data from the SEC
They link to this repository, which provides a robust framework for extracting text from SEC documents. It can potentially be adjusted to extract data concerning record holders, which is crucial for financial analysis and investor relations.
Proposed Feature:
Use Case: This feature will be particularly useful for analysts and investors looking to aggregate or compare shareholder data across different companies, providing a clearer picture of investor engagement and stock distribution.