Skip to content

Utilize the SEC’s official package to retrieve 10-K filings and perform a word count analysis on the document content

License

Notifications You must be signed in to change notification settings

CYC202/SEC-Filings-Downloader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

SEC-Filings-Downloader

Description

A tool to retrieve filings from the SEC based on a specified identifier (tikcer, cik), form (10k, 20f, etc.), year, amendment, etc.

I also provide basic functionality for word frequency analysis from a given dictionary, including options for removing stop words and stemming.

The implementation has been optimized with multithreading to enhance retrieval speed.

It can skip existing files.

The code is encrypted for personal use; please reach out for collaboration or academic inquiries.

Features

  1. Retrieve Filings: Access and download filings based on a specified conditions.
  2. Word Frequency Analysis: Analyze the frequency of terms from a provided dictionary within the filings.
  3. Stop Words Removal: Filter out common stop words to focus on significant terms.
  4. Stemming: Reduce words to their root form for accurate frequency counts.
  5. Multithreading: Optimize retrieval speed using concurrent processing.

Note

The code is intended for personal use only and has been encrypted. If you have academic needs or wish to discuss further, feel free to reach out for collaboration and exchange of ideas.

Last Updated: November 18, 2024

About

Utilize the SEC’s official package to retrieve 10-K filings and perform a word count analysis on the document content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published