Earning Call Surprises Project

Authors: Brooks Li, Kush Patel, Albert Nguyen, Deeksha Koonadi, Ashley Soto

This repository contains the code used for the MIS 382N Advanced Machine Learning final project. In particular, it contains the files used to scrape the data, and then perform model analysis on the extracted data afterwards.

Overview of File Names:

Company 10-K 1.csv, Company HTML Urls 1.csv, Company Index Urls.csv, Company Names.csv, edgar_scraper.py were all used as part of the data scraping process.
Earning Surprise Boosting.ipynb, Earning Surprise Neural Nets.ipynb, Earning Surprise Random Forests.ipynb, Earning Surprise SVM.ipynb, Earning Surprise Decision Trees.ipynb were the main files used to build the models for classifying earning surprises.
Sentiment Contrastive Learning files 1, 2, 3 were prototypes of contrastive learning techniques for developing stronger sentiment scores between documents and their respective base positive and negative classes. In particular, Sentiment Contrastive Learning 3.ipynb is the most recent update of the constrastive learning technique.
Earning Surprise Data Preprocessing.ipynb is the file used for most of the data cleaning, feature extraction, data engineering.
Earning Surprise ML Classification.ipynb was used to test what happens if you include the TFIDF word vectorizer into the dataset.
Negative 10K, Positive 10K, Negative Earning Call Transcript, Positive Earning Call Transcript: The base documents generated by LLMs to compare the actual documents to in order to determine sentiment.

A summary of the project can be found on our blog posted on Medium: https://medium.com/@19lizezhou/predicting-earning-surprises-a-deep-dive-into-machine-learning-techniques-3c16b35f019f

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
Company 10-K 1.csv		Company 10-K 1.csv
Company HTML Urls 1.csv		Company HTML Urls 1.csv
Company Index Urls 1.csv		Company Index Urls 1.csv
Company Names.csv		Company Names.csv
Cosine Similarity 10K and Earning.csv		Cosine Similarity 10K and Earning.csv
Earning Surprise Boosting.ipynb		Earning Surprise Boosting.ipynb
Earning Surprise Data Preprocessing.ipynb		Earning Surprise Data Preprocessing.ipynb
Earning Surprise ML Classification.ipynb		Earning Surprise ML Classification.ipynb
Earning Surprise Neural Nets.ipynb		Earning Surprise Neural Nets.ipynb
Earning Surprise Random Forests.ipynb		Earning Surprise Random Forests.ipynb
Earning Surprise SVM.ipynb		Earning Surprise SVM.ipynb
Earning Surprises Data Set.csv		Earning Surprises Data Set.csv
Earning Surprises Decision Trees.ipynb		Earning Surprises Decision Trees.ipynb
Earning Surprises.csv		Earning Surprises.csv
Negative 10k.TXT		Negative 10k.TXT
Negative Earning Call Transcript.txt		Negative Earning Call Transcript.txt
Positive 10k.TXT		Positive 10k.TXT
Positive Earning Call Transcript.txt		Positive Earning Call Transcript.txt
README.md		README.md
Sentiment 10K.csv		Sentiment 10K.csv
Sentiment Contrastive Learning 2.ipynb		Sentiment Contrastive Learning 2.ipynb
Sentiment Contrastive Learning 3.ipynb		Sentiment Contrastive Learning 3.ipynb
Sentiment Contrastive Learning.ipynb		Sentiment Contrastive Learning.ipynb
Sentiment Earning Calls.csv		Sentiment Earning Calls.csv
WRDS Query 1.csv		WRDS Query 1.csv
WRDS Query 2.csv		WRDS Query 2.csv
Yahoo Earning Surprises.csv		Yahoo Earning Surprises.csv
cik.csv		cik.csv
cik.txt		cik.txt
edgar_scraper.py		edgar_scraper.py
requirements.TXT		requirements.TXT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Earning Call Surprises Project

About

Releases

Packages

Languages

alnguyen47/MIS-382-Final-Project

Folders and files

Latest commit

History

Repository files navigation

Earning Call Surprises Project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages