Overview

This project is a data-driven approach to detecting Russian Disinformation utilizing a dataset of Internet Research Agency ads released by the U.S. House Intelligence Committee as part of an investigation into the alleged Russian Disinformation Campaign in the 2016 U.S. Election.

This project is carried out by Faculty members Dr. Farnoush Banaei-Kashani and Dr. Haadi Jafarian as well as student Tobby Lie at the Computer Science school of the University of Colorado Denver

Methods

Machine Learning Models on Textual Data

We utilized Support Vector Machine and Naive Bayes based solutions to training on textual data in the dataset. We gained successful metrics from this which can be viewed in our plots directory.

Deep Convolutional Neural Networks on Image Data

This is an ongoing area we are experimenting with. We have gained successful metrics via VGG16 and ResNet50. These results can also be viewed in the Russian-Disinformation-Project/CNN_experiments/CNN experiments metrics directory.

Latent Dirichlet Allocation to Topic Model

We utilized LDA to derive the dominant themes in our data and to draw correlations between the themes and data samples themselves.

Future Work

We intend to co-train our models to effectively combine our efforts in image and textual data training. This will be carried out after our CNN training has been finalized.

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
.ipynb_checkpoints		.ipynb_checkpoints
CNN_experiments		CNN_experiments
LDAvis Visualization		LDAvis Visualization
__pycache__		__pycache__
data		data
documents		documents
fully_connected_embedding_fusion		fully_connected_embedding_fusion
notebooks_to_get_image_data_for_cnns		notebooks_to_get_image_data_for_cnns
plots_for_NB_SVM_topic_modeling		plots_for_NB_SVM_topic_modeling
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
NB_measures.json		NB_measures.json
README.md		README.md
Russian_Disinformation_Guns.csv		Russian_Disinformation_Guns.csv
SVM_Guns_predictions.csv		SVM_Guns_predictions.csv
SVM_guns.ipynb		SVM_guns.ipynb
SVM_measures.json		SVM_measures.json
bag_of_words.ipynb		bag_of_words.ipynb
barchart_plotter.py		barchart_plotter.py
barchart_plotter_script.py		barchart_plotter_script.py
dominant_topic.csv		dominant_topic.csv
environment.yaml		environment.yaml
items-Copy1.csv		items-Copy1.csv
items.csv		items.csv
latent_dirichlet_allocation.ipynb		latent_dirichlet_allocation.ipynb
measures.py		measures.py
naive_bayes.ipynb		naive_bayes.ipynb
nlp_experiments_items.ipynb		nlp_experiments_items.ipynb
oversampled_dictionary.txt		oversampled_dictionary.txt
perf_measure.ipynb		perf_measure.ipynb
sent_topics_sorteddf_mallet.csv		sent_topics_sorteddf_mallet.csv
singular_value_decomposition.ipynb		singular_value_decomposition.ipynb
support_vector_machine.ipynb		support_vector_machine.ipynb
tag-matrix-Copy1.csv		tag-matrix-Copy1.csv
tag-matrix.csv		tag-matrix.csv
tempCodeRunnerFile.py		tempCodeRunnerFile.py
text_directory.csv		text_directory.csv
topic_directory_df.csv		topic_directory_df.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Methods

Machine Learning Models on Textual Data

Deep Convolutional Neural Networks on Image Data

Latent Dirichlet Allocation to Topic Model

Future Work

About

Releases

Packages

Languages

License

tobby-lie/Russian-Disinformation-Project

Folders and files

Latest commit

History

Repository files navigation

Overview

Methods

Machine Learning Models on Textual Data

Deep Convolutional Neural Networks on Image Data

Latent Dirichlet Allocation to Topic Model

Future Work

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages