Newspaper bias analysis

This is the code that served as basis for the blogpost Predicting Political Bias with Python. The data required to run the notebooks/scripts is available on request.

Data exploration & sampling

Download and extract Zepellin. Then run:
bin/zeppelin-daemon.sh start
Go to http://localhost:8080/ and click on Import note
Open zeppelin-notebooks/articles-stats-and-samping.json and you should see the following:

We have exported a CSV file that lists how many articles there are per publisher in this dataset.

Jupyter notebooks analysis

Requirements

Install a python 3 environment

You can use virtualenv or anaconda, as long as you have a py3 version installed.

Install Jupyter notebook

You can follow the instructions here: http://jupyter.readthedocs.io/en/latest/install.html On linux or Mac OS, simply run after activating your environment: pip install jupyter

Download the training & test data (a sample of the whole dataset with about ~100K articles)

Download https://console.aws.amazon.com/s3/object/newsclustering/filtered-csv/newsclust.csv?region=us-east-1&tab=overview and place the CSV file under data/source/newsclust.csv

Packages required for `Report.ipynb`

pip install jupyter
pip install scikit-learn
pip install -U spacy
pip install plotly
pip install matplotlib
pip install pandas
pip install scipy
pip install seaborn
pip install imbalanced-learn

python -m spacy download en

Once the packages have been installed, you can launch the notebook interface using jupyter notebook. Then navigate to the jupyter-notebooks directory and open any notebook by clicking on it.

Additional packages required for `nn-classifer-keras-title.ipynb`

pip install --upgrade tensorflow
pip install keras

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
data		data
images		images
jupyter-notebooks		jupyter-notebooks
zeppelin-notebooks		zeppelin-notebooks
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Newspaper bias analysis

Data exploration & sampling

Jupyter notebooks analysis

Requirements

Install a python 3 environment

Install Jupyter notebook

Download the training & test data (a sample of the whole dataset with about ~100K articles)

Packages required for `Report.ipynb`

Additional packages required for `nn-classifer-keras-title.ipynb`

About

Releases

Packages

Contributors 3

Languages

zermelozf/newspapers-clustering

Folders and files

Latest commit

History

Repository files navigation

Newspaper bias analysis

Data exploration & sampling

Jupyter notebooks analysis

Requirements

Install a python 3 environment

Install Jupyter notebook

Download the training & test data (a sample of the whole dataset with about ~100K articles)

Packages required for Report.ipynb

Additional packages required for nn-classifer-keras-title.ipynb

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages required for `Report.ipynb`

Additional packages required for `nn-classifer-keras-title.ipynb`

Packages