Skip to content

Python package to web scrape AO3 reading history data for each user

License

Notifications You must be signed in to change notification settings

jezaven/ao3-history

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

96 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AO3 History Scraper

GitHub issues GitHub last commit GitHub

Web scraper for collecting a user's personal AO3 reading history and organizing all the story information.

Usage

To run the project, you need to clone our repo and cd to the ao3_scrapy folder with

cd src/ao3_scraper/ao3_scraper

Add in your account information in the file /src/ao3_scrapy/ao3_scrapy/spiders/ao3_spider.py in line 147. Please note that you must have a valid AO3 account and Reading History enabled to use this API.

You can run our spider with the following command:

scrapy crawl history

If you want to save the output in a json file, run the following command:

scrapy crawl history -o nameoffile.json

Future Work

We have a long ways to go before finishing this project. Currently, we aim to package our scraper so that it's as accessible as possible. We're also planning on creating a web app that uses the data from this API and analyzes it and creates visualizations for users.

License

MIT License

Copyright (c) 2020 Fiza Goyal, Jessica Yang

About

Python package to web scrape AO3 reading history data for each user

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages