This project is a comprehensive data science implementation focusing on anime data. It spans from data extraction through web scraping of MyAnimeList to deploying a web page in a Quarto environment. This project demonstrates the full data workflow, from acquisition to deployment, providing a transparent and replicable approach for anime data enthusiasts.
This repository contains the second part of the Data Architecture course project for the MCD.
Here we implement the analysis and deploy a web page in a Quarto environment.
AnimeAnalysis.yaml
: Conda environment file for the project.config.yaml
: Project configuration file.data
: Directory for storing the data used in the analysis.modelos
: Directory containing the models and related files used in the analysis.src
: Directory containing the project's source code.styles.css
: CSS file for the web page design.PDF
: Working backwards, frequently asked questions, and project architecture.
1. Clone this repository to your local machine.
2. Ensure that the Conda environment is installed.
3. Create a new Conda environment using the AnimeAnalysis.yaml file.
4. Run the scripts and notebooks in the src folder to perform the analysis and generate the results.
5. The processed data and generated models can be found in the data and modelos folders, respectively.
6. Deploy the web page using Quarto by executing the appropriate command as per the provided instructions.
For more details about the first part of the project, which includes web scraping of the main anime page and ETL in AWS, refer to the following repository: Anime_ELT_dashboard.
The generated web page is located in the _site
folder. You can access it by opening the index.html
file in your browser.
Contributions are welcome. If you wish to contribute to this project, make sure to follow the guidelines and submit your pull requests.
Thank you for your interest in this anime project!