This project is a web scraper designed to extract product information (names and prices) from the Jumia Android Phones page. The data is saved in a CSV file for further analysis. Note: This project is intended for educational purposes only.
- Extracts product names and prices.
- Handles pagination to scrape data across multiple pages.
- Automatically dismisses pop-up windows if they appear.
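The three features above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script: the CSS selectors, the `?page=N` query parameter, and the function names are all assumptions, so check them against the live page before relying on them. The functions only need a Selenium-style driver object; the locator string `"css selector"` is the value of Selenium's `By.CSS_SELECTOR`.

```python
import time

# Locator strategy string; this is the value of selenium's By.CSS_SELECTOR.
CSS = "css selector"

# Hypothetical selectors (assumptions): inspect the page and replace them.
SELECTORS = {
    "popup_close": "button.cls",
    "card": "article.prd",
    "name": "h3.name",
    "price": "div.prc",
}


def dismiss_popup(driver):
    """Click the pop-up close button if one is present; return True if clicked."""
    buttons = driver.find_elements(CSS, SELECTORS["popup_close"])
    if buttons:
        buttons[0].click()
        return True
    return False


def scrape_page(driver):
    """Collect name/price pairs from the product cards on the current page."""
    rows = []
    for card in driver.find_elements(CSS, SELECTORS["card"]):
        names = card.find_elements(CSS, SELECTORS["name"])
        prices = card.find_elements(CSS, SELECTORS["price"])
        if names and prices:  # skip cards missing either field
            rows.append({
                "name": names[0].text.strip(),
                "price": prices[0].text.strip(),
            })
    return rows


def scrape_all_pages(driver, base_url, max_pages=3, delay=1.0):
    """Visit page 1..max_pages, stopping early at the first page with no products."""
    all_rows = []
    for page in range(1, max_pages + 1):
        # Assumption: the site paginates with a ?page=N query parameter.
        driver.get(f"{base_url}?page={page}")
        dismiss_popup(driver)
        rows = scrape_page(driver)
        if not rows:
            break
        all_rows.extend(rows)
        time.sleep(delay)  # pause between page loads
    return all_rows
```

Using `find_elements` (plural) for the pop-up means a missing pop-up yields an empty list rather than an exception, so no `try`/`except` is needed.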
- Python
- Selenium for web scraping
- pandas for data processing
- Python 3.7 or later installed.
- Google Chrome browser installed.
- ChromeDriver compatible with your browser version.
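A script can find ChromeDriver either on the PATH or at an explicit location. The sketch below shows one way to support both, assuming Selenium 4 (where the driver path is passed through a `Service` object). The helper names `resolve_chromedriver` and `make_driver` are hypothetical and not taken from this project's script.

```python
import shutil


def resolve_chromedriver(explicit_path=None):
    """Return the explicit path if given, else the 'chromedriver' found on PATH (or None)."""
    if explicit_path:
        return explicit_path
    return shutil.which("chromedriver")


def make_driver(explicit_path=None):
    """Start Chrome using the resolved ChromeDriver path (assumes Selenium 4)."""
    # Imported lazily so the path helper above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.service import Service

    path = resolve_chromedriver(explicit_path)
    # With no path, Selenium falls back to its own default driver lookup.
    service = Service(executable_path=path) if path else Service()
    return webdriver.Chrome(service=service)
```

For example, `make_driver("/path/to/chromedriver")` would use a specific binary, while `make_driver()` relies on the PATH.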
- Clone this repository:
git clone https://github.com/Yahia-Hosny/jumia-scraper.git
- Install the required dependencies:
pip install selenium pandas
- Download and configure ChromeDriver:
- Download ChromeDriver.
- Place the chromedriver executable in your PATH or specify its location in the script.
- Run the script:
python web_scraber.py
- The extracted data will be saved in a file named products.csv.
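For reference, writing the scraped rows to products.csv could look like the sketch below. The project uses pandas for this step; this example swaps in the standard-library `csv` module so it runs without extra dependencies. The column names `name` and `price` and the function `save_products` are assumptions.

```python
import csv


def save_products(rows, path="products.csv"):
    """Write a list of {'name': ..., 'price': ...} dicts to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["name", "price"])
        writer.writeheader()
        writer.writerows(rows)
    return path
```

With pandas, the equivalent would be along the lines of `pandas.DataFrame(rows).to_csv("products.csv", index=False)`.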
- Ensure that your Chrome browser version matches the ChromeDriver version.
- If the scraper fails due to website changes, update the XPath/CSS selectors accordingly.
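When the scraper returns nothing, a quick way to see which selectors need updating is to test each one against the loaded page. The sketch below assumes the selectors are kept in a dictionary of CSS selectors; the function name and that layout are hypothetical.

```python
def find_stale_selectors(driver, selectors):
    """Return the names of selectors that match nothing on the currently loaded page."""
    return [
        name
        for name, css in selectors.items()
        # "css selector" is the value of selenium's By.CSS_SELECTOR
        if not driver.find_elements("css selector", css)
    ]
```

A selector for an optional element, such as a pop-up close button, may legitimately match nothing, so treat the result as a hint rather than proof of breakage.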
This project is licensed under the MIT License. See the LICENSE file for more details.
Pull requests are welcome! For major changes, please open an issue first to discuss what you would like to change.