A web scraper for departing flights on the FlightStats website (optionally filtered by airport type, location level and elevation), enriched with Covid-19 API data (country, region, city and travel restrictions). A short presentation of the project is linked below.

dormeir999/flightstats_scraper

Flights stats scraper during Covid-19

POC of the Project: https://docs.google.com/presentation/d/1sIZYruUzLq0ge70ACb_Ux8AyxM_FX4rozimYFW2nai4/edit?usp=sharing

The final dashboard: [image]

Some visualizations: [image] [image]

What is this repository for?

  • Flightstats_scraper is a web scraper that collects departure and flight data for large and medium airports around the world from the https://www.flightstats.com/ website.

  • Version 1.3

Requirements:

  • Python 3

Python Libraries:

  • selenium
  • bs4 (BeautifulSoup)
  • requests
  • re
  • csv
  • sys
  • os

Web browser: The program uses Chrome as its browser. For Selenium to work, a WebDriver needs to be installed.
Instructions: https://selenium-python.readthedocs.io/installation.html
Download: https://sites.google.com/a/chromium.org/chromedriver/downloads

Data: Airport data was taken from https://ourairports.com/data/airports.csv
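As a rough illustration of how the airport data might be read and narrowed down to large and medium airports, here is a minimal sketch. It is not the scraper's actual code: the `load_airports` helper and the sample rows are hypothetical, and skipping rows without an IATA code is an assumption. The column names (`type`, `iata_code`, and so on) and the type values `large_airport` / `medium_airport` follow the OurAirports file.

```python
import csv
import io

# Tiny in-memory stand-in for airports.csv (made-up rows, a few real columns).
SAMPLE_CSV = """id,type,name,elevation_ft,iso_country,iata_code
1,large_airport,Example International,1026,US,AAA
2,small_airport,Example Airfield,300,US,
3,medium_airport,Example Regional,5,FR,BBB
"""


def load_airports(csv_file, wanted_types=("large_airport", "medium_airport")):
    """Return rows of a wanted type that also carry an IATA code (assumption)."""
    reader = csv.DictReader(csv_file)
    return [
        row for row in reader
        if row["type"] in wanted_types and row["iata_code"]
    ]


airports = load_airports(io.StringIO(SAMPLE_CSV))
print([a["iata_code"] for a in airports])
```

With the real file, the same helper could be called on `open("airports.csv", newline="", encoding="utf-8")` instead of the in-memory sample.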

Deployment:

Run flightstats-scraper.py with the airport data CSV file as its argument: python3 flightstats-scraper.py airports.csv
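The command-line handling for that invocation could look roughly like the sketch below. The function names `parse_args` and `main` and the error messages are hypothetical; the real script may validate its argument differently.

```python
import os
import sys


def parse_args(argv):
    """Return the airport CSV path from argv, or raise ValueError."""
    if len(argv) != 2:
        raise ValueError("usage: python3 flightstats-scraper.py airports.csv")
    path = argv[1]
    if not os.path.isfile(path):
        raise ValueError(f"airport data file not found: {path}")
    return path


def main(argv):
    """Validate the arguments; return a process exit code."""
    try:
        csv_path = parse_args(argv)
    except ValueError as err:
        print(err, file=sys.stderr)
        return 1
    print(f"Scraping airports listed in {csv_path}")
    return 0

# In a script this would typically end with: sys.exit(main(sys.argv))
```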

Testing:

Before performing the scrape, the scraper first tries to fetch recent flights from Hartsfield–Jackson Atlanta International Airport, the busiest airport in the world. If this succeeds, it moves on to scraping the airports in the supplied CSV file.
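The check-then-scrape flow described here can be sketched as follows. This is an illustration only: `run_scrape`, `fetch_departures` and `fake_fetch` are hypothetical names, and the fetch function is passed in so the flow can be shown without touching the network. `ATL` is the IATA code for the Atlanta airport.

```python
ATLANTA_IATA = "ATL"  # Hartsfield-Jackson Atlanta International Airport


def run_scrape(airport_codes, fetch_departures):
    """Smoke-test the fetcher on Atlanta, then scrape every supplied airport."""
    try:
        fetch_departures(ATLANTA_IATA)
    except Exception as err:
        # Stop early: if the busiest airport cannot be fetched, nothing will be.
        raise RuntimeError(f"pre-flight check failed: {err}") from err
    return {code: fetch_departures(code) for code in airport_codes}


def fake_fetch(code):
    """Stand-in for the real Selenium-based fetch (made-up flight numbers)."""
    return [f"{code}100", f"{code}200"]


results = run_scrape(["JFK", "LHR"], fake_fetch)
print(sorted(results))
```

Failing fast on a known-busy airport avoids walking the whole CSV file when the site layout or the WebDriver setup is broken.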

Database Design:

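/***

| FIELD        | TYPE    | NULL | KEY  | DEFAULT | EXTRA |
|--------------|---------|------|------|---------|-------|
| id           | INT     |      | [PK] |         |       |
| name         | VARCHAR |      |      |         |       |
| type         | VARCHAR |      |      |         |       |
| elevation_ft | INTEGER |      |      |         |       |
| continent    | VARCHAR |      |      |         |       |
| iso_country  | VARCHAR |      |      |         |       |
| iso_region   | VARCHAR |      |      |         |       |
| municipality | VARCHAR |      |      |         |       |
| gps_code     | VARCHAR |      |      |         |       |
| iata_code    | VARCHAR |      |      |         |       |
| local_code   | VARCHAR |      |      |         |       |
| longitude    | FLOAT   |      |      |         |       |
| latitude     | FLOAT   |      |      |         |       |

***/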
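For illustration, the design above could be expressed as a table definition like the one below. The README does not name a database engine or a table name, so SQLite (via Python's built-in `sqlite3` module) and the table name `airports` are assumptions made only for this sketch.

```python
import sqlite3

# Table name and engine are assumptions; fields and types follow the design above.
SCHEMA = """
CREATE TABLE IF NOT EXISTS airports (
    id           INTEGER PRIMARY KEY,
    name         VARCHAR,
    type         VARCHAR,
    elevation_ft INTEGER,
    continent    VARCHAR,
    iso_country  VARCHAR,
    iso_region   VARCHAR,
    municipality VARCHAR,
    gps_code     VARCHAR,
    iata_code    VARCHAR,
    local_code   VARCHAR,
    longitude    FLOAT,
    latitude     FLOAT
)
"""

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.execute(SCHEMA)

# PRAGMA table_info returns one row per column; index 1 is the column name.
columns = [row[1] for row in conn.execute("PRAGMA table_info(airports)")]
print(columns)
```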
