Skip to content

Major airlines data analysis by comparing with LCC data _youtube, twitter, naver blog crawling

Notifications You must be signed in to change notification settings

amolanggbsp/Major-Airline-Data-Analysis_Big-Data-Marketing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 Cannot retrieve latest commit at this time.

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Major-Airline-Data-Analysis_Big-Data-Marketing

Major airlines data analysis by comparing with LCC data

1.Project Summary

purpose of this project: compare the key difference of Low Cost Career(LCC) with major korean airlines
Source of SNS for analysis: Youtube, Twitter, Naver Blog 
Key Brands: Jeju airlines, Asiana Airlines, Korean Air


SNS type	: Youtube		Naver Blog		Twitter				
Language	: Python 								
Essential Module	: sys		urllib		time	pandas	re	bs4(beautifulsoup)	selenium
Final code update date	: 2019.05.07								
crawling 기간	: Youtube_2017.05.01~ 2019.05.03		Naver_2018.04.27~2019.05.04		Twitter_2018.04.03~2019.05.04				

Youtube Naver Blog Twitter
Language Python3
Module sys, urllib, time, pandas, re, bs4, selenium
Final update 2019.05.07
Crawling Target Commentes Blog preview text Posts
crawling 기간 2017.05.01 ~ 2019.05.03 2018.04.27 ~ 2019.05.04 2018.04.03 ~ 2019.05.04

2. Crawling

-Filtering Code: filter spam words in crawled data
-Or: get data with word A or B 
-And: get data with word A and B (this code is to filter SNS comment which mentioned more than two brands) 


3. input/output files

github

4. Frequency Analysis

github

5. Conclusion

github

About

Major airlines data analysis by comparing with LCC data _youtube, twitter, naver blog crawling

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages