This simple python script collects daily data from the coronavirus status updates published by Tecent.
Data source: https://news.qq.com/zt2020/page/feiyan.htm
Note: It focuses on China only.
The script grabs the data and save them into three seperate csv files in the path where the script is stored.
The data collected contain information at country , province and prefecture levels. For country level data, it collects all data from 1/13/2020 to the time being. For province and prefecture level data, it collects data for the day ONLY since there is no historical data available from the data source.
Fields of the country level output are as follows:
- Date (日期)
- Accumulated Confirmed (累计确诊)
- Suspect (疑似)
- Dead (死亡)
- Heal (治愈)
- Current Confirmed (现有确诊)
- Current Severe (现有重症)
- Death Rate (死亡率)
- Heal Rate (治愈率)
Fields of the province level output are as follows:
- Province (省)
- Daily Added Confirmed (新增确诊)
- Accumulated Confirmed (累计确诊)
- Dead (死亡)
- Healed (治愈)
- Death Rate (死亡率)
- Heal Rate (治愈率)
Fields of the prefecture level output are as follows:
- Province (省)
- Prefecture (市)
- Daily Added Confirmed (新增确诊)
- Accumulated Confirmed (累计确诊)
- Dead (死亡)
- Healed (治愈)
- Death Rate (死亡率)
- Heal Rate (治愈率)
To run the script, please make sure you are using python 3.x, and have libraries of pandas and requests installed.
A set of sample results is also provided.