仓库地址:https://github.com/CSSEGISandData/COVID-19
主要汇总COVID-19目录下的csse_covid_19_data/csse_covid_19_time_series中的某国家的累积确诊、累积死亡、累积出院数目成三个excel文件
以英国为例。
import pandas as pd
#导入csse_covid_19_data
#死亡数据
def readDeathData(country):
death_data = pd.read_csv('COVID-19/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_deaths_global.csv')
data = death_data[(death_data['Country/Region'] == country)].iloc[:,4:]
# & (death_data['Province/State'].isna())
return data.sum()
#累积确诊数据
def readConfirmedData(country):
recover_data = pd.read_csv('COVID-19/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_global.csv')
data = recover_data[(recover_data['Country/Region'] == country)].iloc[:,4:]
return data.sum()
#治愈出院数据
def readRecoveredData(country):
recover_data = pd.read_csv('COVID-19/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_recovered_global.csv')
data = recover_data[(recover_data['Country/Region'] == country)].iloc[:,4:]
return data.sum()
readDeathData('United Kingdom').to_csv('Death.csv',index=True,header=False)
readConfirmedData('United Kingdom').to_csv('Confirmed.csv',index=True,header=False)
readRecoveredData('United Kingdom').to_csv('Recovered.csv',index=True,header=False)
可以看一眼data.sum()长什么样,sum()函数按照dataframe的每一列相加,并返回(注意:以前的列索引,变成了行索引)
最后汇总成3个文件Death.csv,Confirmed.csv,Recovered.csv