According to WHO, 2019 Novel Corona Virus (COVID-19) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness, which was unknown before the outbreak began in Wuhan, China, in December 2019 [1]. Early on, the disease demonstrated an animal-to-person spread, then a person-to-person spread. Infections with COVID-19, were reported in a growing number of international locations, including the United States". The United States reported the first confirmed instance of person-to-person spread with this virus on January 30, 2020 [2].
This notebook shows how to use the ArcGIS API for Python
to monitor the spread of COVID-19 as it became a pandemic (updated as of June 28th, 2021).
1. Import Data
Esri provides an open-to-public and free-to-share feature layer that contains the most up-to-date COVID-19 cases covering China, the United States, Canada, Australia (at province/state level), and the rest of the world (at country level, represented by either the country centroids or their capitals). Data sources are WHO, US CDC, China NHC, ECDC, and DXY. The China data is updated automatically at least once per hour, and non-China data is updating manually. The data source repo that this layer referenced from, is created and maintained by the Center for Systems Science and Engineering (CSSE) at the Johns Hopkins University, and can be viewed here. In this notebook, we will use the feature layer supported by Esri Living Atlas team and JHU Data Services, and provide a different perspective in viewing the global maps of COVID-19 via the use of ArcGIS API for Python.
NOTE: "Since COVID-19 is continuously evolving, the sample reflects data as of June 28th, 2021 (or if you are looking at a previously published version of this notebook, the data and maps were fetched/made as of July 4th, 2020). Running this notebook at a later date might reflect a different result, but the overall steps should hold good."
DISCLAIMER: "This notebook is for the purpose of illustrating an analytical process using Jupyter Notebooks and ArcGIS API for Python, and should not be used as medical or epidemiological advice."
Necessary Imports
from io import BytesIO
import requests
import pandas as pd
from arcgis.features import FeatureLayer
from arcgis.gis import GIS
from arcgis.mapping import WebMap
"""
# if you are using arcgis api for python with version 1.8.0 or above,
# make sure that the pandas version>=1.0,
# if not, use `pip install --upgrade pandas>=1` to upgrade.
"""
pd.__version__
'1.2.3'
Get data for the analysis
Query the source feature layer
The dashboard item contributed to the public by Esri
and JHU CSSE
, is accessible through here:
gis = GIS(profile="your_online_profile")
item = gis.content.search("Coronavirus_2019_nCoV_Cases owner:CSSE_covid19", outside_org=True)[0]
item
Through the API Explorer
provided along with the dashboard product, we can easily fetch the source URL for the Feature Service containing daily updated COVID-19 statistics, which can then be used to create a FeatureLayer
object good for querying and visualizing.
src_url = "https://services1.arcgis.com/0MSEUqKaxRlEPj5g/arcgis/rest/services/Coronavirus_2019_nCoV_Cases/FeatureServer/1"
fl = FeatureLayer(url=src_url)
df_global = fl.query(where="1=1",
return_geometry=True,
as_df=True)
As stated in the dashboard, the source data can be grouped into:
- A. Countries or regions of which data are collected at province/state level, e.g. China, the United States, Canada, Australia;
- B. Countries or regions for the rest of the world of which data collected at country level, and shape represented by either the country centroids or their capitals;
- C. Cruise Ships with confirmed COVID-19 cases.
Group A
Let us first take a look at how many countries are within group A, that Country_Region
and Province_State
are not null or NAN.
df_global[~pd.isnull(df_global['Province_State'])].groupby('Country_Region').sum(numeric_only=True)[['Confirmed', 'Recovered', 'Deaths']]
Confirmed | Recovered | Deaths | |
---|---|---|---|
Country_Region | |||
Australia | 30528 | 23781 | 910 |
Belgium | 1082476 | 0 | 25160 |
Brazil | 18420598 | 16160826 | 513474 |
Canada | 1421009 | 1385623 | 26188 |
Chile | 1547103 | 1484410 | 32298 |
China | 103727 | 98331 | 4847 |
Colombia | 4158716 | 3854457 | 104678 |
Denmark | 822 | 798 | 1 |
France | 128701 | 64102 | 1069 |
Germany | 3734597 | 3619010 | 90777 |
India | 30279331 | 29309607 | 396730 |
Italy | 4258069 | 4073435 | 127472 |
Japan | 797079 | 763208 | 14659 |
Mexico | 2505792 | 1990610 | 232564 |
Netherlands | 1711530 | 27290 | 18024 |
New Zealand | 1 | 0 | 0 |
Pakistan | 955657 | 901201 | 22231 |
Peru | 2048115 | 2007201 | 191899 |
Russia | 5408744 | 4911752 | 131671 |
Spain | 3792642 | 150376 | 80789 |
Sweden | 1088896 | 0 | 14619 |
US | 33631532 | 0 | 604030 |
Ukraine | 2298478 | 2221380 | 54517 |
United Kingdom | 4771289 | 15724 | 128367 |
Each country/region in Group A, has more than 1 feature, as what we have seen below from the query() results.
fset_usa = fl.query(where="Country_Region='US'")
fset_usa
<FeatureSet> 60 features
fset_china = fl.query(where="Country_Region='China'")
fset_china
<FeatureSet> 34 features
fl.query(where="Country_Region='Denmark'")
<FeatureSet> 3 features
Group C
Group C contains cruise ships across the globe with reported cases:
df_cruise_ships = fl.query(where="Province_State='Diamond Princess' or \
Province_State='Grand Princess' or \
Country_Region='MS Zaandam' or \
Country_Region='Diamond Princess'",
as_df=True)
df_cruise_ships[["Province_State", "Country_Region", "Last_Update", "Confirmed", "Recovered", "Deaths"]]
Province_State | Country_Region | Last_Update | Confirmed | Recovered | Deaths | |
---|---|---|---|---|---|---|
0 | Diamond Princess | Canada | 2020-07-05 00:34:01 | 0 | 0 | 1 |
1 | Grand Princess | Canada | 2020-07-05 00:34:01 | 13 | 13 | 0 |
2 | None | Diamond Princess | 2020-07-05 00:34:01 | 712 | 651 | 13 |
3 | None | MS Zaandam | 2020-07-05 00:34:01 | 9 | 0 | 2 |
4 | Grand Princess | US | 2020-07-05 00:34:01 | 103 | 0 | 3 |
5 | Diamond Princess | US | 2020-07-05 00:34:01 | 49 | 0 | 0 |
Group B
In the df_global
, other than the 22 countries (Australia, Canada, China, etc.) in Group A, and those cruise ships in Group C, all other countries/regions fall into Group B, e.g. Thailand. The major difference between Group A and Group B is that the latter contains one and only feature per country.
fl.query(where="Country_Region='Thailand'")
<FeatureSet> 1 features
Query the reference feature layers
Because the geo-information provided by the dashboard contains only the coordinates representing the centroid of each country/region, the feature layer can only be rendered as points on Map. If this is what you want, you can now skip the rest of section 1, and jump right onto section 2.
On the other hand, if you want to visualize the confirmed/death/recovered cases per country/region as polygons, in other words as a choropleth map, please read along:
First, we need to access the feature service that contains geometry/shape info for all provinces in Mainland China, and merge with the COVID-19 DataFrame.
Access the reference feature layer of China
provinces_item = gis.content.get("0f57da7f853c4a1aa5b2e048ff8655d2")
provinces_item
provinces_flayer = provinces_item.layers[0]
provinces_df = provinces_flayer.query(as_df=True)
provinces_df.columns
Index(['OBJECTID', 'ID', 'NAME', 'AREA', 'TOTPOP_CY', 'ISO_CODE', 'ISO_SUB', 'ISO2_CC', 'ISO3_CC', 'Shape__Area', 'Shape__Length', 'ID_1', 'sourceCountry', 'ENRICH_FID', 'aggregationMethod', 'populationToPolygonSizeRating', 'apportionmentConfidence', 'HasData', 'TOTPOP_CY_1', 'YEAR2018', 'SHAPE'], dtype='object')
tmp = provinces_df.sort_values('NAME', ascending=True)
provinces_df = tmp.drop_duplicates(subset='NAME', keep='last')
provinces_df.shape
(31, 21)
DataFrame Merging for China Dataset
The subsets of dataframe being created in the previous section now needs to be merged with feature services which have geographic information (e.g. geometries, shape, or longitude/latitude) in order to provide location and geometries required for geographic mapping. First, let's acquire the geometries from feature services existing on living atlas or arcgis online organization to represent the geographic information needed of overlap_rows_china
.
df_china = fset_china.sdf[['Province_State', 'Confirmed', 'Recovered', 'Deaths']]
df_china = df_china.assign(NAME = df_china["Province_State"])
df_china.head()
Province_State | Confirmed | Recovered | Deaths | NAME | |
---|---|---|---|---|---|
0 | Anhui | 1004 | 998 | 6 | Anhui |
1 | Beijing | 1078 | 1054 | 9 | Beijing |
2 | Chongqing | 598 | 591 | 6 | Chongqing |
3 | Fujian | 672 | 620 | 1 | Fujian |
4 | Gansu | 195 | 192 | 2 | Gansu |
Because the names are inconsistent between the two data sources (e.g. provinces represented differently in df_china['Province_State']
and provinces_df['NAME"]
), the replace_value_in_column
method is declared below to edit the records and unify the column names.
def replace_value_in_column(data_frame, l_value, r_value, column_name = 'NAME'):
data_frame.loc[data_frame[column_name] == l_value, column_name] = r_value
replace_value_in_column(df_china, 'Guangxi', 'Guangxi Zhuang Autonomous Region')
replace_value_in_column(df_china, 'Inner Mongolia', 'Inner Mongolia Autonomous Region')
replace_value_in_column(df_china, 'Ningxia', 'Ningxia Hui Autonomous Region')
replace_value_in_column(df_china, 'Tibet', 'Tibet Autonomous Region')
Now the two DataFrame objects have got unified column names, we can go ahead to use a single function in Pandas
called merge
as an entry point to perform in-memory standard database join operations (similar to that of relational databases such as SQL), and its syntax is shown here -
pd.merge(left, right, how='inner', on=None, left_on=None, right_on=None,
left_index=False, right_index=False, sort=True)
Note that, the how
argument specifies how to merge (a.k.a. how to determine which keys are to be included in the resulting table). If a key combination does not appear in either the left or the right tables, the values in the joined table will be NA. The table below then shows a summary of the how options and their SQL equivalent names −
Merge Method | SQL Equivalent | Description |
---|---|---|
left | LEFT OUTER JOIN | Use keys from left object |
right | RIGHT OUTER JOIN | Use keys from right object |
outer | FULL OUTER JOIN | Use union of keys |
inner | INNER JOIN | Use intersection of keys |
In this case, we will be calling merge()
with how='inner'
to perform the inner join of the two DataFrame objects on the index field "NAME"
and only to keep the intersection of keys.
cols_2 = ['NAME', 'AREA', 'TOTPOP_CY','SHAPE']
overlap_rows_china = pd.merge(left = provinces_df[cols_2], right = df_china,
how='inner', on = 'NAME')
overlap_rows_china.head()
NAME | AREA | TOTPOP_CY | SHAPE | Province_State | Confirmed | Recovered | Deaths | |
---|---|---|---|---|---|---|---|---|
0 | Anhui | 140139.795987 | 61795934 | {'rings': [[[116.36710167, 34.643320083], [116... | Anhui | 1004 | 998 | 6 |
1 | Beijing | 16535.630276 | 22237467 | {'rings': [[[116.647890001, 41.0513740000001],... | Beijing | 1078 | 1054 | 9 |
2 | Chongqing | 82390.600691 | 30365055 | {'rings': [[[108.50186348, 32.20025444], [108.... | Chongqing | 598 | 591 | 6 |
3 | Fujian | 121789.666833 | 38659398 | {'rings': [[[118.426801682, 28.2953205110001],... | Fujian | 672 | 620 | 1 |
4 | Gansu | 401138.394581 | 26268236 | {'rings': [[[97.1721363070001, 42.793840408000... | Gansu | 195 | 192 | 2 |
cols_2 = ['NAME', 'AREA', 'TOTPOP_CY','SHAPE','Shape__Area', 'Shape__Length']
overlap_rows_china = pd.merge(left = provinces_df[cols_2], right = df_china, how='inner',
on = 'NAME')
overlap_rows_china.head()
NAME | AREA | TOTPOP_CY | SHAPE | Shape__Area | Shape__Length | Province_State | Confirmed | Recovered | Deaths | |
---|---|---|---|---|---|---|---|---|---|---|
0 | Anhui | 140139.795987 | 61795934 | {'rings': [[[116.36710167, 34.643320083], [116... | 13.352904 | 32.965496 | Anhui | 1004 | 998 | 6 |
1 | Beijing | 16535.630276 | 22237467 | {'rings': [[[116.647890001, 41.0513740000001],... | NaN | NaN | Beijing | 1078 | 1054 | 9 |
2 | Chongqing | 82390.600691 | 30365055 | {'rings': [[[108.50186348, 32.20025444], [108.... | 7.708492 | 30.308633 | Chongqing | 598 | 591 | 6 |
3 | Fujian | 121789.666833 | 38659398 | {'rings': [[[118.426801682, 28.2953205110001],... | 10.989105 | 54.634113 | Fujian | 672 | 620 | 1 |
4 | Gansu | 401138.394581 | 26268236 | {'rings': [[[97.1721363070001, 42.793840408000... | 41.085703 | 80.438096 | Gansu | 195 | 192 | 2 |
As shown in overlap_rows_china
, each province/state in China is now merged while retaining the SHAPE
column, and is ready to be rendered as polygons.
Access the reference feature layer of the United States
Next, we need to access the feature service that contains geometry/shape info for all states in the U. S., and merge with the DataFrame depicting COVID-19 statistics.
us_states_item = gis.content.get('99fd67933e754a1181cc755146be21ca')
us_states_item
us_states_flayer = us_states_item.layers[0]
us_states_df = us_states_flayer.query(as_df=True)
us_states_df.columns
Index(['FID', 'STATE_NAME', 'STATE_FIPS', 'SUB_REGION', 'STATE_ABBR', 'POPULATION', 'POP_SQMI', 'POP2010', 'POP10_SQMI', 'WHITE', 'BLACK', 'AMERI_ES', 'ASIAN', 'HAWN_PI', 'HISPANIC', 'OTHER', 'MULT_RACE', 'MALES', 'FEMALES', 'AGE_UNDER5', 'AGE_5_9', 'AGE_10_14', 'AGE_15_19', 'AGE_20_24', 'AGE_25_34', 'AGE_35_44', 'AGE_45_54', 'AGE_55_64', 'AGE_65_74', 'AGE_75_84', 'AGE_85_UP', 'MED_AGE', 'MED_AGE_M', 'MED_AGE_F', 'HOUSEHOLDS', 'AVE_HH_SZ', 'HSEHLD_1_M', 'HSEHLD_1_F', 'MARHH_CHD', 'MARHH_NO_C', 'MHH_CHILD', 'FHH_CHILD', 'FAMILIES', 'AVE_FAM_SZ', 'HSE_UNITS', 'VACANT', 'OWNER_OCC', 'RENTER_OCC', 'NO_FARMS12', 'AVE_SIZE12', 'CROP_ACR12', 'AVE_SALE12', 'SQMI', 'Shape__Area', 'Shape__Length', 'GlobalID', 'SHAPE'], dtype='object')
DataFrame Merging for U.S. Dataset
df_usa = fset_usa.sdf[['Province_State', 'Confirmed', 'Recovered', 'Deaths']]
df_usa = df_usa.assign(STATE_NAME = df_usa["Province_State"])
df_usa.head()
Province_State | Confirmed | Recovered | Deaths | STATE_NAME | |
---|---|---|---|---|---|
0 | Mississippi | 321138 | 0 | 7401 | Mississippi |
1 | Grand Princess | 103 | 0 | 3 | Grand Princess |
2 | Oklahoma | 457571 | 0 | 7384 | Oklahoma |
3 | Delaware | 109682 | 0 | 1694 | Delaware |
4 | Minnesota | 605218 | 0 | 7680 | Minnesota |
cols_4 = ['STATE_NAME','SHAPE']
overlap_rows_usa = pd.merge(left = us_states_df[cols_4], right = df_usa,
how='inner', on = 'STATE_NAME')
overlap_rows_usa.head()
STATE_NAME | SHAPE | Province_State | Confirmed | Recovered | Deaths | |
---|---|---|---|---|---|---|
0 | Alaska | {'rings': [[[-17959594.8053098, 8122953.575198... | Alaska | 71035 | 0 | 374 |
1 | California | {'rings': [[[-13543710.3257494, 4603367.827345... | California | 3816296 | 0 | 63593 |
2 | Hawaii | {'rings': [[[-17819334.303422, 2512026.7784964... | Hawaii | 37647 | 0 | 516 |
3 | Idaho | {'rings': [[[-13027307.5891034, 5415905.134774... | Idaho | 194719 | 0 | 2145 |
4 | Nevada | {'rings': [[[-13263990.1054907, 4637763.931898... | Nevada | 331614 | 0 | 5667 |
Access the reference feature layer of world countries
countries_item = gis.content.get('2b93b06dc0dc4e809d3c8db5cb96ba69')
countries_item
countries_flayer = countries_item.layers[0]
countries_df = countries_flayer.query(as_df=True)
countries_df.columns
Index(['FID', 'COUNTRY', 'ISO', 'COUNTRYAFF', 'AFF_ISO', 'Shape__Area', 'Shape__Length', 'SHAPE'], dtype='object')
DataFrame Merging for global Dataset
The df_global
has listed its Country_Region
column with their current best-known names in English, while the countries_df
uses their currently best-known equivalents, and this difference in naming countries has created a problem for the merge()
operation to understand if the two countries listed in two DataFrame objects are the same. We need to hence run the following cell in order to make country names consistent between the two DataFrames to be merged.
df_global.loc[df_global['Country_Region']=='US', 'Country_Region'] = 'United States'
df_global.loc[df_global['Country_Region']=='Korea, South', 'Country_Region'] = 'South Korea'
df_global.loc[df_global['Country_Region']=='Korea, North', 'Country_Region'] = 'North Korea'
df_global.loc[df_global['Country_Region']=='Russia', 'Country_Region'] = 'Russian Federation'
df_global.loc[df_global['Country_Region']=='Czechia', 'Country_Region'] = 'Czech Republic'
List the top 10 countries with largest numbers
With df_global
ready, we can now sort countries or regions by their numbers of confirmed/recovered/death cases, through usage of groupby()
, and sort_values()
.
# sorted by # of confirmed cases
df_global_sum = df_global.groupby('Country_Region').sum(numeric_only=True)[['Confirmed', 'Recovered', 'Deaths']]
df_global_sum_c = df_global_sum.sort_values(by = ['Confirmed'], ascending = False)
df_global_sum_c.head(10)
Confirmed | Recovered | Deaths | |
---|---|---|---|
Country_Region | |||
United States | 33631532 | 0 | 604030 |
India | 30279331 | 29309607 | 396730 |
Brazil | 18420598 | 16160826 | 513474 |
France | 5832225 | 404395 | 111130 |
Turkey | 5414310 | 5280558 | 49634 |
Russian Federation | 5408744 | 4911752 | 131671 |
United Kingdom | 4771289 | 15724 | 128367 |
Argentina | 4405247 | 4027510 | 92568 |
Italy | 4258069 | 4073435 | 127472 |
Colombia | 4158716 | 3854457 | 104678 |
# sorted by death tolls
df_global_sum_d = df_global_sum.sort_values(by = ['Deaths'], ascending = False)
df_global_sum_d.head(10)
Confirmed | Recovered | Deaths | |
---|---|---|---|
Country_Region | |||
United States | 33631532 | 0 | 604030 |
Brazil | 18420598 | 16160826 | 513474 |
India | 30279331 | 29309607 | 396730 |
Mexico | 2505792 | 1990610 | 232564 |
Peru | 2048115 | 2007201 | 191899 |
Russian Federation | 5408744 | 4911752 | 131671 |
United Kingdom | 4771289 | 15724 | 128367 |
Italy | 4258069 | 4073435 | 127472 |
France | 5832225 | 404395 | 111130 |
Colombia | 4158716 | 3854457 | 104678 |
Joining the COVID-19 stats and world countries DataFrames
world_merged1 = pd.merge(df_global_sum_c, countries_df[['COUNTRY', 'SHAPE']],
left_index=True, right_on='COUNTRY',
how="left")
world_merged1[['COUNTRY', 'Confirmed','Deaths', 'Recovered']].head(10)
COUNTRY | Confirmed | Deaths | Recovered | |
---|---|---|---|---|
154.0 | United States | 33631532 | 604030 | 0 |
193.0 | India | 30279331 | 396730 | 29309607 |
20.0 | Brazil | 18420598 | 513474 | 16160826 |
155.0 | France | 5832225 | 111130 | 404395 |
151.0 | Turkey | 5414310 | 49634 | 5280558 |
246.0 | Russian Federation | 5408744 | 131671 | 4911752 |
89.0 | United Kingdom | 4771289 | 128367 | 15724 |
14.0 | Argentina | 4405247 | 92568 | 4027510 |
170.0 | Italy | 4258069 | 127472 | 4073435 |
37.0 | Colombia | 4158716 | 104678 | 3854457 |
Now, each country/region in world_merged1
is now merged with the SHAPE
column, and is ready to be rendered as polygons.
2. Map the COVID-19 cases in China
Next, let us start visualizing the following scenarios targeting at China:
- Confirmed cases rendered as points, and polygons
- Death cases rendered as points, and polygons
- Recovered cases rendered as points, and polygons
Map the confirmed COVID-19 cases in China
We can either call the add_layer()
function to add the specified layer or item (i.e. the FeatureLayer object created as fl
) to the map widget, and set the visualization options to be using ClassedSizeRenderer
, or plot the derived SeDF on the map view directly with further descriptions such as how to renderer spatial data using symbol and color palette (Here, the SeDF is the derivative of the merged DataFrame, which now contains a SHAPE column that we can use to plot in the Map widget as polygons).
Display confirmed cases in China as points
map1 = gis.map('China', zoomlevel=4)
map1
map1.add_layer(fl, { "type": "FeatureLayer",
"renderer":"ClassedSizeRenderer",
"field_name":"Confirmed"})
map1.zoom = 3
map1.legend = True
Display confirmed cases in China as polygons
map1b = gis.map('China')
map1b
map1b.clear_graphics()
overlap_rows_china.spatial.plot( kind='map', map_widget=map1b,
renderer_type='c', # for class breaks renderer
method='esriClassifyNaturalBreaks', # classification algorithm
class_count=4, # choose the number of classes
col='Confirmed', # numeric column to classify
cmap='inferno', # color map to pick colors from for each class
alpha=0.7 # specify opacity
)
True
map1b.zoom = 4
map1b.legend=True
The Map view above (map1b
) displays the number of confirmed cases per province in Mainland China. Orange polygons refer to provinces with number of confirmed cases in the range of [45423, 68134], and black polygons represent those in the range of [1, 22712].
Also, we can save the MapView object into a Web Map item for the purpose of future references and modifications.
map1b.save({'title':'Confirmed COVID-19 Cases in China',
'snippet':'Map created using Python API showing confirmed COVID-19 cases in China',
'tags':['automation', 'COVID19', 'world health', 'python']})
For example, we can browse the web map in the browser, change its symbology to different color maps in the configuration pane, then visualize it again with different looks here.
map1b_item = gis.content.search('Confirmed COVID-19 Cases in China')[0]
WebMap(map1b_item)
Map the deaths caused by COVID-19 in China
Display death cases in China as points
map2 = gis.map('China', zoomlevel=4)
map2
map2.add_layer(fl, { "type": "FeatureLayer",
"renderer":"ClassedSizeRenderer",
"field_name":"Deaths"})
map2.legend = True
Display death cases in China as polygons
map2b = gis.map('China')
map2b
map2b = gis.map('China')
map2b
map2b.clear_graphics()
overlap_rows_china.spatial.plot( kind='map', map_widget=map2b,
renderer_type='c', # for class breaks renderer
method='esriClassifyNaturalBreaks', # classification algorithm
class_count=4, # choose the number of classes
col='Deaths', # numeric column to classify
cmap='inferno', # color map to pick colors from for each class
alpha=0.7 # specify opacity
)
True
map2b.zoom = 4
map2b.legend = True
Using the same approach, we can then map the number of death cases per province in Mainland China. With legend displayed, map2b
shows us orange polygons refer to provinces with number of death cases in the range of [3008, 4512], and black polygons represent those in the range of [0, 1504].
Similarly, we can create an additional deliverable - the Web Map Item created on the active GIS
- and then browse the web map in the browser, change its symbology to different color maps in the configuration pane, and/or visualize it again with different looks here.
map2b_item = map2b.save({'title':'COVID-19 Death Cases in China',
'snippet':'Map created using Python API showing COVID-19 death cases in China',
'tags':['automation', 'COVID19', 'world health', 'python']})
WebMap(map2b_item)
Map the recovered COVID-19 cases in China
Display the recovered cases in China as points
map3 = gis.map('China', zoomlevel=4)
map3
map3.add_layer(fl, { "type": "FeatureLayer",
"renderer":"ClassedSizeRenderer",
"field_name":"Recovered"})
map3.legend = True
Display the recovered cases in China as polygons
map3b = gis.map('China')
map3b
map3b.clear_graphics()
overlap_rows_china.spatial.plot( kind='map', map_widget=map3b,
renderer_type='c', # for class breaks renderer
method='esriClassifyNaturalBreaks', # classification algorithm
class_count=4, # choose the number of classes
col='Recovered', # numeric column to classify
cmap='inferno', # color map to pick colors from for each class
alpha=0.7 # specify opacity
)
True
map3b.zoom = 4
map3b.legend = True
Same here in map3b
, with legend displayed, we can tell orange polygons refer to provinces with number of recovered cases in the range of [42411, 63616], and black polygons represent those in the range of [1, 21206].
Based on the sets of maps plotted above, the Hubei province has topped all provinces in China no matter when we are looking at the confirmed cases, or the recovered/death tolls.
Again as how we create an additional deliverable - the Web Map Item - on the active GIS in previous two sections, we can then browse the web map in the browser, change its symbology to different color maps in the configuration pane, and/or visualize it again with different looks.
map3b_item = map3b.save({'title':'COVID-19 Recovered Cases in China',
'snippet':'Map created using Python API showing COVID-19 recovered cases in China',
'tags':['automation', 'COVID19', 'world health', 'python']})
WebMap(map3b_item)
3. Map the COVID-19 cases in the U.S.
Similar to how we map the scenarios in China, visualizing these scenarios targeting at the United States can be divided into these sub-sections:
- Confirmed cases rendered as points, and polygons
- Death cases rendered as points, and polygons
Map the confirmed COVID-19 cases in the U. S.
There are two ways to map the COVID-19 cases:
- call the
add_layer()
function to add the specified layer or item (i.e. the FeatureLayer object created asfl
) to the map widget, and set the visualization options to be usingClassedSizeRenderer
, - plot the derived SeDF on the map view directly with further descriptions such as how to renderer spatial data using symbol and color palette (Here, the SeDF is the derivative of the merged DataFrame, which now contains a SHAPE column that we can use to plot in the Map widget as polygons).
We are going to show how the confirmed cases in the U.S. look like in both ways:
Map the confirmed cases in the U.S. as points
map4 = gis.map("US", zoomlevel=4)
map4
map4.add_layer(fl, { "type": "FeatureLayer",
"renderer":"ClassedSizeRenderer",
"field_name":"Confirmed"})
map4.legend = True
Map the confirmed cases in the U.S. as polygons
map4a = gis.map("US")
map4a
map4a.clear_graphics()
overlap_rows_usa.spatial.plot( kind='map', map_widget=map4a,
renderer_type='c', # for class breaks renderer
class_count=4, # choose the number of classes
method='esriClassifyEqualInterval', # classification algorithm
col='Confirmed', # numeric column to classify
cmap='inferno', # color map to pick colors from for each class
alpha=0.7 # specify opacity
)
True
map4a.zoom = 4
map4a.legend = True
Map the COVID-19 deaths in the U. S.
Map the death cases in the U.S. as points
map4b = gis.map("US")
map4b
map4b.add_layer(fl, { "type": "FeatureLayer",
"renderer":"ClassedSizeRenderer",
"field_name":"Deaths"})
map4b.zoom=4
map4b.legend=True
Map the death cases in the U.S. as polygons
map4c = gis.map("US")
map4c
map4c.clear_graphics()
overlap_rows_usa.spatial.plot( kind='map', map_widget=map4c,
renderer_type='c', # for class breaks renderer
class_count=4, # choose the number of classes
method='esriClassifyEqualInterval', # classification algorithm
col='Deaths', # numeric column to classify
cmap='inferno', # color map to pick colors from for each class
alpha=0.7 # specify opacity
)
True
map4c.zoom = 4
map4c.legend = True
4. Map the COVID-19 cases globally
Now we have learned how to map COVID-19 cases in China, and the United States, let's take a look at how to map across the globe. For illustration purpose, the rest of this section will only talk about mapping the confirmed cases globally, yet the workflow is almost the same to map deaths and recovered cases globally.
Map the confirmed COVID-19 cases globally
We are going to walk through how the confirmed cases across the globe can be mapped in the Map widget:
- call the
add_layer()
function to add the specified layer or item (i.e. the FeatureLayer object created asfl
) to the map widget, and set the visualization options to be usingClassedSizeRenderer
, - plot the derived SeDF on the map view directly with further descriptions such as how to renderer spatial data using symbol and color palette (Here, the SeDF is the derivative of the merged DataFrame, which now contains a SHAPE column that we can use to plot in the Map widget as polygons).
Map the global confirmed cases as points
map5a = gis.map("Italy")
map5a
map5a.add_layer(fl, { "type": "FeatureLayer",
"renderer":"ClassedSizeRenderer",
"field_name":"Confirmed"})
map5a.legend=True
map5a.zoom = 4
Map the global confirmed cases as polygons
map5b = gis.map("Italy")
map5b
world_merged1.spatial.plot( kind='map', map_widget=map5b,
renderer_type='c', # for class breaks renderer
method='esriClassifyStandardDeviation', # classification algorithm
class_count=7, # choose the number of classes
col='Confirmed', # numeric column to classify
cmap='inferno', # color map to pick colors from for each class
alpha=0.7 # specify opacity
)
True
map5b.zoom = 4
map5b.legend = True
When map5b
is created centering Brazil, the legend added along shows that yellow polygons (the 2nd class in the colormap) indicate that the countries are of number of confirmed cases ranging within (28026277, 33631532], and purple polygons represent those ranging between (5605255, 11210511].
5. What's next?
This notebook demonstrates accessing the corona viruses statistics, tabularizing them in DataFrames, and finally mapping the outbreak in different regions or counties. Next step, you can refer to Part 2 notebook for analyzing and charting time-series of the 2019 Novel Coronavirus.
References
[1] https://www.cdc.gov/coronavirus/2019-ncov/about/index.html