How to Create Data Maps of the United States With Matplotlib

Hello, and welcome to this tutorial.

Today, I will teach you to create the map that you see above using geo-data and the Social Connectivity Index.

If you want to know more about the visualization and dataset, you can take a look at this article in my new free newsletter, Data Wonder.

Let’s get started with the tutorial.


Step 1: Download data

Before we begin, we need to download a dataset exciting enough for this tutorial and geo-data to draw accurate maps of the United States.

For the maps, I’m using shape files from Cencus.gov. You can use the following links to download both states and counties.

To have a complementary dataset, I’ve selected the Facebook Connectivity Index, which measures the likelihood that two people in different counties are connected on Facebook.

You can download the connectivity data using this link.

Once the downloads have finished, unzip them and put them in a good location. I’m using ./data in the tutorial, but you can do whatever you like.

It should look something like this.

Let’s write some code.


Step 2: Import libraries and prepare Seaborn

The only new library (if you’ve done any of my other Matplotlib Tutorials) is geopandas, which we will use to draw maps.

<span># Import libraries </span>
<span>import</span> <span>numpy</span> <span>as</span> <span>np</span>
<span>import</span> <span>pandas</span> <span>as</span> <span>pd</span>
<span>import</span> <span>seaborn</span> <span>as</span> <span>sns</span>
<span>import</span> <span>geopandas</span> <span>as</span> <span>gpd</span>
<span>import</span> <span>matplotlib.pyplot</span> <span>as</span> <span>plt</span>
<span>from</span> <span>PIL</span> <span>import</span> <span>Image</span>
<span>from</span> <span>matplotlib.patches</span> <span>import</span> <span>Patch</span><span>,</span> <span>Circle</span>
<span># Import libraries </span>
<span>import</span> <span>numpy</span> <span>as</span> <span>np</span>
<span>import</span> <span>pandas</span> <span>as</span> <span>pd</span>
<span>import</span> <span>seaborn</span> <span>as</span> <span>sns</span>
<span>import</span> <span>geopandas</span> <span>as</span> <span>gpd</span>
<span>import</span> <span>matplotlib.pyplot</span> <span>as</span> <span>plt</span>

<span>from</span> <span>PIL</span> <span>import</span> <span>Image</span>
<span>from</span> <span>matplotlib.patches</span> <span>import</span> <span>Patch</span><span>,</span> <span>Circle</span>
# Import libraries import numpy as np import pandas as pd import seaborn as sns import geopandas as gpd import matplotlib.pyplot as plt from PIL import Image from matplotlib.patches import Patch, Circle

Enter fullscreen mode Exit fullscreen mode

Next, let’s define a few features about the style using seaborn.

<span>edge_color</span> <span>=</span> <span>"</span><span>#30011E</span><span>"</span>
<span>background_color</span> <span>=</span> <span>"</span><span>#fafafa</span><span>"</span>
<span>sns</span><span>.</span><span>set_style</span><span>({</span>
<span>"</span><span>font.family</span><span>"</span><span>:</span> <span>"</span><span>serif</span><span>"</span><span>,</span>
<span>"</span><span>figure.facecolor</span><span>"</span><span>:</span> <span>background_color</span><span>,</span>
<span>"</span><span>axes.facecolor</span><span>"</span><span>:</span> <span>background_color</span><span>,</span>
<span>})</span>
<span>edge_color</span> <span>=</span> <span>"</span><span>#30011E</span><span>"</span>
<span>background_color</span> <span>=</span> <span>"</span><span>#fafafa</span><span>"</span>

<span>sns</span><span>.</span><span>set_style</span><span>({</span>
    <span>"</span><span>font.family</span><span>"</span><span>:</span> <span>"</span><span>serif</span><span>"</span><span>,</span>
    <span>"</span><span>figure.facecolor</span><span>"</span><span>:</span> <span>background_color</span><span>,</span>
    <span>"</span><span>axes.facecolor</span><span>"</span><span>:</span> <span>background_color</span><span>,</span>
<span>})</span>
edge_color = "#30011E" background_color = "#fafafa" sns.set_style({ "font.family": "serif", "figure.facecolor": background_color, "axes.facecolor": background_color, })

Enter fullscreen mode Exit fullscreen mode

Now it’s time to learn how to draw a map.


Step 3: Load and prepare geo-data

I use geopandas to load the data and remove “unincorporated territories” such as Guam, Puerto Rico, and American Samoa.

<span># Load and prepare geo-data </span><span>counties</span> <span>=</span> <span>gpd</span><span>.</span><span>read_file</span><span>(</span><span>"</span><span>./data/cb_2018_us_county_500k/</span><span>"</span><span>)</span>
<span>counties</span> <span>=</span> <span>counties</span><span>[</span><span>~</span><span>counties</span><span>.</span><span>STATEFP</span><span>.</span><span>isin</span><span>([</span><span>"</span><span>72</span><span>"</span><span>,</span> <span>"</span><span>69</span><span>"</span><span>,</span> <span>"</span><span>60</span><span>"</span><span>,</span> <span>"</span><span>66</span><span>"</span><span>,</span> <span>"</span><span>78</span><span>"</span><span>])]</span>
<span>counties</span> <span>=</span> <span>counties</span><span>.</span><span>set_index</span><span>(</span><span>"</span><span>GEOID</span><span>"</span><span>)</span>
<span>states</span> <span>=</span> <span>gpd</span><span>.</span><span>read_file</span><span>(</span><span>"</span><span>./data/cb_2018_us_state_500k/</span><span>"</span><span>)</span>
<span>states</span> <span>=</span> <span>states</span><span>[</span><span>~</span><span>states</span><span>.</span><span>STATEFP</span><span>.</span><span>isin</span><span>([</span><span>"</span><span>72</span><span>"</span><span>,</span> <span>"</span><span>69</span><span>"</span><span>,</span> <span>"</span><span>60</span><span>"</span><span>,</span> <span>"</span><span>66</span><span>"</span><span>,</span> <span>"</span><span>78</span><span>"</span><span>])]</span>
<span># Load and prepare geo-data </span><span>counties</span> <span>=</span> <span>gpd</span><span>.</span><span>read_file</span><span>(</span><span>"</span><span>./data/cb_2018_us_county_500k/</span><span>"</span><span>)</span>
<span>counties</span> <span>=</span> <span>counties</span><span>[</span><span>~</span><span>counties</span><span>.</span><span>STATEFP</span><span>.</span><span>isin</span><span>([</span><span>"</span><span>72</span><span>"</span><span>,</span> <span>"</span><span>69</span><span>"</span><span>,</span> <span>"</span><span>60</span><span>"</span><span>,</span> <span>"</span><span>66</span><span>"</span><span>,</span> <span>"</span><span>78</span><span>"</span><span>])]</span>
<span>counties</span> <span>=</span> <span>counties</span><span>.</span><span>set_index</span><span>(</span><span>"</span><span>GEOID</span><span>"</span><span>)</span>

<span>states</span> <span>=</span> <span>gpd</span><span>.</span><span>read_file</span><span>(</span><span>"</span><span>./data/cb_2018_us_state_500k/</span><span>"</span><span>)</span>
<span>states</span> <span>=</span> <span>states</span><span>[</span><span>~</span><span>states</span><span>.</span><span>STATEFP</span><span>.</span><span>isin</span><span>([</span><span>"</span><span>72</span><span>"</span><span>,</span> <span>"</span><span>69</span><span>"</span><span>,</span> <span>"</span><span>60</span><span>"</span><span>,</span> <span>"</span><span>66</span><span>"</span><span>,</span> <span>"</span><span>78</span><span>"</span><span>])]</span>
# Load and prepare geo-data counties = gpd.read_file("./data/cb_2018_us_county_500k/") counties = counties[~counties.STATEFP.isin(["72", "69", "60", "66", "78"])] counties = counties.set_index("GEOID") states = gpd.read_file("./data/cb_2018_us_state_500k/") states = states[~states.STATEFP.isin(["72", "69", "60", "66", "78"])]

Enter fullscreen mode Exit fullscreen mode

A geopandas data frame has a geometry column that defines the shape of each row. It allows us to draw a map by calling counties.plot() or states.plot() like this.

<span>ax</span> <span>=</span> <span>counties</span><span>.</span><span>plot</span><span>(</span><span>edgecolor</span><span>=</span><span>edge_color</span> <span>+</span> <span>"</span><span>55</span><span>"</span><span>,</span> <span>color</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>figsize</span><span>=</span><span>(</span><span>20</span><span>,</span> <span>20</span><span>))</span>
<span>states</span><span>.</span><span>plot</span><span>(</span><span>ax</span><span>=</span><span>ax</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>color</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>linewidth</span><span>=</span><span>1</span><span>)</span>
<span>plt</span><span>.</span><span>axis</span><span>(</span><span>"</span><span>off</span><span>"</span><span>)</span>
<span>plt</span><span>.</span><span>show</span><span>()</span>
<span>ax</span> <span>=</span> <span>counties</span><span>.</span><span>plot</span><span>(</span><span>edgecolor</span><span>=</span><span>edge_color</span> <span>+</span> <span>"</span><span>55</span><span>"</span><span>,</span> <span>color</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>figsize</span><span>=</span><span>(</span><span>20</span><span>,</span> <span>20</span><span>))</span>
<span>states</span><span>.</span><span>plot</span><span>(</span><span>ax</span><span>=</span><span>ax</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>color</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>linewidth</span><span>=</span><span>1</span><span>)</span>

<span>plt</span><span>.</span><span>axis</span><span>(</span><span>"</span><span>off</span><span>"</span><span>)</span>
<span>plt</span><span>.</span><span>show</span><span>()</span>
ax = counties.plot(edgecolor=edge_color + "55", color="None", figsize=(20, 20)) states.plot(ax=ax, edgecolor=edge_color, color="None", linewidth=1) plt.axis("off") plt.show()

Enter fullscreen mode Exit fullscreen mode

Here, I start by drawing the counties with transparent borders, and then I reuse ax when I call states.plot() so that I don’t draw separate maps.

This is what I get.

The results don’t look great, but I will make a few quick adjustments to get us on the right track.

The first adjustment is to change the map projection to one centered on the United States. You can do that with geopandas by calling to_crs().

<span># Load and prepare geo-data </span><span>...</span>
<span>counties</span> <span>=</span> <span>counties</span><span>.</span><span>to_crs</span><span>(</span><span>"</span><span>ESRI:102003</span><span>"</span><span>)</span>
<span>states</span> <span>=</span> <span>states</span><span>.</span><span>to_crs</span><span>(</span><span>"</span><span>ESRI:102003</span><span>"</span><span>)</span>
<span># Load and prepare geo-data </span><span>...</span>

<span>counties</span> <span>=</span> <span>counties</span><span>.</span><span>to_crs</span><span>(</span><span>"</span><span>ESRI:102003</span><span>"</span><span>)</span>
<span>states</span> <span>=</span> <span>states</span><span>.</span><span>to_crs</span><span>(</span><span>"</span><span>ESRI:102003</span><span>"</span><span>)</span>
# Load and prepare geo-data ... counties = counties.to_crs("ESRI:102003") states = states.to_crs("ESRI:102003")

Enter fullscreen mode Exit fullscreen mode

Here’s the difference.

It’s common to draw Alaska and Hawaii underneath the mainland when drawing data maps of the United States, and that’s what we will do as well.

With geopandas, you can translate, scale, and rotate geometries with built-in functions. Here’s a helpful function to do that.

<span>def</span> <span>translate_geometries</span><span>(</span><span>df</span><span>,</span> <span>x</span><span>,</span> <span>y</span><span>,</span> <span>scale</span><span>,</span> <span>rotate</span><span>):</span>
<span>df</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>geometry</span><span>"</span><span>]</span> <span>=</span> <span>df</span><span>.</span><span>geometry</span><span>.</span><span>translate</span><span>(</span><span>yoff</span><span>=</span><span>y</span><span>,</span> <span>xoff</span><span>=</span><span>x</span><span>)</span>
<span>center</span> <span>=</span> <span>df</span><span>.</span><span>dissolve</span><span>().</span><span>centroid</span><span>.</span><span>iloc</span><span>[</span><span>0</span><span>]</span>
<span>df</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>geometry</span><span>"</span><span>]</span> <span>=</span> <span>df</span><span>.</span><span>geometry</span><span>.</span><span>scale</span><span>(</span><span>xfact</span><span>=</span><span>scale</span><span>,</span> <span>yfact</span><span>=</span><span>scale</span><span>,</span> <span>origin</span><span>=</span><span>center</span><span>)</span>
<span>df</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>geometry</span><span>"</span><span>]</span> <span>=</span> <span>df</span><span>.</span><span>geometry</span><span>.</span><span>rotate</span><span>(</span><span>rotate</span><span>,</span> <span>origin</span><span>=</span><span>center</span><span>)</span>
<span>return</span> <span>df</span>
<span>def</span> <span>translate_geometries</span><span>(</span><span>df</span><span>,</span> <span>x</span><span>,</span> <span>y</span><span>,</span> <span>scale</span><span>,</span> <span>rotate</span><span>):</span>
    <span>df</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>geometry</span><span>"</span><span>]</span> <span>=</span> <span>df</span><span>.</span><span>geometry</span><span>.</span><span>translate</span><span>(</span><span>yoff</span><span>=</span><span>y</span><span>,</span> <span>xoff</span><span>=</span><span>x</span><span>)</span>
    <span>center</span> <span>=</span> <span>df</span><span>.</span><span>dissolve</span><span>().</span><span>centroid</span><span>.</span><span>iloc</span><span>[</span><span>0</span><span>]</span>
    <span>df</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>geometry</span><span>"</span><span>]</span> <span>=</span> <span>df</span><span>.</span><span>geometry</span><span>.</span><span>scale</span><span>(</span><span>xfact</span><span>=</span><span>scale</span><span>,</span> <span>yfact</span><span>=</span><span>scale</span><span>,</span> <span>origin</span><span>=</span><span>center</span><span>)</span>
    <span>df</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>geometry</span><span>"</span><span>]</span> <span>=</span> <span>df</span><span>.</span><span>geometry</span><span>.</span><span>rotate</span><span>(</span><span>rotate</span><span>,</span> <span>origin</span><span>=</span><span>center</span><span>)</span>
    <span>return</span> <span>df</span>
def translate_geometries(df, x, y, scale, rotate): df.loc[:, "geometry"] = df.geometry.translate(yoff=y, xoff=x) center = df.dissolve().centroid.iloc[0] df.loc[:, "geometry"] = df.geometry.scale(xfact=scale, yfact=scale, origin=center) df.loc[:, "geometry"] = df.geometry.rotate(rotate, origin=center) return df

Enter fullscreen mode Exit fullscreen mode

I calculate a center point for the entire data frame that defines the origin of rotation and scaling. If I don’t, geopandas does that automatically for each row, which makes the map look completely messed up.

This next function takes our current data frames, separates Hawaii and Alaska, calls translate_geometries() to adjust their geometries, and put them back into new data frames.

<span>def</span> <span>adjust_maps</span><span>(</span><span>df</span><span>):</span>
<span>df_main_land</span> <span>=</span> <span>df</span><span>[</span><span>~</span><span>df</span><span>.</span><span>STATEFP</span><span>.</span><span>isin</span><span>([</span><span>"</span><span>02</span><span>"</span><span>,</span> <span>"</span><span>15</span><span>"</span><span>])]</span>
<span>df_alaska</span> <span>=</span> <span>df</span><span>[</span><span>df</span><span>.</span><span>STATEFP</span> <span>==</span> <span>"</span><span>02</span><span>"</span><span>]</span>
<span>df_hawaii</span> <span>=</span> <span>df</span><span>[</span><span>df</span><span>.</span><span>STATEFP</span> <span>==</span> <span>"</span><span>15</span><span>"</span><span>]</span>
<span>df_alaska</span> <span>=</span> <span>translate_geometries</span><span>(</span><span>df_alaska</span><span>,</span> <span>1300000</span><span>,</span> <span>-</span><span>4900000</span><span>,</span> <span>0.5</span><span>,</span> <span>32</span><span>)</span>
<span>df_hawaii</span> <span>=</span> <span>translate_geometries</span><span>(</span><span>df_hawaii</span><span>,</span> <span>5400000</span><span>,</span> <span>-</span><span>1500000</span><span>,</span> <span>1</span><span>,</span> <span>24</span><span>)</span>
<span>return</span> <span>pd</span><span>.</span><span>concat</span><span>([</span><span>df_main_land</span><span>,</span> <span>df_alaska</span><span>,</span> <span>df_hawaii</span><span>])</span>
<span>def</span> <span>adjust_maps</span><span>(</span><span>df</span><span>):</span>
    <span>df_main_land</span> <span>=</span> <span>df</span><span>[</span><span>~</span><span>df</span><span>.</span><span>STATEFP</span><span>.</span><span>isin</span><span>([</span><span>"</span><span>02</span><span>"</span><span>,</span> <span>"</span><span>15</span><span>"</span><span>])]</span>
    <span>df_alaska</span> <span>=</span> <span>df</span><span>[</span><span>df</span><span>.</span><span>STATEFP</span> <span>==</span> <span>"</span><span>02</span><span>"</span><span>]</span>
    <span>df_hawaii</span> <span>=</span> <span>df</span><span>[</span><span>df</span><span>.</span><span>STATEFP</span> <span>==</span> <span>"</span><span>15</span><span>"</span><span>]</span>

    <span>df_alaska</span> <span>=</span> <span>translate_geometries</span><span>(</span><span>df_alaska</span><span>,</span> <span>1300000</span><span>,</span> <span>-</span><span>4900000</span><span>,</span> <span>0.5</span><span>,</span> <span>32</span><span>)</span>
    <span>df_hawaii</span> <span>=</span> <span>translate_geometries</span><span>(</span><span>df_hawaii</span><span>,</span> <span>5400000</span><span>,</span> <span>-</span><span>1500000</span><span>,</span> <span>1</span><span>,</span> <span>24</span><span>)</span>

    <span>return</span> <span>pd</span><span>.</span><span>concat</span><span>([</span><span>df_main_land</span><span>,</span> <span>df_alaska</span><span>,</span> <span>df_hawaii</span><span>])</span>
def adjust_maps(df): df_main_land = df[~df.STATEFP.isin(["02", "15"])] df_alaska = df[df.STATEFP == "02"] df_hawaii = df[df.STATEFP == "15"] df_alaska = translate_geometries(df_alaska, 1300000, -4900000, 0.5, 32) df_hawaii = translate_geometries(df_hawaii, 5400000, -1500000, 1, 24) return pd.concat([df_main_land, df_alaska, df_hawaii])

Enter fullscreen mode Exit fullscreen mode

We add adjust_maps() to our code.

<span># Load and prepare geo-data </span><span>...</span>
<span>counties</span> <span>=</span> <span>adjust_maps</span><span>(</span><span>counties</span><span>)</span>
<span>states</span> <span>=</span> <span>adjust_maps</span><span>(</span><span>states</span><span>)</span>
<span># Load and prepare geo-data </span><span>...</span>

<span>counties</span> <span>=</span> <span>adjust_maps</span><span>(</span><span>counties</span><span>)</span>
<span>states</span> <span>=</span> <span>adjust_maps</span><span>(</span><span>states</span><span>)</span>
# Load and prepare geo-data ... counties = adjust_maps(counties) states = adjust_maps(states)

Enter fullscreen mode Exit fullscreen mode

And our map now looks like this.

Time for the next step.


Step 4: Adding data

To add data, we start by loading the Facebook connectivity data. I’m turning the user_loc and fr_loc columns into strings and adding leading zeros to make them consistent with the geo data.

<span># Load facebook data </span><span>facebook_df</span> <span>=</span> <span>pd</span><span>.</span><span>read_csv</span><span>(</span><span>"</span><span>./data/county_county.tsv</span><span>"</span><span>,</span> <span>sep</span><span>=</span><span>"</span><span>\t</span><span>"</span><span>)</span>
<span>facebook_df</span><span>.</span><span>user_loc</span> <span>=</span> <span>(</span><span>"</span><span>0</span><span>"</span> <span>+</span> <span>facebook_df</span><span>.</span><span>user_loc</span><span>.</span><span>astype</span><span>(</span><span>str</span><span>)).</span><span>str</span><span>[</span><span>-</span><span>5</span><span>:]</span>
<span>facebook_df</span><span>.</span><span>fr_loc</span> <span>=</span> <span>(</span><span>"</span><span>0</span><span>"</span> <span>+</span> <span>facebook_df</span><span>.</span><span>fr_loc</span><span>.</span><span>astype</span><span>(</span><span>str</span><span>)).</span><span>str</span><span>[</span><span>-</span><span>5</span><span>:]</span>
<span># Load facebook data </span><span>facebook_df</span> <span>=</span> <span>pd</span><span>.</span><span>read_csv</span><span>(</span><span>"</span><span>./data/county_county.tsv</span><span>"</span><span>,</span> <span>sep</span><span>=</span><span>"</span><span>\t</span><span>"</span><span>)</span>
<span>facebook_df</span><span>.</span><span>user_loc</span> <span>=</span> <span>(</span><span>"</span><span>0</span><span>"</span> <span>+</span> <span>facebook_df</span><span>.</span><span>user_loc</span><span>.</span><span>astype</span><span>(</span><span>str</span><span>)).</span><span>str</span><span>[</span><span>-</span><span>5</span><span>:]</span>
<span>facebook_df</span><span>.</span><span>fr_loc</span> <span>=</span> <span>(</span><span>"</span><span>0</span><span>"</span> <span>+</span> <span>facebook_df</span><span>.</span><span>fr_loc</span><span>.</span><span>astype</span><span>(</span><span>str</span><span>)).</span><span>str</span><span>[</span><span>-</span><span>5</span><span>:]</span>
# Load facebook data facebook_df = pd.read_csv("./data/county_county.tsv", sep="\t") facebook_df.user_loc = ("0" + facebook_df.user_loc.astype(str)).str[-5:] facebook_df.fr_loc = ("0" + facebook_df.fr_loc.astype(str)).str[-5:]

Enter fullscreen mode Exit fullscreen mode

The user_loc and fr_loc columns define a county pair, and the third column, scaled_sci, is the value we want to display.

There are 3,227 counties in the dataset, which means there are a total of 10,413,529 pairs, but we will show the connectivity indexes for one county at a time.

<span># Create data map </span><span>county_id</span> <span>=</span> <span>"</span><span>06075</span><span>"</span> <span># San Fransisco </span><span>county_name</span> <span>=</span> <span>counties</span><span>.</span><span>loc</span><span>[</span><span>county_id</span><span>].</span><span>NAME</span>
<span>county_facebook_df</span> <span>=</span> <span>facebook_df</span><span>[</span><span>facebook_df</span><span>.</span><span>user_loc</span> <span>==</span> <span>county_id</span><span>]</span>
<span># Create data map </span><span>county_id</span> <span>=</span> <span>"</span><span>06075</span><span>"</span> <span># San Fransisco </span><span>county_name</span> <span>=</span> <span>counties</span><span>.</span><span>loc</span><span>[</span><span>county_id</span><span>].</span><span>NAME</span>
<span>county_facebook_df</span> <span>=</span> <span>facebook_df</span><span>[</span><span>facebook_df</span><span>.</span><span>user_loc</span> <span>==</span> <span>county_id</span><span>]</span>
# Create data map county_id = "06075" # San Fransisco county_name = counties.loc[county_id].NAME county_facebook_df = facebook_df[facebook_df.user_loc == county_id]

Enter fullscreen mode Exit fullscreen mode

Next, I define a selected_color and data_breaks which contains percentiles, colors, and legend texts for later.

<span># Create data map </span><span>...</span>
<span>selected_color</span> <span>=</span> <span>"</span><span>#FA26A0</span><span>"</span>
<span>data_breaks</span> <span>=</span> <span>[</span>
<span>(</span><span>90</span><span>,</span> <span>"</span><span>#00ffff</span><span>"</span><span>,</span> <span>"</span><span>Top 10%</span><span>"</span><span>),</span>
<span>(</span><span>70</span><span>,</span> <span>"</span><span>#00b5ff</span><span>"</span><span>,</span> <span>"</span><span>90-70%</span><span>"</span><span>),</span>
<span>(</span><span>50</span><span>,</span> <span>"</span><span>#6784ff</span><span>"</span><span>,</span> <span>"</span><span>70-50%</span><span>"</span><span>),</span>
<span>(</span><span>30</span><span>,</span> <span>"</span><span>#aeb3fe</span><span>"</span><span>,</span> <span>"</span><span>50-30%</span><span>"</span><span>),</span>
<span>(</span><span>0</span><span>,</span> <span>"</span><span>#e6e5fc</span><span>"</span><span>,</span> <span>"</span><span>Bottom 30%</span><span>"</span><span>),</span>
<span>]</span>
<span># Create data map </span><span>...</span>

<span>selected_color</span> <span>=</span> <span>"</span><span>#FA26A0</span><span>"</span>
<span>data_breaks</span> <span>=</span> <span>[</span>
    <span>(</span><span>90</span><span>,</span> <span>"</span><span>#00ffff</span><span>"</span><span>,</span> <span>"</span><span>Top 10%</span><span>"</span><span>),</span>
    <span>(</span><span>70</span><span>,</span> <span>"</span><span>#00b5ff</span><span>"</span><span>,</span> <span>"</span><span>90-70%</span><span>"</span><span>),</span>
    <span>(</span><span>50</span><span>,</span> <span>"</span><span>#6784ff</span><span>"</span><span>,</span> <span>"</span><span>70-50%</span><span>"</span><span>),</span>
    <span>(</span><span>30</span><span>,</span> <span>"</span><span>#aeb3fe</span><span>"</span><span>,</span> <span>"</span><span>50-30%</span><span>"</span><span>),</span>
    <span>(</span><span>0</span><span>,</span> <span>"</span><span>#e6e5fc</span><span>"</span><span>,</span> <span>"</span><span>Bottom 30%</span><span>"</span><span>),</span>
<span>]</span>
# Create data map ... selected_color = "#FA26A0" data_breaks = [ (90, "#00ffff", "Top 10%"), (70, "#00b5ff", "90-70%"), (50, "#6784ff", "70-50%"), (30, "#aeb3fe", "50-30%"), (0, "#e6e5fc", "Bottom 30%"), ]

Enter fullscreen mode Exit fullscreen mode

The following function defines the color for each row using a county_df and the data_breaks we just defined.

<span>def</span> <span>create_color</span><span>(</span><span>county_df</span><span>,</span> <span>data_breaks</span><span>):</span>
<span>colors</span> <span>=</span> <span>[]</span>
<span>for</span> <span>i</span><span>,</span> <span>row</span> <span>in</span> <span>county_df</span><span>.</span><span>iterrows</span><span>():</span>
<span>for</span> <span>p</span><span>,</span> <span>c</span><span>,</span> <span>_</span> <span>in</span> <span>data_breaks</span><span>:</span>
<span>if</span> <span>row</span><span>.</span><span>value</span> <span>>=</span> <span>np</span><span>.</span><span>percentile</span><span>(</span><span>county_df</span><span>.</span><span>value</span><span>,</span> <span>p</span><span>):</span>
<span>colors</span><span>.</span><span>append</span><span>(</span><span>c</span><span>)</span>
<span>break</span>
<span>return</span> <span>colors</span>
<span>def</span> <span>create_color</span><span>(</span><span>county_df</span><span>,</span> <span>data_breaks</span><span>):</span>
    <span>colors</span> <span>=</span> <span>[]</span>

    <span>for</span> <span>i</span><span>,</span> <span>row</span> <span>in</span> <span>county_df</span><span>.</span><span>iterrows</span><span>():</span>
        <span>for</span> <span>p</span><span>,</span> <span>c</span><span>,</span> <span>_</span> <span>in</span> <span>data_breaks</span><span>:</span>
            <span>if</span> <span>row</span><span>.</span><span>value</span> <span>>=</span> <span>np</span><span>.</span><span>percentile</span><span>(</span><span>county_df</span><span>.</span><span>value</span><span>,</span> <span>p</span><span>):</span>
                <span>colors</span><span>.</span><span>append</span><span>(</span><span>c</span><span>)</span>
                <span>break</span>

    <span>return</span> <span>colors</span>
def create_color(county_df, data_breaks): colors = [] for i, row in county_df.iterrows(): for p, c, _ in data_breaks: if row.value >= np.percentile(county_df.value, p): colors.append(c) break return colors

Enter fullscreen mode Exit fullscreen mode

We calculate the correct values and add create_color() like this.

<span># Create data map </span><span>...</span>
<span>counties</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>value</span><span>"</span><span>]</span> <span>=</span> <span>(</span><span>county_facebook_df</span><span>.</span><span>set_index</span><span>(</span><span>"</span><span>fr_loc</span><span>"</span><span>).</span><span>scaled_sci</span><span>)</span>
<span>counties</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>value</span><span>"</span><span>]</span> <span>=</span> <span>counties</span><span>[</span><span>"</span><span>value</span><span>"</span><span>].</span><span>fillna</span><span>(</span><span>0</span><span>)</span>
<span>counties</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>color</span><span>"</span><span>]</span> <span>=</span> <span>create_color</span><span>(</span><span>counties</span><span>,</span> <span>data_breaks</span><span>)</span>
<span>counties</span><span>.</span><span>loc</span><span>[</span><span>county_id</span><span>,</span> <span>"</span><span>color</span><span>"</span><span>]</span> <span>=</span> <span>selected_color</span>
<span>ax</span> <span>=</span> <span>counties</span><span>.</span><span>plot</span><span>(</span><span>edgecolor</span><span>=</span><span>edge_color</span> <span>+</span> <span>"</span><span>55</span><span>"</span><span>,</span> <span>color</span><span>=</span><span>counties</span><span>.</span><span>color</span><span>,</span> <span>figsize</span><span>=</span><span>(</span><span>20</span><span>,</span> <span>20</span><span>))</span>
<span>states</span><span>.</span><span>plot</span><span>(</span><span>ax</span><span>=</span><span>ax</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>color</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>linewidth</span><span>=</span><span>1</span><span>)</span>
<span>ax</span><span>.</span><span>set</span><span>(</span><span>xlim</span><span>=</span><span>(</span><span>-</span><span>2600000</span><span>,</span> <span>None</span><span>))</span> <span># Removing some of the padding to the left </span>
<span>plt</span><span>.</span><span>axis</span><span>(</span><span>"</span><span>off</span><span>"</span><span>)</span>
<span>plt</span><span>.</span><span>show</span><span>()</span>
<span># Create data map </span><span>...</span>

<span>counties</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>value</span><span>"</span><span>]</span> <span>=</span> <span>(</span><span>county_facebook_df</span><span>.</span><span>set_index</span><span>(</span><span>"</span><span>fr_loc</span><span>"</span><span>).</span><span>scaled_sci</span><span>)</span>
<span>counties</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>value</span><span>"</span><span>]</span> <span>=</span> <span>counties</span><span>[</span><span>"</span><span>value</span><span>"</span><span>].</span><span>fillna</span><span>(</span><span>0</span><span>)</span>
<span>counties</span><span>.</span><span>loc</span><span>[:,</span> <span>"</span><span>color</span><span>"</span><span>]</span> <span>=</span> <span>create_color</span><span>(</span><span>counties</span><span>,</span> <span>data_breaks</span><span>)</span>
<span>counties</span><span>.</span><span>loc</span><span>[</span><span>county_id</span><span>,</span> <span>"</span><span>color</span><span>"</span><span>]</span> <span>=</span> <span>selected_color</span>

<span>ax</span> <span>=</span> <span>counties</span><span>.</span><span>plot</span><span>(</span><span>edgecolor</span><span>=</span><span>edge_color</span> <span>+</span> <span>"</span><span>55</span><span>"</span><span>,</span> <span>color</span><span>=</span><span>counties</span><span>.</span><span>color</span><span>,</span> <span>figsize</span><span>=</span><span>(</span><span>20</span><span>,</span> <span>20</span><span>))</span>
<span>states</span><span>.</span><span>plot</span><span>(</span><span>ax</span><span>=</span><span>ax</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>color</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>linewidth</span><span>=</span><span>1</span><span>)</span>
<span>ax</span><span>.</span><span>set</span><span>(</span><span>xlim</span><span>=</span><span>(</span><span>-</span><span>2600000</span><span>,</span> <span>None</span><span>))</span> <span># Removing some of the padding to the left </span>
<span>plt</span><span>.</span><span>axis</span><span>(</span><span>"</span><span>off</span><span>"</span><span>)</span>
<span>plt</span><span>.</span><span>show</span><span>()</span>
# Create data map ... counties.loc[:, "value"] = (county_facebook_df.set_index("fr_loc").scaled_sci) counties.loc[:, "value"] = counties["value"].fillna(0) counties.loc[:, "color"] = create_color(counties, data_breaks) counties.loc[county_id, "color"] = selected_color ax = counties.plot(edgecolor=edge_color + "55", color=counties.color, figsize=(20, 20)) states.plot(ax=ax, edgecolor=edge_color, color="None", linewidth=1) ax.set(xlim=(-2600000, None)) # Removing some of the padding to the left plt.axis("off") plt.show()

Enter fullscreen mode Exit fullscreen mode

Here’s what we get.

It looks fantastic, but we need to add some information.


Step 5: Adding information

The first piece of information we need is a title to explain what the data visualization is about.

Here’s a function that does that.

<span>def</span> <span>add_title</span><span>(</span><span>county_id</span><span>,</span> <span>county_name</span><span>):</span>
<span>plt</span><span>.</span><span>annotate</span><span>(</span>
<span>text</span><span>=</span><span>"</span><span>Social Connectedness Ranking Between US Counties and</span><span>"</span><span>,</span>
<span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>1.1</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>16</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span>
<span>)</span>
<span>plt</span><span>.</span><span>annotate</span><span>(</span>
<span>text</span><span>=</span><span>"</span><span>{} (FIPS Code {})</span><span>"</span><span>.</span><span>format</span><span>(</span><span>county_name</span><span>,</span> <span>county_id</span><span>),</span>
<span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>1.03</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>32</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span><span>,</span>
<span>fontweight</span><span>=</span><span>"</span><span>bold</span><span>"</span>
<span>)</span>
<span>def</span> <span>add_title</span><span>(</span><span>county_id</span><span>,</span> <span>county_name</span><span>):</span>
    <span>plt</span><span>.</span><span>annotate</span><span>(</span>
        <span>text</span><span>=</span><span>"</span><span>Social Connectedness Ranking Between US Counties and</span><span>"</span><span>,</span>
        <span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>1.1</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>16</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span>
    <span>)</span>

    <span>plt</span><span>.</span><span>annotate</span><span>(</span>
        <span>text</span><span>=</span><span>"</span><span>{} (FIPS Code {})</span><span>"</span><span>.</span><span>format</span><span>(</span><span>county_name</span><span>,</span> <span>county_id</span><span>),</span> 
        <span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>1.03</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>32</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span><span>,</span>
        <span>fontweight</span><span>=</span><span>"</span><span>bold</span><span>"</span>
    <span>)</span>
def add_title(county_id, county_name): plt.annotate( text="Social Connectedness Ranking Between US Counties and", xy=(0.5, 1.1), xycoords="axes fraction", fontsize=16, ha="center" ) plt.annotate( text="{} (FIPS Code {})".format(county_name, county_id), xy=(0.5, 1.03), xycoords="axes fraction", fontsize=32, ha="center", fontweight="bold" )

Enter fullscreen mode Exit fullscreen mode

Next, we need a legend and supporting information that explains the data since it’s a bit complex.

The function for adding a legend uses the data_breaks and the selected_color to create Patch(es) that we add using plt.legend().

<span>def</span> <span>add_legend</span><span>(</span><span>data_breaks</span><span>,</span> <span>selected_color</span><span>,</span> <span>county_name</span><span>):</span>
<span>patches</span> <span>=</span> <span>[</span><span>Patch</span><span>(</span><span>facecolor</span><span>=</span><span>c</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>label</span><span>=</span><span>t</span><span>)</span> <span>for</span> <span>_</span><span>,</span> <span>c</span><span>,</span> <span>t</span> <span>in</span> <span>data_breaks</span><span>]</span>
<span>patches</span> <span>=</span> <span>[</span><span>Patch</span><span>(</span><span>facecolor</span><span>=</span><span>selected_color</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>label</span><span>=</span><span>county_name</span><span>)]</span> <span>+</span> <span>patches</span>
<span>leg</span> <span>=</span> <span>plt</span><span>.</span><span>legend</span><span>(</span>
<span>handles</span><span>=</span><span>patches</span><span>,</span>
<span>bbox_to_anchor</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>-</span><span>0.03</span><span>),</span> <span>loc</span><span>=</span><span>'</span><span>center</span><span>'</span><span>,</span>
<span>ncol</span><span>=</span><span>10</span><span>,</span> <span>fontsize</span><span>=</span><span>20</span><span>,</span> <span>columnspacing</span><span>=</span><span>1</span><span>,</span>
<span>handlelength</span><span>=</span><span>1</span><span>,</span> <span>handleheight</span><span>=</span><span>1</span><span>,</span>
<span>edgecolor</span><span>=</span><span>background_color</span><span>,</span>
<span>handletextpad</span><span>=</span><span>0.4</span>
<span>)</span>
<span>def</span> <span>add_legend</span><span>(</span><span>data_breaks</span><span>,</span> <span>selected_color</span><span>,</span> <span>county_name</span><span>):</span>
    <span>patches</span> <span>=</span> <span>[</span><span>Patch</span><span>(</span><span>facecolor</span><span>=</span><span>c</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>label</span><span>=</span><span>t</span><span>)</span> <span>for</span> <span>_</span><span>,</span> <span>c</span><span>,</span> <span>t</span> <span>in</span> <span>data_breaks</span><span>]</span>
    <span>patches</span> <span>=</span> <span>[</span><span>Patch</span><span>(</span><span>facecolor</span><span>=</span><span>selected_color</span><span>,</span> <span>edgecolor</span><span>=</span><span>edge_color</span><span>,</span> <span>label</span><span>=</span><span>county_name</span><span>)]</span> <span>+</span> <span>patches</span>

    <span>leg</span> <span>=</span> <span>plt</span><span>.</span><span>legend</span><span>(</span>
        <span>handles</span><span>=</span><span>patches</span><span>,</span>
        <span>bbox_to_anchor</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>-</span><span>0.03</span><span>),</span> <span>loc</span><span>=</span><span>'</span><span>center</span><span>'</span><span>,</span>
        <span>ncol</span><span>=</span><span>10</span><span>,</span> <span>fontsize</span><span>=</span><span>20</span><span>,</span> <span>columnspacing</span><span>=</span><span>1</span><span>,</span>
        <span>handlelength</span><span>=</span><span>1</span><span>,</span> <span>handleheight</span><span>=</span><span>1</span><span>,</span>
        <span>edgecolor</span><span>=</span><span>background_color</span><span>,</span>
        <span>handletextpad</span><span>=</span><span>0.4</span>
    <span>)</span>
def add_legend(data_breaks, selected_color, county_name): patches = [Patch(facecolor=c, edgecolor=edge_color, label=t) for _, c, t in data_breaks] patches = [Patch(facecolor=selected_color, edgecolor=edge_color, label=county_name)] + patches leg = plt.legend( handles=patches, bbox_to_anchor=(0.5, -0.03), loc='center', ncol=10, fontsize=20, columnspacing=1, handlelength=1, handleheight=1, edgecolor=background_color, handletextpad=0.4 )

Enter fullscreen mode Exit fullscreen mode

I also have a simple function to add some additional information below the legend.

<span>def</span> <span>add_information</span><span>():</span>
<span>plt</span><span>.</span><span>annotate</span><span>(</span>
<span>"</span><span>The Facebook Connectivity Index measure the likelyhood that users in different</span><span>\n</span><span>locations are connected on Facebook. The formula divides the number of Facebook</span><span>\n</span><span>connections with the number of possible connections for the two locations.</span><span>"</span><span>,</span>
<span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>-</span><span>0.08</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span><span>,</span> <span>va</span><span>=</span><span>"</span><span>top</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>18</span><span>,</span> <span>linespacing</span><span>=</span><span>1.8</span>
<span>)</span>
<span>plt</span><span>.</span><span>annotate</span><span>(</span>
<span>"</span><span>Source: https://dataforgood.facebook.com/</span><span>"</span><span>,</span>
<span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>-</span><span>0.22</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>16</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span><span>,</span>
<span>fontweight</span><span>=</span><span>"</span><span>bold</span><span>"</span>
<span>)</span>
<span>def</span> <span>add_information</span><span>():</span>
    <span>plt</span><span>.</span><span>annotate</span><span>(</span>
        <span>"</span><span>The Facebook Connectivity Index measure the likelyhood that users in different</span><span>\n</span><span>locations are connected on Facebook. The formula divides the number of Facebook</span><span>\n</span><span>connections with the number of possible connections for the two locations.</span><span>"</span><span>,</span>
        <span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>-</span><span>0.08</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span><span>,</span> <span>va</span><span>=</span><span>"</span><span>top</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>18</span><span>,</span> <span>linespacing</span><span>=</span><span>1.8</span>
    <span>)</span>

    <span>plt</span><span>.</span><span>annotate</span><span>(</span>
        <span>"</span><span>Source: https://dataforgood.facebook.com/</span><span>"</span><span>,</span> 
        <span>xy</span><span>=</span><span>(</span><span>0.5</span><span>,</span> <span>-</span><span>0.22</span><span>),</span> <span>xycoords</span><span>=</span><span>"</span><span>axes fraction</span><span>"</span><span>,</span> <span>fontsize</span><span>=</span><span>16</span><span>,</span> <span>ha</span><span>=</span><span>"</span><span>center</span><span>"</span><span>,</span>
        <span>fontweight</span><span>=</span><span>"</span><span>bold</span><span>"</span>
    <span>)</span>
def add_information(): plt.annotate( "The Facebook Connectivity Index measure the likelyhood that users in different\nlocations are connected on Facebook. The formula divides the number of Facebook\nconnections with the number of possible connections for the two locations.", xy=(0.5, -0.08), xycoords="axes fraction", ha="center", va="top", fontsize=18, linespacing=1.8 ) plt.annotate( "Source: https://dataforgood.facebook.com/", xy=(0.5, -0.22), xycoords="axes fraction", fontsize=16, ha="center", fontweight="bold" )

Enter fullscreen mode Exit fullscreen mode

Lastly, I have the add_circle() function to indicate which county we’re looking at by drawing a circle around it.

<span>def</span> <span>add_circle</span><span>(</span><span>ax</span><span>,</span> <span>counties_df</span><span>,</span> <span>county_id</span><span>):</span>
<span>center</span> <span>=</span> <span>counties_df</span><span>[</span><span>counties_df</span><span>.</span><span>index</span> <span>==</span> <span>county_id</span><span>].</span><span>geometry</span><span>.</span><span>centroid</span><span>.</span><span>iloc</span><span>[</span><span>0</span><span>]</span>
<span>ax</span><span>.</span><span>add_artist</span><span>(</span>
<span>Circle</span><span>(</span>
<span>radius</span><span>=</span><span>100000</span><span>,</span> <span>xy</span><span>=</span><span>(</span><span>center</span><span>.</span><span>x</span><span>,</span> <span>center</span><span>.</span><span>y</span><span>),</span> <span>facecolor</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>edgecolor</span><span>=</span><span>selected_color</span><span>,</span> <span>linewidth</span><span>=</span><span>4</span>
<span>)</span>
<span>)</span>
<span>def</span> <span>add_circle</span><span>(</span><span>ax</span><span>,</span> <span>counties_df</span><span>,</span> <span>county_id</span><span>):</span>
    <span>center</span> <span>=</span> <span>counties_df</span><span>[</span><span>counties_df</span><span>.</span><span>index</span> <span>==</span> <span>county_id</span><span>].</span><span>geometry</span><span>.</span><span>centroid</span><span>.</span><span>iloc</span><span>[</span><span>0</span><span>]</span>
    <span>ax</span><span>.</span><span>add_artist</span><span>(</span>
        <span>Circle</span><span>(</span>
            <span>radius</span><span>=</span><span>100000</span><span>,</span> <span>xy</span><span>=</span><span>(</span><span>center</span><span>.</span><span>x</span><span>,</span> <span>center</span><span>.</span><span>y</span><span>),</span> <span>facecolor</span><span>=</span><span>"</span><span>None</span><span>"</span><span>,</span> <span>edgecolor</span><span>=</span><span>selected_color</span><span>,</span> <span>linewidth</span><span>=</span><span>4</span>
        <span>)</span>
    <span>)</span>
def add_circle(ax, counties_df, county_id): center = counties_df[counties_df.index == county_id].geometry.centroid.iloc[0] ax.add_artist( Circle( radius=100000, xy=(center.x, center.y), facecolor="None", edgecolor=selected_color, linewidth=4 ) )

Enter fullscreen mode Exit fullscreen mode

We add all of them below the rest of the code under the # Create data map comment.

<span># Create data map </span><span>...</span>
<span>add_circle</span><span>(</span><span>ax</span><span>,</span> <span>counties</span><span>,</span> <span>county_id</span><span>)</span>
<span>add_title</span><span>(</span><span>county_id</span><span>,</span> <span>county_name</span><span>)</span>
<span>add_legend</span><span>(</span><span>data_breaks</span><span>,</span> <span>selected_color</span><span>,</span> <span>county_name</span><span>)</span>
<span>add_information</span><span>()</span>
<span>plt</span><span>.</span><span>axis</span><span>(</span><span>"</span><span>off</span><span>"</span><span>)</span>
<span>plt</span><span>.</span><span>show</span><span>()</span>
<span># Create data map </span><span>...</span>

<span>add_circle</span><span>(</span><span>ax</span><span>,</span> <span>counties</span><span>,</span> <span>county_id</span><span>)</span>
<span>add_title</span><span>(</span><span>county_id</span><span>,</span> <span>county_name</span><span>)</span>
<span>add_legend</span><span>(</span><span>data_breaks</span><span>,</span> <span>selected_color</span><span>,</span> <span>county_name</span><span>)</span>
<span>add_information</span><span>()</span>

<span>plt</span><span>.</span><span>axis</span><span>(</span><span>"</span><span>off</span><span>"</span><span>)</span>
<span>plt</span><span>.</span><span>show</span><span>()</span>
# Create data map ... add_circle(ax, counties, county_id) add_title(county_id, county_name) add_legend(data_breaks, selected_color, county_name) add_information() plt.axis("off") plt.show()

Enter fullscreen mode Exit fullscreen mode

Here’s the finished data visualization.

Congratulations, you now know how to create fantastic data maps of the United States in Matplotlib! 🙂


Conclusion

Data maps are fantastic when you want to visualize geographic information in a way that captures the user’s eye.

This time, we worked with the Social Connectedness Index from Facebook, but you can change that to anything else with geographic information.

I hope you enjoyed this tutorial and learned something new.

If you did, make sure to join my newsletter, Data Wonder.

See you next time.

原文链接:How to Create Data Maps of the United States With Matplotlib

© 版权声明
THE END
喜欢就支持一下吧
点赞8 分享
Time always save the best for last.
时间总把最好的人留到最后
评论 抢沙发

请登录后发表评论

    暂无评论内容