DEFINING SUB-STATE REGIONS

Overview

The Rocky Mountain COVID Data project aims to deliver data visualizations that address public health leaders’ current and future needs for COVID-19 in the Rocky Mountain West. This part of the country is home to nearly 15 million residents and includes sparsely populated rural expanses, dense urban areas, and destinations that attract tourists from around the country and the world. These characteristics, as well as heterogeneous population demographics and health behaviors, can impact the spread of SARS-CoV-2, leading to important differences in the timing and severity of COVID-19 across the Rocky Mountain West.

Our data visualizations are designed to help show differences in the timing and magnitude of severe COVID-19 (measured as COVID-19 hospitalizations), as well as vaccination coverage, across the Rocky Mountain West. To do this, we defined 21 sub-state regions that capture the unique demographics and population dynamics in each region. We produce surveillance outputs for each of these 21 regions (3 or 4 to a state), allowing state, local, and tribal public health officials to access crucial information on the past and current landscape of COVID-19.

We aimed to define regions that grouped together populations that are likely to mix with each other, taking into account boundaries for public health decision-making. To do this, we used mobility data, a partitioning algorithm, and information on current health planning regions in a three-step process as described below.

Continue reading below to see how we created the regions for our work.

Defining the Regions: A Three-Step Process

  • Our goal was to define regions that maximized mobility within clusters and minimized mobility between clusters. We also aimed to reflect some practical constraints when defining regions, such as the need for sufficient population size. We established the following criteria to reflect these goals:

    1. Regions would not cross state lines.

    2. Regions would not subdivide counties.

    3. Each state would have at least two, but up to four regions.

    4. Each region would have a population of at least 100,000.

    5. All counties in a region would be geographically adjacent.

  • Population mobility data were obtained from SafeGraph, a company that aggregates anonymized location data from numerous applications to provide information about places people visit. SafeGraph reports the number of mobile device visits between all census block groups (CBG) by day in the U.S.* We used data from the period from January 01, 2021 to April 16, 2021—a time when mobility was impacted by COVID-19, but to a lesser extent than earlier phases of the pandemic.

    We created mobility informed clusters for each state by taking the daily number of mobile device visits by CBG and aggregating this count up to the county level to determine a daily device count for every origin-destination county pair within each of the six Rocky Mountain West states. This count includes the number of origin county devices that were seen in the destination county, and the number of destination county devices that were seen in the origin county. The higher the device count, the more connected the two counties were deemed to be.

    We wrote a clustering algorithm in R using capabilities from the igraph package to determine counties’ contact with each other. We utilized the usmap package to visualize how counties clustered with each other given certain constraints on cluster numbers, as well as the dendextend package to create dendrograms (which depict all the counties of a state and their connectedness to each other in a large tree-style diagram).

    Cluster map using Utah as an example:

    cluster map

    We used this information to resolve regional configurations for each state that comprised at least two but no more than four regions, choosing the configuration with the maximum number of regions that conformed to our five criteria. Once this configuration was chosen, we created a dendrogram to go with it.

    Utah corresponding dendrogram (below). Red dashed lines indicated the boundaries of the original mobility informed clusters determined by the algorithm. Dendrograms read like a family tree—the longer and more far back the branches between two counties go, the less connected they are.

    dendrogram

    To see the code and data used for this clustering analysis, click here to visit our GitHub page!

    *To enhance privacy, SafeGraph excludes census block group information if fewer than two devices visited an establishment in a month from a given CBG.

  • We compared the preliminary clusters generated in Step 2 to existing public health planning regions for each state. If the mobility informed clusters did not align with these regions, we identified and assessed discordant counties, and investigated the dendrograms to determine whether to keep the counties in their mobility informed clusters or reallocate them to better match with public health planning regions.

    For example, Utah is broken down into the following public health planning regions:

    planning

    The counties in our mobility informed clusters that did not agree with the public health planning regions were Piute, Sevier, and Wayne.

    discordant

    We investigated the dendrogram (shown in step 2) and determined that these three counties were not connected enough with the rest of their mobility informed cluster to require them to remain in the cluster. As a result, we reallocated these counties to better match with the public health planning regions.

    utah final

Final Regional Breakdown

See the below map for the regions we designated for each state. Hover over each county to find out to what region the county belongs.

To zoom in and out, hover over the map and scroll, or press the + or - buttons in the left hand toolbar.

To pull up an individual state, filter from the dropdown menu in the upper right hand corner, or search for a state in the upper left hand search bar.

To return to the default view, click on the home icon in the left hand toolbar, or hit the return to start button in the bottom toolbar. The bottom toolbar also allows you to share, download, and view the map in full screen.

County Breakdown

  • Baca, Bent, Cheyenne, Crowley, Custer, Fremont, Huerfano, Kiowa, Kit Carson, Las Animas, Lincoln, Logan, Morgan, Otero, Phillips, Prowers, Pueblo, Sedgwick, Teller, Washington, Yuma

    Total population approx. 377,000

  • Adams, Arapahoe, Boulder, Broomfield, Clear Creek, Denver, Douglas, Elbert, El Paso, Gilpin, Grand, Jackson, Jefferson, Larimer, Park, Summit, Weld

    Total population approx. 4,820,000

  • Alamosa, Archuleta, Chaffee, Conejos, Costilla, Delta, Dolores, Eagle, Garfield, Hinsdale, La Plata, Mesa, Mineral, Moffat, Montezuma, Montrose, Ouray, Pitkin, Rio Blanco, Rio Grande, Routt, Saguache, San Juan, San Miguel

    Total population approx. 586,000

  • Bannock, Bear Lake, Benewah, Bingham, Bonneville, Butte, Caribou, Clark, Custer, Franklin, Fremont, Jefferson, Lemhi, Madison, Oneida, Power, Teton

    Total population approx. 425,000

  • Bonner, Boundary, Clearwater, Idaho, Kootenai, Latah, Lewis, Nez Perce, Shoshone

    Total population approx. 365,000

  • Blaine, Camas, Cassia, Gooding, Jerome, Lincoln, Minidoka, Twin Falls

    Total population approx. 207,000

  • Ada, Adams, Boise, Canyon, Elmore, Gem, Owyhee, Payette, Valley, Washington

    Total population approx. 851,000

  • Big Horn, Carbon, Carter, Custer, Daniels, Dawson, Fallon, Fergus, Garfield, Golden Valley, Judith Basin, McCone, Musselshell, Petroleum, Phillips, Powder River, Prairie, Richland, Roosevelt, Rosebud, Sheridan, Stillwater, Sweet Grass, Treasure, Valley, Wheatland, Wibaux, Yellowstone

    Total population approx. 303,000

  • Blaine, Cascade, Chouteau, Glacier, Hill, Liberty, Pondera, Teton, Toole

    Total population approx. 146,000

  • Beaverhead, Broadwater, Deer Lodge, Flathead, Gallatin, Granite, Jefferson, Lake, Lewis and Clark, Lincoln, Madison, Meagher, Mineral, Missoula, Park, Powell, Ravalli, Sanders, Silver Bow

    Total population approx. 637,000

  • Colfax, Curry, De Baca, Guadalupe, Harding, Mora, Quay, Roosevelt, San Miguel, Union

    Total population approx. 131,000

  • Los Alamos, Rio Arriba, Santa Fe, Taos

    Total population approx. 249,000

  • Catron, Chaves, Doña Ana, Eddy, Grant, Hidalgo, Lea, Lincoln, Luna, Otero, Sierra

    Total population approx. 583,000

  • Bernalillo, Cibola, McKinley, San Juan, Sandoval, Socorro, Torrance, Valencia

    Total population approx. 1,150,000

  • Carbon, Daggett, Duchesne, Emery, Grand, San Juan, Uintah

    Total population approx. 111,000

  • Box Elder, Cache, David, Morgan, Rich, Weber

    Total population approx. 833,000

  • Beaver, Garfield, Iron, Kane, Washington

    Total population approx. 259,000

  • Juab, Millard, Piute, Salt Lake, Sanpete, Sevier, Summit, Tooele, Utah, Wasatch, Wayne

    Total population approx. 2,080,000

  • Albany, Carbon, Converse, Fremont, Goshen, Laramie, Natrona, Niobrara, Platte

    Total population approx. 309,000

  • Campbell, Crook, Johnson, Sheridan, Weston

    Total population approx. 100,000

  • Big Horn, Hot Springs, Lincoln, Park, Sublette, Sweetwater, Teton, Uinta, Washakie

    Total population approx. 168,000