Following the methods described in Baynes, J., Neale, A., & Hultgren, T. (2022), EPA developed the Intelligent Dasymetric Mapping (IDM) toolbox for ArcGIS Pro and a corresponding toolbox that uses open source GIS packages.
The US Census Bureau aggregates population counts into various units (e.g., blocks, block groups, tracts) that are bounded by both visible features such as roads and streams, as well as invisible boundaries such as county or state limits. While this aggregation is practical for the purposes of the Census, it is difficult to determine how many individuals within a census block live near roads, in floodplains, or other potential risk exposures using these boundaries. This is particularly true in sparsely populated areas with large census blocks.
Dasymetric mapping is a geospatial technique that uses information such as land cover to distribute population counts within selected boundaries like census blocks. EPA researchers use dasymetric population data to calculate the distribution of ecosystem services, model viewsheds, estimate resource use, and measure exposure potential.
The image on the left shows a map of the population by block group based solely on the census data. The image on the right shows the dasymetric population allocation for several block groups in Tampa, Fla.IDM uses a representative population density for each ancillary class (e.g., land cover type) to distribute population counts from source units (e.g., census blocks). The representative population density of an ancillary class is the number of people per grid cell that are expected to reside in the ancillary class throughout the study area. IDM uses three methods to determine the representative population density of an ancillary class.
- Preset density - A preset density is a representative population density for an ancillary class that is determined by the user and provided in a configuration file. Any class and density value can be set using this method, but it is commonly used to identify uninhabited ancillary classes (e.g., open water as 0).
- Sampling - A sampled density is a representative population density for an ancillary class that is determined by collecting representative source units of the ancillary class. A representative source unit for an ancillary class is a source unit that is 1) of sufficient size 2) sufficiently composed of the ancillary class (i.e., homogenous). Finally, there must be 3) a sufficient number of representative source units for an ancillary class to be sampled. These three parameters are adjustable by the user in the IDM toolbox.
- Intelligent Areal Weighting - Representative population densities that are not preset or sampled are determined using intelligent areal weighting (IAW).
Finally, to ensure that the population estimated for a source unit is equal to the original population count of the source unit, a distribution ratio is applied.
NOTE: Source units that are too small or irregularly shaped to be represented in a raster matching the resolution of the ancillary raster are ignored by the IDM toolbox. For the EnviroAtlas dasymetric model, population counts in these blocks were identified and merged into neighboring blocks as a preprocessing step.
NOTE: Simple area weighting is used for population counts within a source unit made up entirely of ancillary classes with representative population densities estimated at or preset to zero.
For more information on the dasymetric data created for EnviroAtlas, see our website, factsheet, or journal article.
The IDM Toolbox was developed for ArcGIS Pro 2.2.4. It has been tested on versions up to 3.2
The Spatial Analyst or 3D Analyst extensions are required.
Use of the IDM toolbox may require increasing the maximum number of unique values the 'Combine Tool' can produce.
You can increase this number by changing a setting in ArcGIS Pro. On the Project tab, select Options and select the Raster and Imagery choice. In the dialog box, select the Raster Dataset choice and enter an appropriate value for the Maximum number of unique values to display.
https://pro.arcgis.com/en/pro-app/tool-reference/spatial-analyst/combine.htm
The IDM Toolbox for ArcGIS Pro can be added to a new or existing ArcGIS Pro project.
The user can set a population density for any ancillary class using their own domain knowledge by modifying the 'config.json' file in the toolbox's root directory. Any class and density value can be provided using this method, but it is commonly used to identify uninhabited ancillary classes (e.g., open water as 0).
The preset densities for the following ancillary classes from the National Land Cover Database (NLCD) are set to 0 people per pixel:
- 11, Open Water
- 12, Perennial Ice/Snow
- 95, Emergent Herbaceous Wetlands
The config.json file is a data dictionary with ancillary raster value as the key (string data type) and preset density as the value (float / integer data type).
For example:
{ "11":0, "12":0, "95":0, "0":0 }
Parameter | Description | Data Type |
---|---|---|
Population Features | The source units with population counts and a unique identifier to be converted to a raster. Common source units are census polygons. | Feature Class |
Population Count Field | The field in Population Features that stores the population counts. | Field |
Population Key Field | The field in the Population Features that stores the unique identifier for the source unit. | Field
|
Ancillary Raster | The ancillary raster is a categorical raster dataset with classes that will be used to redistribute population. Land-use or land-cover are frequently used ancillary datasets, but any dataset that has classes of relatively homogenous population density could be used here. NOTE: The cell size and spatial reference from the ancillary raster are used for all output rasters from this tool. |
Raster Dataset |
Uninhabited File (optional) | An optional feature class containing polygons where no population is expected to reside. The ancillary raster values that overlay the areas within the provided feature class are reclassified as an uninhabited ancillary class (i.e., class "0") and the representative population density for this class is preset to 0 people per pixel. | Feature Class |
Minimum Sample | This is the minimum number of representative source units required for an ancillary class to be considered sampled. A source unit is considered representative of an ancillary class if the ancillary class makes up a certain percent of the source unit’s populated area. Any class that is not preset or sufficiently sampled will be assigned a density using the IAW method. The default is 3. |
Long |
Minimum Sampling Area | The minimum size of a representative source unit (in number of ancillary raster cells). Any source unit with a populated area smaller than this parameter will not be considered a representative source unit. Increasing this number may eliminate small areas that might be outliers that could skew the class average. The default is 1. |
Long |
Percent | A source unit is considered representative of an ancillary class if the ancillary class makes up a certain percent of the source unit’s populated area. This parameter is the minimum percent area of a source unit that a single ancillary class must cover for that source unit to be a representative source unit. This value is in decimal notation. The default is 0.95. |
Double |
Output Directory | The directory where all outputs from the tool will be saved. NOTE: Must be a folder location, geodatabases are not supported with this tool. |
Workspace |
Filename | Description |
---|---|
DensityRaster.tif | The final population density raster for the study area. |
DasyRaster.tif | The dasymetric raster represents the spatial intersection of the population source units and the ancillary raster. Each value represents a unique combination of source unit and ancillary raster. These are also known as 'target units'. |
PopRaster.tif | The population features provided by the user are converted to a raster using the population key field as the values of the raster. |
uninhab_landcover.tif | If the user provides an optional uninhabited file, this raster will be provided in the output directory. This is a copy of the ancillary raster where areas covered by the uninhabited areas are classified as an uninhabited ancillary class (i.e., class 0). |
PopTable.dbf | The population working table consists of the following information for each source unit in the population features
|
DasyWorkTable.dbf | The dasymetric working table for each target unit:
|
SamplingSummaryTable.dbf | Information about how the representative population density for each ancillary class was determined.
|
Example data are provided.
Parameter | Value |
---|---|
Population Features | 2010_blocks_DE.shp |
Population Count Field | POP10 |
Population Key Field | polyID |
Ancillary Raster | nlcd_2011_DE.tif |
Uninhabited File | uninhab_DE.shp |
Minimum Sample | 3 |
Minimum Sampling Area | 1 |
Percent | 0.95 |
Output Directory |
U.S. Environmental Protection Agency
Office of Research and Development
Durham, NC 27709
https://ecomments.epa.gov/enviroatlas/
The Intelligent Dasymetric Toolbox for ArcGIS Pro was developed for EnviroAtlas. EnviroAtlas is a collaborative effort led by U.S. EPA that provides geospatial data, easy-to-use tools, and other resources related to ecosystem services, their stressors, and human health.
The dasymetric toolbox was updated for ArcGIS Pro in January 2020 by Anam Khan2 and Jeremy Baynes3 and described in Baynes, et al., 20221. This release introduced optional functionality to mask known uninhabited areas.
The toolbox was originally developed for ArcMap 10 by Torrin Hultgren4 following the methods of Mennis and Hultgren (2006)5.
1 Baynes, J., Neale, A., & Hultgren, T. (2022). Improving intelligent dasymetric mapping population density estimates at 30 m resolution for the conterminous United States by excluding uninhabited areas. Earth system science data, 14(6), 2833.
2 Oak Ridge Associated Universities, National Student Services Contractor at the U.S. EPA
3 U.S. EPA Office of Research and Development
4 National Geospatial Support Team at U.S. EPA
5 Mennis, Jeremy & Hultgren, Torrin. (2006). Intelligent Dasymetric Mapping and Its Application to Areal Interpolation. Cartography and Geographic Information Science. 33. 179-194.
MIT License
Copyright (c) 2024 U.S. Federal Government (in countries where recognized)
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
The United States Environmental Protection Agency (EPA) GitHub project code is provided on an "as is" basis and the user assumes responsibility for its use. EPA has relinquished control of the information and no longer has responsibility to protect the integrity, confidentiality, or availability of the information. Any reference to specific commercial products, processes, or services by service mark, trademark, manufacturer, or otherwise, does not constitute or imply their endorsement, recommendation or favoring by EPA. The EPA seal and logo shall not be used in any manner to imply endorsement of any commercial product or activity by EPA or the United States Government.