Population Demographics data is created by applying a cohort model to historical, published demographics data in order to project that data forward. A cohort refers to a particular segment of the population, such as 29-year-old Asian females. The cohort model works by aging a population forward a year, applying birth, death, and migration rates, and repeating that process for each cohort in each projected year.
Therefore, the steps in the process of creating demographics data are to break out existing data to the desired geographical and cohort granularity; construct the birth, death, and migration rates to be used by the cohort model; apply the cohort model to the data; and finally, adjust the projected data to published rates of growth.
1. Disaggregate Data
The output of the demographics process is historical and projected population counts by tract, race, ethnicity, gender, and single-age. For historical Census data, we have annual single-age demographic data at the state level, tract data available in age-groups, and annual county-level data available in age-groups. We use tract and age breakout processes to get historical population data from the Census down to the tract and single-age level.
It is worth noting that ACS data, although available at the tract level, is only published in 5-year estimates. We elect to use the 5-year tract data to mode the more current Census Bureau data down to the tract level, creating a more recent dataset.
The end result of this step is tract-level demographics data by single-age, race, ethnicity, and gender for all historical years.
2. Create Cohort Model Components
The migration estimation is created by aging the population forward a year, calculating deaths (using CDC data), and re-calculating population. This new population is then compared to published population data for the year from the Census. If the projected population is lower than the published estimate, this indicates out-migration for the region for the year. If the projected population is higher than the published estimate, this indicates in-migration for the region for the year.
This process is repeated for each consecutive pair of historical years for which published data exists. Migration estimates are created for all historical years, and these will be used in the cohort model to estimate migration for future years. To calculate future migration rates, we work through all possible county, race/ethnicity, gender, and single-year age combinations and project future migration rates using linear regression.
First, for each year of historical data, convert the birth/death count from CDC to a rate by dividing the count for a particular demographic cohort by the census population of that particular cohort. Second, given several years of historical rates, project the next year’s rate.
We loop through all possible nodes (i.e., every county, race, gender, single-age combination), calculate the rate, and get the projection. For births, we limit our search to nodes of females age 12 – 50 years (i.e., capable of giving birth). The rate is technically a fertility rate, since we base it on the population of potential mothers instead of the entire population.
After calculating rates for all historical years, we then project the next year’s rate using linear regression. If the projected rate goes below 0, we set the rate to half the most recent year’s rate (i.e., asymptotically approach zero). Similarly, if the projected rate goes above 1 we set the rate to the mid-point between the most recent year’s rate and 1 (i.e., asymptotically approach one).
3. Apply Cohort Model
The cohort model brings together all the preparatory calculations described above. The output of the cohort model is final demographics data.
The cohort model uses a methodology very similar to that employed in the migration estimation process. We age each cohort forward a year, account for deaths, apply in- and out-migration, and add births.
4. Adjust to Published Rates of Growth
Because the Census publishes national demographic projections (50 years out from last published Census), we want to adjust our projections to their results. We do so by making our national growth rate match that of the Census projections. We then proportionally adjust populations at each area child (i.e., state, county, tract) so they sum to the adjusted totals.
It is important to note that the projection adjustment step does not make our totals match that of the census projections. Instead our projected growth rate at the national level, from year to year, will match that of the Census Bureau’s projected growth rates.
Once every tract for each demographic cohort has been adjusted accordingly, we save the output as our final demographic cohort model projection.
The modeled demographics data produced in January 2022 is available for free (download here).
Let us know what specific questions we can help you with (we may even add your question to our knowledge base).
Let us know what specific questions we can help you with (we may even add your question to our knowledge base).