This page contains release notes for each of Lightcast’s quarterly US dataruns. The release notes outline any major methodology changes or fixes included with each datarun.
Lightcast updates core LMI data four times per year with the latest data available from each dataset. These updates are called Dataruns. These are the target release dates for the next four dataruns:
The Bureau of Labor Statistics’ Quarterly Census of Employment and Wages (QCEW), released quarterly, kicks off a new Lightcast datarun. You can view the QCEW release schedule here (note that Lightcast uses the Full Data Release, not the News Release). As soon as a new quarter of QCEW is released, Lightcast’s Data team downloads it and re-flows all Lightcast data to incorporate all data source updates that happened since the last datarun. This incorporation of new data sources is then released as a new datarun.
To read about updates for postings and profiles data, see this article.
This section lists the changes and updates introduced in the latest datarun.
Summary of OES Changes for May 2021:
OES data will be published using 2018 SOC occupation definitions.
The OES data utilizes QCEW data to predict outcomes of occupational staffing patterns and associated wages. It partitions the target population into two separate population groups: stable responders (people who consistently respond to OES surveys) and nonresponders. The outcomes are modeled separately between the two groups, with weighting favoring stable responders.
Projections are also weighted to favor more recent years, a change from the previous OES methodology that contained an even weighting across all years. The projections methodology also utilizes observed outcomes from stable responders to form predicted outcomes for nonresponders.
Due to the new methodology, wage levels may show differences compared to that of previous updates, especially in years that are being rerun with the new methodology (2015-2019). The BLS states that the new model’s mean hourly wage estimates are almost always higher than the mean hourly wage estimates for detailed occupations under the previous model.
Note: Earlier release notes indicated that 2021 IPEDS data would be released with the 2022.3 datarun. This was a mistake. 2021 IPEDS data will not be released until September, and will be incorporated with the 2022.4 datarun
This section lists major sources and what “vintage” of each source was used in the last four dataruns. Click the hyperlinked name to read more about the source and what role it plays in Lightcast data.
Bolded items indicate an update.
|Name||Abbrev.||Source||Lightcast 2022.3||Lightcast 2022.2||Lightcast 2022.1||Lightcast 2021.4|
|Quarterly Census of Employment and Wages||QCEW||BLS||2021Q4||2021Q3||2021Q2||2021Q1|
|Occupational Employment Statistics||OES||BLS||2021||2020||2020||2020|
|National Ind/Occ Employment Matrix||NIOEM||BLS||2020-2030||2020-2030||2020-2030||2019-2029|
|Employment Projections Tables||EP||BLS||2020-2030||2020-2030||2020-2030||2020-2030|
|Consumer Expenditure Survey||CEX||BLS||2020||2020||2020||2020|
|State Personal Income||SPI||BEA||2020||2020||2020||2019|
|Local Area Personal Income||LPI||BEA||2020||2020||2020||2019|
|Make & Use Tables||MUTs||BEA||2020||2020||2020||2019|
|National Income and Product Accounts||NIPA||BEA||2021Q4||2021Q3||2021Q3||2021Q2|
|Gross Domestic Product by State||GSP||BEA||2021||2021||2020||2020|
|American Community Survey||ACS||Census||2020||2020||2020||2019|
|County Business Patterns||CBP||Census||2020||2019||2019||2019|
|ZIP Code Business Patterns||ZBP||Census||N/A||N/A||N/A||N/A|
|Current Population Survey Tabulations||CPS||Census||2020||2020||2020||2020|
|Sate and Local Finances (Census of Government)||COG||Census||2019||2019||2019||2019|
|Origin-Destination Employment Statistics||LODES||Census||2019||2019||2019||2018|
|Quarterly Workforce Indicators||QWI||Census||2022Q2||2021Q2||2021Q2||2021Q1|
|Railroad Retirement Board||RRB||RRB||2020||2020||2019||2019|
|Occupational Information Network||ONET||US DOL/ETA||26.3||26.2||26.1||26.0|
|Crime By County||FBI||Federal Bureau of Investigation||2020||2020||2020||2020|
|Birth/Mortality Data||CDC||Center for Disease Control||2019||2019||2019||2019|
|Migration Data||Migration||Internal Revenue Service||2019||2019||2019||2019|
|Cost of Living||COLI||C2ER||2021||2021||2021||2021|
|Database USA||DBUSA||DBUSA||March 2022||Sept 2021||Sept 2021||Sept 2021*|
2019* Business patterns created from Database USA for modeling ZIP code and Census Tract data. Business listings displayed in Emsi tools are from the December 2019 DBUSA database snapshot.
|Name||Abbreviation||Publishing Agency||Data Source that Determines Update||Lightcast 2022.3||Lightcast 2022.2||Lightcast 2022.1||Lightcast 2021.4|
|Metropolitan Statistical Areas||MSA||Census Bureau||MSA Delineation Files||March 2020||March 2020||March 2020||March 2020|
|Federal Information Processing Codes||FIPS||NIST||Usually QCEW||2015||2015||2015||2015|
|North American Industry Classification System||NAICS||Census Bureau||QCEW||2017||2017||2017||2017|
|Classification of Instructional Programs||CIP||NCES||IPEDS||2020||2020||2020||2020|
|Standard Occupation Classification||SOC||ONS||LFS||2019*||2019*||2019*||2019*|
|Occupational Information Network||ONET||ETA||ONET||26.3||26.2||26.1||26.0|
* Emsi SOC 2019 is almost identical to the SOC2018-based occupation classification currently used by OES. For more information see Emsi Burning Glass SOC
This section contains release notes for past dataruns.
Unusually high wages
With the 2021 Q3 release of QCEW earnings data, several industries reported unusually high wages in certain areas. For example, Other Health and Personal Care Stores in Middlesex County, NJ reported a 14x increase in wages (roughly 100 million dollars) over the previous quarter. Medical Laboratories in San Mateo, CA saw similar 12x increase in earnings.
Revision of Birth and Death Rates
Birth and death rates feeding into Emsi’s demographic model were revised. Rates have been adjusted to match with observed population age cohorts.
Removal of Educational Attainment Dataset
Primarily due to lack of use, the deprecated EMSI.US.EducationalAttainment.ACS is no longer included in our datasets.
Metric Added to Detailed Industry Data
We have added a ‘TurnoverRate’ metric to the detailed (Age & Gender and Race & Ethnicity) Industry datasets, which gives the rate of separations per job.
Metrics Added to Unemployment Data
We have added four new metrics to all of our Unemployment datasets:
‘Employment‘ : The number of employed people in the area
‘TwelveMonthEmpAverage‘ : The average of the last 12 months’ employment
‘LaborForce‘ : The number of employed and unemployed people in the area
‘UnemploymentRate‘ : Employment/LaborForce
Metric Removed from Staffing Pattern Data
We have removed the ‘Percent’ metric from the Staffing Pattern datasets.
ZIP Code Employment Breakouts Recency
We replaced outdated 2016 ZBP data with 2021Q1 DatabaseUSA (DBUSA) data. Emsi Burning Glass updated ZBP data annually until ZBP began making use of a new Census suppression methodology which suppresses data far more heavily than in the past. ZBP (and now DBUSA) represented the largest of the roughly ten sources that are used for modeling county-level data from QCEW down to ZIP codes.
While nothing about Emsi Burning Glass’s industry data will change at aggregate levels (6-digit county industry data will be identical), this will cause some changes to industry data at the zip level. The business patterns created from DBUSA are less precise that those from ZBP, so our initial model produced roughly 16% more breakouts with smaller values than previously. Consequently, we more aggressively redistribute those small values, which will result in ZIP code employment being more tightly clustered around known data. Because of this, the average of monthly QCEW industry data differs from annual data more than it used to, and year-over-year changes are now often greater than they have been at the ZIP and Census Tract levels. The upshot is that we still strongly recommend using ZIP code data in groups of ZIP codes rather than singling out a single ZIP code. See ZIP-Level Employment Data for more cautions about using ZIP code data.
QCEW Unsuppression Improvement for Current Year
We are no longer using the closest year of CBP to form the initial guesses for unknown values when unsuppressing first quarter QCEW data, but rather are using only same year CBP data. The immediate effect of this change is that 2020 Q1 data is no longer being seeded by 2019 CBP, which should increase accuracy of 2020 Q1 and later QCEW unsuppressions.
MSA Occupational Earnings Now Match OES Where Possible
OES MSA earnings are now being pulled straight through from the raw data to our final data product without modification where possible. Where we are unable to pull OES data through unchanged, either because we are supplementing those earnings with ACS data or because taxonomy changes mean the data points do not have the same definition, we also corrected some logic for combining county occupational earnings to create MSA data to use the correct state minimum wage instead of the national minimum wage.
Occupational Earnings Improvement for Employment Not Covered by OES
OES earnings data provides the bedrock for our occupational earnings, but there are some industries which aren’t covered by the OES, including several agriculture industries. In the past this hasn’t been a major problem, since we can still get earnings data for each occupation from the industries which OES does cover. With this datarun, however, we will begin using ACS earnings data for those industries not covered by OES, and then combining those with our OES occupation earnings. This will provide a more complete picture of occupational earnings, especially for occupations that are mostly found in non-OES industries, such as farmhands. However, in order to continue matching OES earnings where reasonable, if OES covers more than 95% of the employment for an occupation, we do not supplement the earnings for that occupation with ACS data.
Occupational Earnings Unsuppression Improvement
Wherever an occupation’s data is suppressed for an MSA that is located within a single state and the average earnings are disclosed for the state, we now constrain the unsuppressed MSA’s average earnings to be within -20% to +25% of the state’s average earnings. Average earnings are now more consistent between geography levels and are also more consistent with percentile earnings. This change primarily affects area of low employment.
CIP 2020 (Classification of Instructional Programs)
With the 2021.3 datarun, Emsi Burning Glass is updating from CIP 2010 to CIP 2020 for IPEDS completions data and profiles analytics. For more information about CIP codes, see CIP user site
Labor Force Participation Changes
We changed the denominator for labor force participation rates from the population aged 16+ to a total civilian non-institutionalized population aged 16+. The prior denominator included people who should not be included when calculating labor force participation rate, including military, inmates, residents of care facilities, and others.
Read more about the labor force participation rate methodology changes here.
Occupation Data Correction (Self-Employed and Extended Proprietors)
We discovered and resolved mapping inaccuracies from ACS SOC codes to Emsi Burning Glass SOC codes affecting up to 0.7% and 0.9% of Self-Employed and Extended Proprietors classes of worker respectively. This affects products such as occupation employment & earnings, occupation workforce demographics employment, and staffing patterns. While the overall number of jobs remains the same, the distribution of those jobs between SOC codes has changed.
The relative percentages have increased for the following Emsi Burning Glass occupations codes:
13-1198 – Project Management Specialists and Business Operations Specialists, All Other
23-2099 – Legal Support Workers, All Other
25-9021 – Farm and Home Management Educators
25-9031 – Instructional Coordinators
25-9044 – Teaching Assistants, Postsecondary
25-9099 – Educational Instruction and Library Workers, All Other
27-3011 – Broadcast Announcers and Radio Disc Jockeys
27-3092 – Court Reporters and Simultaneous Captioners
27-4011 – Audio and Video Technicians
27-4012 – Broadcast Technicians
27-4014 – Sound Engineering Technicians
27-4098 – Lighting Technicians and Media and Communication Equipment Workers, All Other
33-9098 – School Bus Monitors and Protective Service Workers, All Other
Consequently, the relative percentages decreased for the following Emsi Burning Glass codes:
13-1081 – Logisticians
25-4022 – Librarians and Media Collections Specialists
25-9045 – Teaching Assistants, Except Postsecondary
27-3099 – Media and Communication Workers, All Other
43-2099 – Communications Equipment Operators, All Other
53-6061 – Passenger Attendants
Post-Secondary Teacher Data Improvements
Because of a general lack of quality data for individual post-secondary teacher occupations, Emsi Burning Glass combines all those occupations into one. We are now doing that combination earlier in our chain of processes that produce staffing data, which should result in more stable staffing calculations for post-secondary teachers between future releases. Additionally, we previously disaggregated self-employed and extended proprietors earnings percentiles for post-secondary teachers to those detailed occupations before combining them again later. This pointless transformation, which has now been removed, caused those percentile earnings curves to be artificially flatter than they were when calculated directly from ACS microdata.
Occupation Earnings Improvement
We are now calculating state and national percentile earnings for individual 5-digit occupations that were not disclosed in OES using our occupational earnings aggregation methodology rather than the older less precise jobs-weighted averaging methodology. As a result, percentile wage curves are more reasonable for national and state occupation data.
Population Demographics Improvements
In the previous datarun, Emsi Burning Glass began publishing single-age population breakouts using a new cohort model. Some improvements have been made to that model, including:
When breaking out five-year age groups into single ages, it was previously assumed that the breakouts of each age group were the same across all areas within a state. However, college towns (for example) should have a higher share of people of ages 18 and 19 than the rest of the state. Emsi Burning Glass now uses county-level census data for certain age groups to better approximate age breakouts.
We now use logarithmic curve fitting to better project birth, death, and migration rates. Previously we had used a linear regression to project these values.
OES National Staffing Pattern Back-Projections
Emsi Burning Glass is now back-projecting OES national staffing patterns from 2005 to 2001 before regionalizing them and combining them with historical industry data to create historical regional staffing patterns and occupation data. This should better reflect the trajectory of staffing pattern changes looking back in time from current patterns. This change slightly affects 2001-2004 staffing and occupation data.
IRS Migration Unsuppression
Migration flow data is sometimes suppressed by the IRS. Previously, that suppressed data was distributed based on migration patterns from an ACS-based population migration dataset. We now use unsuppression methods to distribute those flows more correctly, using the ACS migration dataset as initial seeds for the unsuppression. The overall result is that the migration flows will be more consistent with the totals published by the IRS and less likely to be erroneous because of differences in methodology/coverage between the IRS and ACS migration datasets.
Occupation Diversity Group Updated
The diversity groups from C2ER changed, which will affect occupation diversity reports in Emsi Burning Glass’s tools.
Correction: Military Occupation Percentiles
Earnings percentiles for military-only occupations (55-9999) in the non_QCEW class of worker have been incorrectly based off of only the latest earnings year, even though the data was available for all earnings years. This release corrects that error.
Census Tract Industry and Occupation Data is Final
After making some final alterations to the methodology, Emsi Burning Glass is removing the experimental status of census tract industry and occupation data. For more information on the methodology, see this article.
Population Demographics New Cohort Model
At long last, Emsi Burning Glass has replaced the cohort model that projects US population counts by cohort. With better source data and more robust methodology, Emsi Burning Glass is able to more accurately project the demographic makeup of the population of the United States. For a more detailed description of Emsi Burning Glass’s new cohort model methodology, see this article. The largest methodological differences are listed below:
Emsi Burning Glass now runs the cohort model at the census tract level and then sums census tracts to aggregate up to counties, states, and nation. Previously, Emsi Burning Glass used the latest year of historical tract data to disaggregate county data to the tract level.
Emsi Burning Glass now uses local birth and death data from the CDC to calculate local birth and death rates, as well as local net in- or out-migration rates. Previously, Emsi Burning Glass used county cohort population data to regionalize national birth and death rates from the CDC.
Emsi Burning Glass now uses linear regression to estimate the birth, death, and migration rates used by the cohort model. Previously, Emsi Burning Glass used an average of all historical rates, which largely ignored local birth, death, and migration trends for individual cohorts.
Emsi Burning Glass now uses published state-level population counts by single-age from the Census Bureau’s Population Estimates program to estimate population counts by single-age in addition to their county-level population counts by age-group. Previously, Emsi did not incorporate this data, but because a cohort model requires single-age data to operate, Emsi was estimating single-age data from the age-group data using naive breakouts and multiple rounds of smoothing algorithms.
Because of the improvements in methodology, Emsi Burning Glass is also able to publish demographic data for single ages in addition to the previously available five-year age groups.
Occupational Earnings Percentiles Minor Improvement
When calculating earnings for occupations with small employment in metropolitan divisions in older years of the OES occupational wages timeseries, we now use earnings percentiles from the parent Metropolitan Statistical Area (MSA) for that occupation, rather than using national earnings for that occupation.
State Minimum Wage Data Source Change
Emsi Burning Glass now uses the Bureau of Labor Statistics (BLS) Occupational Employment Statistics (OES) program as the source for our state-level minimum wage data, instead of the Department of Labor (DOL). The primary reason for the change is that Emsi Burning Glass’s minimum wage data is used for unsuppressing OES occupational wages, and the DOL’s minimum wage data was sometimes inconsistent with that used by the BLS.
Correction: Emsi Burning Glass Population Projections Adjustments
Emsi Burning Glass has communicated in the past that we adjust our population demographics projections to the published national projections from the Census Bureau. While reviewing and improving the code, we realized that we have been inadvertently publishing the pre-adjustment data. This release fixes that bug.
Changes to the “Non-QCEW Employees” Class of Worker
In an effort to continuously improve our core LMI industry data, this quarter we revisited and improved the processes that create Emsi Burning Glass industry data for employees not included in the Quarterly Census of Employment and Wages (QCEW). The Bureau of Economic Analysis (BEA) releases employment estimates as part of the State and Local Area Personal Income (SPI/LPI) programs that include these “non-covered” employees. The majority of Emsi Burning Glass’s non-QCEW employment estimates are calculated by subtracting QCEW employment from total SPI employment. We reviewed the SPI methodology documentation and identified ways to improve our methodology to more accurately take into account the differences in coverage between SPI/LPI and QCEW. In summary, Emsi Burning Glass’s methodology changes result in:
Changes to QCEW source data
QCEW recently announced a change to the methodology they use to estimate employment and earnings for non-responding establishments. Under the current methodology, QCEW takes the missing establishment’s growth rate from this same time last year and applies that to the previous month. They’ve begun to rethink this, especially in light of COVID, and have decided to begin calculating missing establishments based on the growth rates of similar establishments within the same time frame.
Beyond this, they’ve also mentioned two other improvements they’d like to make: (1) implementing a methodology change that will immediately identify employers who have ceased operations and (2) use benefit claim counts as a supplement to existing manual data review.
Population Educational Attainment
Emsi Burning Glass improved the model that creates educational attainment data for the population aged 25 and above. This is done by breaking out final Emsi Burning Glass population demographics data by seven educational attainment categories using breakouts from 2009-2019 5yr ACS and 2000 Census data.
Emsi Burning Glass publishes data from the BLS Employment Projections (EP) program for certain national occupation characteristics. EP is now using the same occupation definition as OES, which transitioned to 2018-based occupation codes earlier this year. This update makes a few national occupation datapoints more accurate/relevant to Emsi Burning Glass SOC:
We stopped using national tabulations from the Current Population Survey (CPS) for unemployment breakouts in favor of searching for the nearest good breakout from the DOL’s Characteristics of the Insured Unemployed (CIU) monthly state unemployment data.
Occupation by Residence Employment
Emsi Burning Glass uses a ZIP Code to Census Tract mapping from Housing and Urban Development to create ZIP Code level occupation by place of residence employment. We improved our usage of the mapping which improved the accuracy of occupation by place of residence data by more closely mapping published data from LODES.
Staffing and Occupation Data Tuning
We tuned our expansion and unsuppression algorithms for OES staffing and occupation data to produce fewer unreasonable data points, and cluster employment and earnings data around more likely places.
New Datasets in Core LMI API (API Only)
We’ve added a few new datasets to the Core LMI API. See the changelog for more information.
Correction: Historical Regional Occupation data for regionalizing Historical OES Staffing Patterns
In a previously released methodology document, Emsi Burning Glass claimed to have begun using historical regional OES occupation data to regionalize historical national OES staffing patterns. This was incorrect – Emsi Burning Glass had begun using historical national OES staffing patterns with the 2018.3 release, but did not begin using historical regional OES occupation data to regionalize those staffing patterns until the 2020.4 release. As a result, users should expect to see more consistency between published occupation data from OES and Emsi Burning Glass’s staffing and occupation data.
Changes to the “QCEW Employees” Class of Worker
In an effort to continually improve our core LMI industry data, this quarter we revisited and improved the processes that create Emsi Burning Glass industry data based on the Quarterly Census of Employment and Wages. In summary, the improvements will be visible as more reasonable earnings estimates, fewer unreasonably small employment numbers (less than one job), and fewer jobs and earnings arbitrarily added where QCEW published zeros for certain establishments. Due to these changes, users should not expect historical data to match previous Emsi Burning Glass dataruns except where that historical data was completely disclosed by QCEW. There may also be slight differences between published annual data from QCEW and Emsi Burning Glass’s employment estimates. The specifics:
ZIP Breakouts for State and Local Government Higher Education
We improved our usage of staff employment data from IPEDS to better break out county data to ZIP codes for state and local government higher education.
Experimental Census Tract Industry and Occupation Data
We created experimental industry and occupation datasets at the Census Tract level to provide better geographical granularity. The methodology is still undergoing improvements and the data has not yet been included in our tools, but the data is available via API and datapulls. If you would like advanced access to help us validate the data, please contact your customer service representative.
Accuracy of the Social Accounting Matrix
We fixed a bug in the Commodity Flow Survey unsuppression and improved accuracy in the gravity flows model of Emsi Burning Glass’s Social Accounting Matrix (SAM) Input-Output model.
Emsi Burning Glass’s occupation classification (SOC) is being updated to match the reduced level of detail available in the 2019 release of OES. This reduction in detail serves as preparation for OES adopting the official 2018 SOC (to be released spring 2022 with OES 2021 data). For more information on the timeline for the SOC 2018 transition, see Emsi Burning Glass’s SOC transition article. For more information on how the SOC 2018 transition affects 2019 OES data, see Question 10 of OES’s FAQ.
In addition, because of funding constraints, the OES has been reducing their survey sample size, which will reduce the accuracy of the data, but prove the need for Emsi Burning Glass’s modeling to make detailed estimates available.
“The OES sample has been reduced in recent survey panels. The May 2019 OES survey panel had a sample of approximately 183,000 establishments. The November 2017, May 2018, and November 2018 survey panels each had a sample of approximately 186,000 establishments. The May 2017 panel sample consisted of approximately 195,000 establishments, and the November 2016 panel sample consisted of approximately 202,000 establishments” (see the end of “Technical Notes for May 2019 OES Estimates”, from BLS).
In order to match BLS published separation rates to the new aggregate occupation codes in Emsi Burning Glass SOC 2019, we sum total employment and total separations for the aggregated occupations and recalculate the rate. Prior to this data release, separation rates for aggregate occupations were calculated by averaging the detailed rates. Separation rates are used in Emsi Burning Glass’s calculation of openings. For reference, Openings = Growth + Separations.
New Micropolitan Statistical Area
Emsi Burning Glass updated to the Census Bureau’s March 2020 MSA delineations. This update includes only one change, a new micropolitan statistical area: Bluffton Indiana (MSA 14160), which consists of Wells County Indiana (FIPS 18179).
Reduced Volatility in Historical Occupation Earnings
We improved the methodology for estimating suppressed occupational employment and earnings. This improvement targeted areas of low employment and volatile earnings. Previously, we used disclosed data from prior and subsequent years to form an initial estimate (seed) for suppressed values, and then allowed our algorithms to adjust those seeds to match known disclosed data. The constraints imposed by the disclosed data were not enough to keep the seeds from changing drastically and causing volatile earnings across the time series. We resolved this by adjusting the seeds to match the known national occupation change, then imposing tighter constraints on those particular seeds so our algorithms could not adjust them more than ten percent.
In short, we started with better guesses and constrained our models to stay within ten percent of that original guess. The effect of this change is a noticeable reduction in volatility for occupational earnings associated with low-confidence data points.
ZIP Breakout for State and Local Government
We fixed a bug in our ZIP code methodology that was inadvertently using total employment instead of state and local government employment to break out the state and local government industries (902999, 903999) from counties to ZIP codes.
Extensive changes to our Extended Proprietor class of worker
We extensively reviewed and revised our methodology for our Extended Proprietors class of worker, improving our unsuppression of BEA proprietors employment and earnings by using newer algorithms and by making better use of NES and QCEW to reduce the volatility present in the data and isolate earnings from income. Users of Extended Proprietors data may notice differences in how earnings are distributed between industries and areas. Accordingly, users of the Social Accounting Matrix (SAM) and Input-Output (I-O) model may notice some changes to industry multipliers.
New Datasets in Core LMI API (API Only)
IPEDS Institution Updates (API Only)
We added indicators to to IPEDS institution definitions for Historically Black Colleges/Universities (HBCU) and Hispanic Serving Institutions (HSI). The goal is to add these indicators to reports in our Analyst tools; however, at the time of this datarun release, they are only available in the API.
We fixed a bug in how we calculated growth for occupational employment. Growth is part of the calculation for openings, so openings were also affected. Previously, if employment was zero for a year and greater than zero for the following year, growth was incorrectly calculated to be zero. We’ve corrected the calculation so growth now equals the employment in the following year. Nationwide for all occupations, the amount of growth we were previously dropping was between 6,000 and 16,000 jobs for historical years and between 500 and 3,000 for projected years. The maximum percentage error was just under 0.6%.
QCEW Supplemental Earnings
We improved our process for estimating supplemental earnings for QCEW industry data to reduce the volatility in areas where our BEA source data was suppressed. The new methodology relies more heavily on the parent geography’s supplemental earnings data and thus will more closely match the source data.
Veterans Population Bug (API only)
We fixed a bug in 2009 veterans population data that was overstating the veteran population by roughly 5x. We pull veterans population counts from the Census Bureau’s ACS API, and failed to notice that the variable definitions changed between 2009 and 2010. Only latest year (currently 2018) veterans population data is currently shown in our tools, so only API users may notice a change.
Geography Constraint Changes in Core LMI API (API only)
We’ve simplified the interface for accessing different levels of geography. For example, instead of querying a separate endpoint – Emsi.US.Industry.ZIP – for ZIP code industry data, users can simply prepend “ZIP” to geography constraints sent to Emsi.US.Industry. MSAs are also available by prepending “MSA” to geography constraint. For more details and a full list of datasets that now support this simplified interface, see https://api.emsidata.com/updates/core-lmi-changelog.
We neglected to include 2012 IPEDS distance completions data when we originally began publishing this data in 2013. Thanks to an observant client pointing this out, we now include 2012 data in our distance completions dataset.
Extended Proprietors Updates
We made some minor modifications to the methodology used to estimate our Extended Proprietors (Class of Worker 4) data to improve its quality and reliability. The immediate effect of the changes is minor: industry earnings per job increased by 0.2%
IRS Migration Data Improvements
The underlying data behind our population migration data is based on IRS tax exemptions, which do not always represent a person. Following a recommendation from the IRS, we now multiply the number of exemptions by 0.9 to better estimate the actual number of people migrating between counties.
Occupation Data Changes (API only)
We’re working hard to unify related datasets to simplify the API user experience. Part of that effort is visible in this release. API users can now access ZIP code and MSA data directly from the Emsi.US.Occupation endpoint by prefixing ZIP codes with “ZIP” and MSA codes with “MSA”. Nation, state, and county FIPS codes do not need to be prefixed and will work exactly as they did before (and don’t worry, we didn’t get rid of Emsi.US.Occupation.ZIP yet).
State Minimum Wage Bug Fix
The minimum wage data that informs our occupational earnings models formerly contained the state minimum wage even if it was lower than the mandated federal minimum wage. We’ve modified the minimum wage for such states so that they now match the federal minimum wage. The increased minimum wage will be the lower bound for earnings percentiles in the lowest earning occupations.
We made some improvements to our staffing pattern process to improve the loading speed of staffing pattern reports. This change will result in extremely small employment numbers clustering more tightly around disclosed data and being less widely distributed (e.g. there will be more occupations with three jobs and fewer occupations with one job).
Improved ACS Methodology
For certain records, ACS cannot determine a specific industry to classify a job as, so they place these indeterminate records in less-detailed “catchall” categories. Emsi Burning Glass now distributes these values among the industries represented by the catchall, instead of dropping these records. This will result in 50k-80k jobs, depending on the year, being added to self-employed.
Improved Earnings Estimations
Emsi Burning Glass has reduced the volatility of earnings in occupation data for areas with small employment. Year over year data should be more continuous and less erratic as a result.
Occupation Hires and Separations Improvements
Previous to this release, Emsi Burning Glass estimated occupation hires and separations using a combination of industry data from QWI and Emsi Burning Glass employment staffing patterns, implicitly assuming that hires and separations rates within an industry were the same across all occupations. Emsi Burning Glass’s new methodology estimates occupation hires and separations by combining QWI industry hires and separations with national occupation separation rates from the BLS and regional staffing pattern growth and decline. This change will not take effect immediately in Emsi Burning Glass’s tools, but will be released and messaged sometime later in the quarter.
In order to improve the quality and stability of their estimates, the BLS reduced the level of geography detail available in the 2018 release of Occupational Employment Statistics (OES):
Consolidation of some nonmetropolitan areas. Some nonmetropolitan areas published in the May 2017 estimates have been combined to form larger nonmetropolitan areas. The May 2018 estimates contain data for 134 nonmetropolitan areas, compared with 167 nonmetropolitan areas in the May 2017 estimates.
Elimination of metropolitan division data. OES no longer publishes data for the metropolitan divisions within the 11 large metropolitan areas that are further broken down into divisions. Data for these 11 areas are available at the Metropolitan Statistical Area (MSA) only.
This change does not affect the level of geography detail available to Emsi Burning Glass clients, though detailed regions that fall within the same consolidated OES region may look more similar to one another.
Historical Occupational Wage Estimates
Emsi Burning Glass is now publishing historical occupational wage estimates back to 2005. Current year occupational wage estimates are more stable and accurate as a result. This data is currently accessible only via Emsi Burning Glass’s LMI data API, but will appear in Emsi Burning Glass’s tools in the near future.
Self Employed Earnings
Emsi Burning Glass sources self-employed earnings from the American Community Survey public use microdata. Over the past year, Emsi Burning Glass conducted a complete methodology review of ACS usage and decided to make these changes:
Five-year sample window: Because ACS is a survey with limited sample size, it is standard practice to combine five years of data to improve coverage and stability. When Emsi Burning Glass began using ACS data, ACS three-year tabulations were the standard, and five-year tabulations were relatively new; consequently, Emsi Burning Glass created an ad hoc combination of years (1, 3, 5, 9) to increase sample size. Not seeing any advantage to this ad hoc methodology over the standard, Emsi Burning Glass now follows the example of official ACS tabulations and uses a five-year window. Total self-employed employment is roughly identical to what it was previously, but how those jobs are distributed across industries and occupations will be minorly affected. On average, staffing patterns have 5% more occupations than before.
Improved earnings outlier removal: Another drawback to the limited sample size of surveys is that outliers can disproportionately affect the quality of estimates, especially in granular tabulations. In this release, Emsi Burning Glass improved the methodology for ACS earnings outlier removal to prevent earnings from being skewed low (because of older years of data) and to allow removal of low-earning outliers (only high outliers could be detected in Emsi Burning Glass’s previous methodology). The end result is that average self-employed industry and occupation earnings are on average 30% higher.
Improved occupation earnings percentiles estimation: Prior to this release, Emsi Burning Glass estimated percentiles after tabulating ACS microdata, assuming that it was valid to aggregate weighted percentiles. In 2017, Emsi Burning Glass stopped aggregating weighted earnings percentiles elsewhere in US data, but neglected to fix the initial creation of self-employed earnings percentiles. Consequently, these percentiles were artificially flat (skewed toward the average). The new methodology correctly builds percentiles from the original microdata, resulting in more reasonable percentile curves.
Earnings for Military-Only Occupation
See the changes to Self Employed Earnings; Emsi Burning Glass’s earnings for the military-only occupation come from ACS. On average, earnings are 5% higher and percentile curves are no longer skewed toward the average.
Local Absorption in the Input-Output Model
Emsi Burning Glass adjusted the methodology in the input-output model that regulates local absorption of goods produced in region.
We discovered a bug in our process that estimates gross regional product for years where the detailed regional estimates in BEA’s GDP by state lag behind national estimates from the BEA’s National Income and Product Accounts (NIPA). When regional data lags behind national data, we fill in the missing regional detail using previous-year regional data and current NIPA data. Previously, Emsi Burning Glass worked under the assumption that Gross Domestic Income (GDI) was an alternative way to calculate GDP and should be equal to it, so we used GDI data from NIPA to bring forward the regional estimates. However, GDI is not equivalent to GDP in BEA estimates, despite the definition of the term, so we were incorrectly using GDI when we should have been using GDP. With this release, Emsi Burning Glass is now correctly using GDP data to bring forward the regional GDP estimates. This bug caused the most recent two years of GDP to be approximately 2% higher than they should have been at all levels of detail. This also affected the US SAM Input-Output model’s induced multipliers.
Military Employment Bug
We discovered a minor bug in the process that estimates non-QCEW employment for federal government military. Military employment data was missing from some counties affected by the county redefinitions in data prior to 2009.
Government Hires and Separations Bug
We discovered a minor bug in the process that estimates hires and separations (sourced from QWI) for industry and occupation workforce demographics. Hires and separations were erroneously zero for government industries and related occupations in a few state-year combinations where QWI lacked sufficient data. We now (correctly) use ratios from less detailed data to fill in those holes. This data is only visible to API clients and users of the hires portion of the job-postings report in Analyst.
Incorporated Self-Employed and Unemployed Employees from ACS
We discovered an error in how we use the Incorporated Self-Employed (ISE) class of worker from American Community Survey (ACS) data. ISE represents individuals who are employees of their own corporations, and thus are included in QCEW (Emsi Burning Glass CoW 1). In some cases, Emsi Burning Glass was previously treating these incorporated self-employed incorporated individuals as CoW 3 instead of CoW 1. Emsi Burning Glass’s self-employed class of worker (CoW 3) now does not include these individuals. See below for a summary of the effects on final data.
We also began making use of data from ACS respondents that were unemployed at the time the survey was collected, but had worked as an employee within the last year. Similar to how annualized QCEW jobs are reported, we down-weight unemployed ACS records based on the number of weeks the person worked (e.g. – a person who works three months of the year is counted as a quarter of a job). These new data feed into:
Workforce demographics (detailed industry and occupation data): Emsi Burning Glass uses ACS data to supplement and help unsuppress Quarterly Workforce Indicators (QWI) data, which is in turn combined with final Emsi Burning Glass industry and occupation data to estimate workforce demographic breakouts (e.g. – “We know how many plumbers work in X region, how many of them are white males?”). Previously, we were estimating a small portion of the demographic breakouts, where estimates were not available from QWI, using the wrong subsection of the US economy; the breakouts for CoW 1 and 2 mistakenly did not make use of ISE and the breakouts for CoW 3 and 4 mistakenly did make use of ISE. However, because the primary data source for demographic breakouts is QWI not ACS, the effects on final data are minor.
Staffing Patterns for QCEW and Non-QCEW (Emsi Burning Glass CoW 1 and 2): ACS data is used to create staffing patterns for industries that are not included in OES staffing patterns. These ACS staffing patterns were mistakenly created from all employee records from ACS except ISE and the currently unemployed. Again, since ACS is not the primary dataset from which Emsi Burning Glass creates staffing patterns, the effects of the bug are limited in scope to only industries that are missing from OES: Agriculture (NAICS 111000, 112000, 113110, 113210, 115310, 114111, 114112, 114119, 114210) and Private Households (NAICS 814110). The differences in the ACS-based staffing patterns will be minor, since most employees were correctly included. Note that this also minorly affects Emsi Burning Glass’s final occupation data which is calculated from industry and staffing data.
In September 2018, the Census Bureau released new delineation files for Micro- and Metropolitan Statistical Areas. The 138 changes are fairly extensive and affect most of the major metropolitan areas in the United States. While county definitions did not change, 56 counties are now included in MSAs that were not previously, 42 counties have been removed from MSAs, and 40 counties have moved to new or different MSAs. If you have previously viewed data at the MSA level and are noticing large changes between current and previous data for your MSA, this is likely the cause. For more detail, contact your customer service representative.
Occupational Employment Statistics
Emsi Burning Glass continues to do research and preparation for creating a time series out of OES data. The objectives are to provide more accurate historical occupation employment estimates and begin producing historical occupation earnings. The research is still ongoing on both fronts, but various changes have been implemented to lay the groundwork for these future improvements. First, we tightened controls on occupational employment estimates, so they are now clustered more tightly around disclosed OES data. Second, we improved the algorithms that estimate top-coded earnings (earnings that OES suppresses because they are above $100/hour). Third, we began applying our occupational earnings aggregation methodology to the earnings unsuppression process. On the whole, the accuracy of unsuppressed earnings estimates did not change, but percentile wage curves are more reasonable and much more statistically defensible.
Changes in QCEW in South Carolina
QCEW is the foundation of Emsi Burning Glass’s employment and earnings data. The BLS has reported that QCEW data in South Carolina “are showing unusual movements which may be a result of a change in reporting.” For more information, see: https://www.bls.gov/cew/2018-notice-regarding-south-carolina-employment-and-wages-data.htm
New Benchmark Data from the BEA
For the past five or more years, the Social Accounting Matrix (SAM) and Input-Output (I-O) data have had at their core the 2007 benchmark make and use tables from the BEA. This year, the BEA published 2012 benchmark files, and Emsi Burning Glass incorporated them alongside the previously existing 2007 benchmark files. Because this update accounts for five years of data updates, we expect to see substantial changes throughout Emsi Burning Glass’s SAM and I-O data and models.
Emsi Burning Glass’s occupation classification (SOC) is being updated to match the reduced level of detail available in the 2017 release of OES. See https://www.bls.gov/oes/changes_2017.htm for a summary of the changes. Where the BLS aggregated detailed occupations to the broader “parent” occupation, Emsi Burning Glass copied the parent code’s data down to a new Emsi Burning Glass-specific detailed code ending with an “8”, so clients can still browse occupation data by detail level. Data will no longer be available for the old detailed SOC codes. For more details or if you have questions, please contact your customer service representative.
Historical Staffing Patterns
Emsi Burning Glass is now using historical staffing patterns based on historical Occupational Employment Statistics (BLS) data. All QCEW and Non-QCEW historical staffing and occupation data is affected. OES advises against using historical OES data as a time series for reasons here: https://www.bls.gov/oes/oes_ques.htm#qf1. Emsi Burning Glass took those factors into account and made a significant effort to smooth the volatility between historical OES releases. In the near future, we hope to extend the use of historical OES data beyond national staffing patterns to regional employment and earnings.
Military Staffing Pattern
Due to lack of good, detailed military occupation data, Emsi Burning Glass previously lumped all military employment (NonQCEW, industry 901200) into a single occupation (55-9999). In response to various requests from clients, we now use a national staffing pattern from the American Community Survey (Census) to break out military employment between military-specific occupations (55-9999) and occupations that are shared with civilians (eg. pilots, mechanics). In 2016, approximately 44% of military employment was in military-specific occupations and 56% was in occupations shared with civilians. This change more closely reflects the design of the official SOC classification and also clears up an equivocation on the definition of “military” that was causing some confusion about earnings (Emsi Burning Glass was using earnings for military-specific occupations for all of military employment).
Non-QCEW (Emsi Burning Glass Class of Worker 2) Employment
We discovered a long-standing bug in one of our Non-QCEW Employees processes that was underestimating employment in a handful of industries:
Users should notice a 2x or 3x (depending on the industry) increase in Non-QCEW employment and total earnings for these industries and their corresponding occupations (via the staffing pattern). Occupations in SOC 41 will be the most affected by the increase in employment.
Local Absorption in the I-O model
Emsi Burning Glass uses data from the Census Bureau’s Commodity Flow Survey (CFS) supplemented with proprietary data to create industry-specific transportation impedance estimates. We recently discovered that CFS is too general to create accurate impedance estimates for some industries, so we opted for using only the proprietary data in those cases. Consequently, users should expect to see changes in local absorption (demand satisfied in region) for the following industry sectors: 211, 213, 425, 44-45 (except 454), 4542, 45439, 48-49 (except 493), 51 (except 511), 551111, 551112. Retail industries, particularly, will have much higher local absorption.
Emsi Burning Glass uses industry commuting data from Census’s Longitudinal Origin-Destination Employment Statistics (LODES) dataset to create occupation data by place of residence. Previously we made use of the job type JT02 (private jobs) and assumed no commuting for government jobs, so that the place of work and place of residence were the same. Emsi Burning Glass now uses JT00 (all jobs) as well as JT02 to create commuting patterns for government jobs. This also improves the accuracy of the regional Social Accounting Matrix.
This release uses the latest MSA delineations from August 2017 as provided by the Census Bureau (previous releases used the July 2015 delineations). The only change was that Twin Falls, ID was changed from a micropolitan statistical area to a metropolitan statistical area.
Input-Output Transaction (Z) Matrix
Emsi Burning Glass now defines national inter-industry transactions as significant only if they sum to more than $500K. This matches the BEA’s functional definition, since they round their transaction matrix to the nearest $1M. Emsi Burning Glass previously had no minimum value on national inter-industry transactions, allowing scores of insignificant transactions to clutter and add noise to the matrix. This change will have minimal visible affects, since the primary goal is to remove insignificant transactions and reduce noise in industry supply chains.
Emsi Burning Glass switched from using NAICS 2012 to NAICS 2017. Summary of the changes:
Occupational Information Network (O*NET) Version 22.0
With this release we updated from O*NET version 21.3 to version 22.0, to include updated Knowledge, Skill, and Abilities for 100 occupations. This will slightly impact the compatibility indices of occupation pairs.
Non-QCEW (Class of Worker 2) Earnings
We fixed a bug in the code that was causing earnings to be skewed low. In addition to normal data changes, earnings in this class of worker are approximately 1.5% higher than previous releases.
ZIP Code Population Demographics
We improved the usage of the probability matrices used to disaggregate county-level population demographics to ZIP codes, resulting in more accurate estimates for approximately 25% of the nation. Rural areas are the most improved.
ZIP Code Industry Employment Estimates
We improved our estimation of ZIP code industry employment with a more nuanced use of ZBP (ZIP Code Business Patterns, Census). Previously, if a county’s industry was not found in ZBP, we would default to using an USPS business delivery statistics to distribute that industry’s employment to all ZIP codes in the county. With the new methodology, if a county’s industry isn’t found in ZBP, we search for similar industries in ZBP before defaulting to USPS data. The use of USPS dropped from 20% to 0.5%, significantly increasing the accuracy of ZIP code employment estimates.
This change affects all ZIP code employment data (industry and occupation) and does not target any specific counties or industries. However, the most significant changes will be in rural areas and in regions where an industry’s employment is small. Changes will appear as a more focused distribution of data within the ZIP codes in each county.
Local Absorption in Emsi Burning Glass’s Social Accounting Matrix (I-O)
We slightly changed the constraints on the SAM’s bi-proportional algorithm to further maximize local absorption. This fixes some anomalies uncovered in the 2017.3 release where local absorption was too low.
Unemployment data has been brought up to date and the methodology adjusted to reflect current conditions more closely. The new methodology averages the past 12 months of unemployment to form a figure free from seasonal volatility. This change has been introduced with the 2017.4 data release; because of this, clients should expect to see differences in unemployment between the 2017.4 data release and prior releases.
We changed and simplified our methodology for estimating unemployment by 2-digit industry and occupation to more accurately reflect the data from the US Department of Labor’s Characteristics of the Insured Unemployed and rely less heavily on survey-based unemployment data from the Current Population Survey. This change allows us to publish monthly unemployment estimates, which may be available soon; in the meantime, we will publish only the latest month of data. Note that the new monthly data may be less stable when compared to our previous methodology, which was annualized.
We changed how we calculate both replacements and growth. For details, see https://kb.economicmodeling.com/openings-changes-set-for-2017
QCEW uses a BOS county code to track employment where no county was reported. To make Emsi Burning Glass’s ZIP code and county data more comparable, Emsi Burning Glass is introducing BOS ZIP codes that match the BOS county codes. Where “SS” is the state FIPS code, BOS county codes are formatted as “SS999” and BOS ZIP codes are formatted as “001SS.” As with BOS county codes, BOS ZIP codes will represent all data for a state that was not able to be tagged with a valid ZIP code. They will appear in ZIP industry, occupation, and demographics data.
Estimates Derived from Quarterly Workforce Indicators (QWI)
We fixed a bug in our methodology for estimating industry and occupation employment by demographics that caused the wrong year of national-average QWI data to be used as an input when an entire state was missing from QWI. Wyoming employment breakouts were affected for Emsi Burning Glass’s 2016.3-2017.1 releases.
Occupation by Place of Residence Data
Emsi Burning Glass’s 2017.1 release was the first to include occupation by place of residence data. One of the known limitations with that initial release was the absence of data in “999” or balance-of-state counties. This only affected state and national sums, and given that balance-of-state counties account for only a fraction of each state’s total employment and that most users will not be using place of residence data at those geography levels, the missing data should not have caused any major problems. Regardless, in Emsi Burning Glass’s 2017.2 release, the data for 999 counties are included and the state and national totals by place of residence should better reflect reality.
Occupational Earnings Percentiles
State and national occupational earnings percentiles for employee classes of worker (1 and 2) are no longer based on job-weighted aggregations of county estimates. Instead, earnings percentiles at the state and national levels are preserved from OES. The result is more accurate earnings percentiles at the state and national levels. Average hourly earnings are unaffected.
The data is now for 2015-2016.
Occupation by Place of Residence Data
We now have occupational employment data by place of residence derived from Emsi Burning Glass industry and staffing data and Census LODES data.
ZipCode Industry and Occupation Data
We improved our method of breaking out Emsi Burning Glass county-level data to the ZipCode level, making better use of USPS Delivery Statistics and Census’ ZipCode Business Patterns (ZBP) and adding and making use of the following datasets:
Earnings for Self-Employed and Extended Proprietors
We improved our methodology for estimating Self-Employed and Extended Proprietors earnings, improving accuracy in both industry and occupation data. While some differences may be noticeable, most will not exceed the normal variability between releases.
Employment for Extended Proprietors
We improved our methodology for producing employment estimates for Extended Proprietors, slightly lowering employment counts across the nation. In certain circumstances, our previous methodology double-counted a small number of Self-Employed workers as Extended Proprietors.
We reduced the level of detail we accept from state-published industry projections due to their varying quality and currency, effectively weighting Emsi Burning Glass and BLS industry projections more heavily.
Current Employment Statistics
Due to recent improvements in QCEW and Emsi Burning Glass release schedules, we no longer use CES to inform our current year industry employment estimates.