Emsi Burning Glass has made two improvements in how we use the BLS’s Occupational Employment Statistics (OES) dataset. These improvements allow us to drastically improve the quality of occupation employment and earnings data that we deliver to our clients.
First, Emsi Burning Glass now offers historical occupation earnings data back to 2005. Because the BLS advises against combining multiple years of OES to create a time series (a multi-year dataset) of employment or earnings, Emsi Burning Glass has only ever provided earnings from the latest year of OES available. Emsi Burning Glass has always provided historical occupation employment counts as a time series because our employment counts are based on the BLS’s QCEW dataset, which is a time series by definition. In the past, we used OES’s latest staffing patterns and regional employment estimates, back-projected for older years, to transform QCEW-based industry job counts into occupation job counts.
Second, we now use historical OES staffing patterns and regional employment data to transform industry data, rather than just using back-projected, current-year OES.
Both improvements are possible because Emsi Burning Glass has created a time series of OES data. The remainder of this overview will briefly explain the process of creating a time series of OES data, and will address the BLS’s cautions regarding using OES as a time series.
Emsi Burning Glass rewrote the entire occupation data pipeline, creating a time series of OES data to allow us to utilize historical years of OES to improve occupation job counts and earnings and to provide historical occupation earnings.
OES data is first imported and cleaned. Because OES is not designed as a time-series, some preliminary work is necessary, such as ensuring that all years’ file formats are the same, and removing duplicate data and certain mid-level aggregations that can’t be used.
The data is then prepared for unsuppression in the seeding process. The seeding process provides estimate values (“seeds”) for suppressed data points. Seeds come from other years of OES and from QCEW.
Once the data has been seeded, it is unsuppressed, meaning that the seed values (not disclosed values) are allowed to change until each suppressed data point has an estimate that allows the whole dataset to sum properly in accord with all known (disclosed) data (e.g. all data points for occupation employment in an MSA sum to total employment for the MSA).
Earnings data are similarly seeded (with OES data from neighboring years) and unsuppressed using techniques specific to earnings unsuppression.
After employment and earnings have been unsuppressed to produce individual years of internally cohesive data, the unsuppressed OES years are then combined to create a time series. To create a time series, outliers caused by hidden methodology shifts in OES data between years must be smoothed, and earnings must be re-calculated to take into account classification shifts (e.g. switch from SOC 2010 to 2018).
The BLS cautions users against treating OES as a time series for several reasons. Emsi Burning Glass’s OES time series process fully accounts for the BLS’s concerns.
Emsi Burning Glass expects this methodology to lead to more stable occupation employment and earnings data over time. Volatility inherent in the OES data will be muted although not eliminated. Volatility, where still present, is mostly seen in geographical regions with low employment and a large number of suppressed data points. Improving employment and earnings stability for such regions is a current area of research for Emsi Burning Glass.
A detailed methodology document is available for download here.
Let us know what specific questions we can help you with (we may even add your question to our knowledge base).
Let us know what specific questions we can help you with (we may even add your question to our knowledge base).