The American Community Survey (ACS) is a nationwide survey designed to provide communities a fresh look at how they are changing. It is a critical element in the Census Bureau’s decennial census program. The ACS collects information such as age, race, income, commute time to work, home value, veteran status, and other important data. As with the 2010 decennial census, information about individuals remains confidential.
The ACS collects and produces population and housing information every year instead of every ten years. Collecting data every year provides more up-to-date information throughout the decade about the U.S. population at the local community level. About 3.5 million housing unit addresses are selected annually, across every county in the nation.
Emsi’s Self-Employed Class of Worker includes all people who consider self-employment a significant part of their income and/or taking a significant part of their time. Emsi largely bases job counts, hourly earnings, and projections for these unincorporated self-employed jobs on responses to the American Community Survey (with additional input from other sources).
Emsi’s Extended Proprietors Class of Worker represents jobs that generate miscellaneous labor income, such as very small self-employment income and partnerships with many partners having limited involvement. Emsi derives job counts and hourly earnings for Extended Proprietors from differences between ACS and other proprietor counts, the latter of which are based on tax returns and other data compiled by the Bureau of Economic Analysis as well as local personal income reports.
Emsi also uses ACS to construct our demographic data for the Non-QCEW Employees, Self-Employed, and Extended Proprietors Classes of Worker.
In addition, Emsi uses ACS to build national staffing patterns for the Self-Employed and Extended Proprietor Classes, and for the Employee Classes for some industries that the BLS’s OES dataset does not cover.
For more information visit the ACS website.
A standard numerical code for a post-secondary course of study, developed and defined by the U.S. Department of Education’s National Center for Education Statistics. The classification of instructional programs provides a taxonomic scheme that supports the accurate tracking and reporting of fields of study and program completions activity.
Emsi compares the SOC code of a profile’s most recent job to a custom CIP-SOC mapping to determine whether the job is in or out of the field of study indicated by the CIP code your institution provided.
The emp_wageAgeAdjMax column shows the median wage for each profile’s SOC code in the county that best describes the profile’s location, adjusted for the individual’s age and highest level of education completed at your institution. Occupation earnings data comes from the BLS’s OES dataset, adjusted to take into account QCEW and ACS.
Age-adjusted Median Wage estimates current annual earnings using the median wage for the graduate’s most recent occupation and county and adjusting for age and degree level using the Mincer function.
When selected, the Highest Award checkbox will limit report results to one award per student. This enables school wide analysis by headcount and is selected by default. Deselect the checkbox to analyze all members of a group if it may include students who earned other awards at your institution.
See this article for more.
When selected, the Job Started After Grad Year checkbox will limit report results to profiles whose most recent job started after the year of graduation. This enables analysis of employment outcomes for selected groupings and is selected by default. Deselect the checkbox to include profiles whose most recent job started on or before the year of graduation or did not include a job start year.
To count as a matched record, a profile has to match an institution’s past student information on name and at least one of the following: contact information or award information (such as graduation year, program name, etc.). Using data obtained from public profiles, Emsi’s deliverables will show the most recent job listed for matched records.
Files include one row per award (major) per student. The record that contains the highest degree for any particular student will be marked “Highest,” and any other record(s) tied to that particular student will be marked “Other.”
Emsi classifies various profile field values such as company and school into a smaller number of fixed categories to enable meaningful aggregation and analysis. We call this process “normalization.” An example of normalization would be to normalize free-form variations of “St. Louis, Missouri” as found in different profiles to “St. Louis. MO”. One person might list their location as “Saint Louis Missouri”; another might list “ST Louis MO”; and a third might list “St. Louis Missouri”. Normalization corrects all variations of this city name to “St. Louis, MO”. Without the normalization step, aggregate analysis of profiles would be impossible–searching for profiles in “St. Louis Missouri” would automatically exclude profiles where the person wrote their location as “Saint Louis Missouri”.
The North American Industry Classification System (NAICS) is the standard federal system for classifying business establishments. Each establishment is assigned a six-digit code and category title, organizing them primarily by similar production processes into five levels: sectors, subsectors, industry groups, industries, and national industries (national industries are specific to one or more of the United States, Canada, and Mexico). Codes are hierarchical: less detailed categories are derived by removing digits from the end of more detailed codes.
The NAICS classification is updated every five years to better reflect economic realities.
For information on Emsi’s use of NAICS codes (including departures from the standard classification), see this article.
Occupation earnings data comes from the BLS’s OES dataset. It is collected from the employer’s perspective, meaning earnings data is pre-tax (individual employees’ tax withholdings will vary, so earnings are reported pre-tax). Occupations have average hourly earnings as well as percentile earnings for five percentiles (10th, 25th, 50th (median), 75th, and 90th).
Average earnings are determined by dividing the total earnings for the occupation by the number of jobs in the occupation. Percentile earnings indicate what percent of the jobs in the occupation earn that amount or less. For example, 10th percentile earnings of $12/hr. indicate that 10% of the workers in that occupation make $12/hr. or less. Median earnings of $15/hr. would mean that half of workers in that occupation make more than $15/hr., and half make less than $15/hr. 10th percentile earnings are often used as a proxy for entry level wages, as they represent some of the lowest earnings in the occupation.
Earnings are reported in terms of hourly income rather than annual income for all but a handful of occupations. For occupations with earnings reported annually, we divide by 2080 (number of hours in a working year) to determine hourly earnings.
Occupation earnings include the following:
Occupation earnings do not include the following:
OES provides definitions for all the categories listed above.
Various reports within Emsi’s Analyst and Developer tools allow users to combine occupation percentile earnings for various occupations or regions. These combinations are powered by a proprietary occupation aggregation methodology that represents the combined wage curves of various occupations better than a weighted average. For this reason, users should not expect to be able to combine percentile earnings by hand and match combined percentile figures as displayed in Analyst. More information on Emsi’s occupation percentile earnings aggregation can be found here.
Source: Emsi’s proprietary employment data, relying heavily on occupational earnings reported in OES.
The Occupational Employment Statistics (OES) program estimates employment and wages for most occupations by industry and sector at the national level, and by occupation at the state and metropolitan statistical area (MSA), and non-MSA levels in the 50 states and the District of Columbia. OES accounts for 1.2 million establishments and 62% of national employment, including railroad, but excluding military, agriculture, fishing, forestry, private households, self-employment, and others.
OES is our primary source of occupation data, but we compensate for OES’s general weaknesses and lack of valid historical data by utilizing stronger, more accurate industry employment counts from QCEW, County Business Patterns (CBP), and American Community Survey (ACS), among others. We then apply regionalized, OES-based staffing patterns to the industry data to show the distribution of jobs by occupation.
Emsi gathers occupation earnings data from OES. We use unsuppression techniques to fill in missing values as appropriate, and also build a time series of OES data in order to present historical occupation earnings.
For a more detailed explanation of how Emsi incorporates OES data into occupational processes, see this article.
O*NET provides occupation data such as knowledge, skills, and abilities needed to perform the work, as well as education and training requirements and alternate job titles. Emsi incorporates this data throughout its tools in various ways.
The O*NET Program is the nation’s primary source of occupational information. The data are essential to understanding the rapidly changing nature of work and how it impacts the workforce and U.S. economy. From this information, applications are developed to facilitate the development and maintenance of a skilled workforce.
Central to the project is the O*NET database, containing hundreds of standardized and occupation-specific descriptors on almost 1,000 occupations covering the entire U.S. economy. The database, which is available to the public at no cost, is continually updated from input by a broad range of workers in each occupation.
O*NET updates do not follow a schedule; Emsi monitors O*NET for updates and downloads new data as it becomes available.
Emsi Profile Analytics is built from individual profiles of over a hundred million workers in the United States. Typical fields available are city/state/nation of residence, job history, education history, and skills. Many profiles also contain names, phone numbers, and email addresses, but these are not made available in bulk to Emsi users.
Quarterly Census of Employment and Wages (QCEW) is a dataset published by the Bureau of Labor Statistics (BLS). QCEW is the backbone of Emsi’s core LMI data, providing establishment counts, monthly employment, and quarterly wages, by NAICS industry, by county, and by ownership sector, for the entire United States. These data are aggregated to annual levels, to higher industry levels (NAICS industry groups, sectors, and supersectors), and to higher geographic levels (national, State, and Metropolitan Statistical Area (MSA)).
Emsi produces a slightly modified form of the BLS QCEW dataset.
Emsi uses the list of counties or states that defines your service region (as specified by your institution) to determine if a matched profile currently resides in or out of your region. *Note: Filtering a report by geographies outside of this service region will always result in 0 alumni in region.
The Standard Occupational Classification (SOC) system is used by Federal statistical agencies to classify workers into occupational categories for the purpose of collecting, calculating, or disseminating data. All workers are classified into one of about 775 detailed occupations according to their occupational definition. To facilitate classification, detailed occupations are combined to form about 450 broad occupations, about 95 minor groups, and 23 major groups. Detailed occupations in the SOC with similar job duties, and in some cases skills, education, and/or training, are grouped together.
The SOC system uses hyphenated codes to divide occupations into four levels: major groups, minor groups, broad occupations, and detailed occupations.
The SOC classification system was updated in 2010, and the update to the 2018 classification is currently happening across various government LMI datasets.
For more information on Emsi’s use of SOC codes (including departures from the standard classification), see this article.