This dataset combines historical electoral roll and consumer register data (on surnames, forenames and locations) between 1997 and 2016 with a special aggregated metric derived from ONS data* which lists the most frequently selected second-level ethnicity category for most common forenames and surnames - see the section "What is the recommended ethnic group question for use on a survey in England?" at

Aggregated ethnicity categories used are (with codes used in the data files):

  • WBR - White: British (including English/Welsh/Scottish/Northern Irish)
  • WIR - White: Irish
  • WAO - White: Any Other
  • ABD - Asian/Asian British: Bangladeshi
  • ACN - Asian/Asian British: Chinese
  • AIN - Asian/Asian British: Indian
  • APK - Asian/Asian British: Pakistani
  • AAO - Asian/Asian British: Any Other
  • BAF - Black/Black British: African
  • BCA - Black/Black British: Caribbean
  • OXX - Any Other Ethnic Group (including Mixed; Black/Black British: Any Other; Arab; All Other Ethnicities; &c.)

Applicants should be mindful that this is a dataset of ethnicity categories and not one showing migration, citizenship, nationality or country of origin.

The dataset was first used for the GISRUK Data Challenge in early 2018. The data distributed here is substantially the same, however it has received some minor technical updates to correct some data issues.

*The data was derived as part of a ESRC-funded project "Ethnicity Estimator" - Virtual Microdata Laboratory project number: 0000013. It is a diagnostic table resulting from the application of CDRC algorithms. The aggregate data was provided by ONS within the VML.

This safeguarded dataset is only available on a contract basis and the application forms and more details can be found at

Data and Resources

Additional Info

Field Value
Product CDRC Ethnicity Modelling
Date Range 1997-2016
Collecting Date 2018
Data Collector ONS, CDRC
Update Frequency Ad-Hoc
Geographical Scales Lower Layer Super Output Area
Analytical Units Person
Data Kind CSV
Bounding box UK
Nation United Kingdom
Source ONS, CDRC
Creator CDRC
Creator Email
Publisher Consumer Data Research Centre
Publisher Email
Publication Year 2018
Maintainer Consumer Data Research Centre
Maintainer Email