You are here

Local Data Company - SmartStreetSensor Footfall Data – Research Aggregated data

Local Data Company - SmartStreetSensor Footfall Data - Research Aggregated data is an aggregated version of the secure version of this dataset. The aggregated level footfall sensor data is a derived product producing five-minute footfall counts.

Also available is a less granular version of this dataset's research aggregated data safeguarded and secure version. The aggregate level footfall measures are a derived product producing weekly footfall counts for select retail centres across the UK.

This SmartStreetSensor footfall dataset produced by Local Data Company (LDC) in partnership with the CDRC contains weekly footfall counts of passive WiFi signal probing from a sensor network across Great Britain between 2015 and 2020. The data are used as a proxy for estimating footfall at retail locations.

The dataset includes details about the location of the sensors (description as well as latitude, longitude, height, depth, installation dates) and cleaned five-minute interval footfall estimates which include timestamps, locations, adjusted and unadjusted footfall counts. A complete description of this dataset can be found below in the Data and Resources section.

Content and Size

The dataset includes information from 1151 sensor locations across 107 cities in Great Britain, identified by addresses including building numbers, street names, and unit postcodes. Data spans from July 2015 to September 2020, aggregated into five-minute intervals.
• Rows: Over 20 million records are across approximately 67 monthly files.
• Columns: The dataset comprises 22 variables distributed across 71 files.

Quality Representation and Bias

The quality and representation of the SmartStreetSensor Footfall dataset are influenced by the methodologies employed and the inherent biases associated with its collection process:
1. Sensor Range: The signal strength and sensor range are variable, influenced by environmental conditions and technical specifications. This variability introduces inconsistencies in coverage.
2. Probing Frequency: Devices probe for Wi-Fi signals at differing frequencies based on manufacturer, operating system, and usage state, affecting the detection consistency.
3. MAC Address Collisions: A minor percentage (0.01%) of MAC addresses are reported by multiple devices due to MAC randomization techniques, adding complexity to data cleaning.
4. Human Error: Sensor power disconnections and operational disruptions result in occasional data gaps.
5. Postprocessing Assumptions: The process of transforming probe requests into footfall estimates involves assumptions that may lead to overcounting or undercounting in specific scenarios.
• Geographical Representation: The dataset is heavily skewed toward Greater London, with one-third of sensor locations situated in this region. Consequently, national-level aggregated metrics may disproportionately reflect patterns in London.
• Temporal Coverage: Early stages of data collection, before July 2016, included fewer sensors (approximately 200), mostly located in London, further amplifying initial geographical biases.
• Device Misclassification: Sensors cannot distinguish between mobile devices and other Wi-Fi-enabled devices (e.g., printers or routers), potentially inflating counts.
• City-Level Distribution: The highest sensor concentration is in cities like London (318 locations), followed by Edinburgh (46) and Manchester (32). In contrast, smaller towns often have one or two sensors, limiting granularity in those areas.

Controller: 
University College London (UCL)
Additional Info: 
FieldValue

Source

Local Data Company

Attribution

Data provided by the Consumer Data Research Centre, an ESRC Data Investment: ES/L011840/1, ES/L011891/1

Rows

over 20 million

Columns

22 variables across 72 files

FieldValue
Modified
2024-12-16
Release Date
2019-11-20
Frequency
Hourly
Spatial / Geographical Coverage Location
United Kingdom
Temporal Coverage
January 2015 to September 2020
Granularity
Location; Five minutes
Author
Local Data Company; CDRC
Contact Name
Dr Maurizio Gibin
Contact Email
POLYGON ((-8.9948498999 49.688302644, 2.0867431164 49.688302644, 2.0867431164 61.0684288668, -8.9948498999 61.0684288668, -8.9948498999 49.688302644))
License Not Specified

Data Extent

Apply for the data:

To apply for the data, please login or register.

License

License Not Specified