Datasets & Competitions
The MLC ETI is dedicated to foster the application of ML in communications by presenting datasets and competitions tailored for communication society. The target is to establish a set of common problems and corresponding datasets on which researchers can benchmark and compare their algorithms in a reproducible and credible way.
Datasets
- 5G Performance: throughput performance, latency measurements, impact of mobility and obstructions, handoff analysis.
- Body area network radio channel: Measurement set with transmit-receive link gain.
- Colosseum O-RAN COMMAG Dataset: Radio Access Network Intelligent Controller (RIC) dataset.
- CRAWDAD up/rf_recordings: RF recordings of several communication signals.
- CRAWDAD rutgers/noise: Received signal strength indicator from ORBIT testbed.
- DeepBeam: mmWave I/Q samples from National Instruments mmWave Transceiver System.
- DeepMIMO: A generic deep learning dataset for millimeter wave and massive MIMO applications.
- Device Identification: IoT device identification dataset.
- Distributed Massive MIMO: Outdoor and outdoor-to-indoor measurements with 64 antennas and 18 users.
- Identification of Saturated and Unsaturated WiFi Networks: Inter-Frame Spacing (IFS) data for saturated and unsaturated WiFi Networks.
- IEEE Dataport Collection of Datasets for Communications: 250+ open-source datasets for Communications.
- Industrial-UWB-localization-CIR-dataset: Channel Impulse Response (CIR) data for industrial ultra-wideband localization.
- MIT RFChallenge: Dataset connected to detection, identification, and geolocation of RF signals.
- Power Allocation in Multi-Cell Massive MIMO: Dataset for power allocation in a Massive MIMO network.
- RadioML: Recordings of digital and analog modulation types.
- RF WebLab: Access to an amplifier and measurement system for digital pre-distortion.
- Sub-GHz-IQ-signals-dataset: IQ signals captured from multiple Sub-GHz technologies.
- Sussex-Huawei Locomotion Dataset: Annotated dataset for multimodal locomotion analytics of mobile users.
- Technology-Recognition-dataset-of-real-life-LTE-WiFi-and-DVB-T: IQ signals captured from LTE, WiFi, and DVB-T.
- UWB Localization dataset: UWB localization data set contains measurements from four different indoor environments and can be used for range-based localization evaluation.
- ViWi: A deep learning dataset framework for vision-aided wireless communications.
Open Dataset Initiative
The MLC ETI strongly encourages that new high-quality datasets are made openly available. If you have published a dataset that is not listed above, please contact the Datasets & Competitions Officer over email or slack with your interest. We can also help you with hosting the dataset at the IEEE MLC Datasets Server.
Data Competitions
- 1.25GHz localization dataset: From the IEEE CTW 2019 Challenge.
- IEEE Dataport: Data Competition Contest
- IEEE SPAWC 2021 #2: Wideband Radio Signal Recognition
- IEEE SPAWC 2021 #1: Radio Localization with Multiple Sensors
- ITU AI/ML in 5G Challenge 2020: Applying Machine Learning in Communication Networks
- IEEE ICC 2020: Vision-Aided Beam Tracking for mmWave Systems
- IEEE CTW 2020: Self-Supervised Learning for User Localization
- IEEE CTW 2019: Positioning Algorithm Competition.