Icentia11k Dataset
Overview
This dataset consists of ECG recordings from 11,000 patients and 2 billion labelled beats. The data was collected by the CardioSTAT, a single-lead heart monitor device from Icentia. The raw signals were recorded with a 16-bit resolution and sampled at 250 Hz with the CardioSTAT in a modified lead 1 position. We provide derived version of the dataset where each patient is stored in separate HDF5 files on S3. This makes it faster to download as well as makes it possible to leverage TensorFlow prefetch
and interleave
to parallelize data loading.
More info available on PhysioNet website
Usage
Example
Note
The Icentia11k dataset requires roughly 200 GB of disk space and can take around 2 hours to download.
Funding
This work is partially funded by a grant from Icentia, Fonds de Recherche en Santé du Québec, and the Institute of Data Valorization (IVADO).
Licensing
The Icentia11k dataset is available for non-commercial use only. Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
Warning
The dataset is intended for evaluation purposes only and cannot be used for commercial use without permission. Please visit Physionet for more details.