A Review of Open-Source Datasets of Physiological Signals for Sleep Research
The collection and labeling of clinical polysomnography data are time-consuming and costly,and the differences between different populations,collection devices,and expert labeling create challenges for sleep-related research.The open-source datasets provide rich data resources and a unified comparison platform for global researchers to conduct sleep studies.This paper reviewed the characteristics and applications of 18 open-source datasets commonly used in the field of sleep.The datasets include electroencephalogram(EEG),electrocardiogram(ECG),electro-oculogram(EOG),electromyography(EMG),etc.,covering multiple clinical fields such as sleep disorders,cardiovascular diseases,obesity,etc.,promoting in-depth research in the field of sleep medicine.This paper also summarized the limitations of existing sleep open-source datasets in terms of data quality,data standards,data security,sample representation and external validity,and put forward specific suggestions and prospects.