TPEHGT DS - the Term-Preterm ElectroHysteroGram DataSet with Tocogram

The Term-Preterm EHG Dataset with Tocogram (TPEHGT DS)

(First Edition, 2017)

The Term-Preterm EHG Dataset with Tocogram (TPEHGT DS) is described in:

Jager F, Libenšek S, Geršak K.
Characterization and automatic classification of preterm and term uterine records.
PLoS ONE, 13(8): e0202125. https://doi.org/10.1371/journal.pone.0202125 (2018). [PDF]

Please cite this publication when referencing this material.

The TPEHGT DS was developed in 2017 within the scope of a research project financed by the Slovenian Research Agency [ARRS]. Project: P3-0124 Metabolic and inborn factors of reproductive health, birth (2005-2008, 2008-2013, and 2013-2020).

In 2018, the TPEHGT DS was also published on PhysioNet.

Introduction

The Term-Preterm ElectroHysteroGram DataSet with Tocogram (TPEHGT DS) contains 26 four-signal 30-min uterine EHG records, i.e., three EHG signals accompanied by a simultaneously recorded external tocogram measuring mechanical uterine activity (TOCO signal) of pregnant women, and another five 30-min uterine records (EHG signals and TOCO signal) of non-pregnant women.

Data Description

The records of the pregnant women belong to pregnancies that resulted in spontaneous preterm delivery (13 preterm records from eight pregnancies), and to pregnancies that resulted in spontaneous term delivery (13 term records from ten pregnancies). The main objective of the dataset is to provide a set of annotated contraction intervals (annotated intervals related to uterine contractions), and another set of annotated non-contraction intervals (dummy intervals, i.e., intervals out of uterine contractions). The annotated contraction and dummy intervals of the records allow:

The dataset was developed at the Faculty of Computer and Information Science (Laboratory for Biomedical Computer Systems and Imaging), University of Ljubljana, Ljubljana, while the records were collected at the University Medical Centre Ljubljana, Department of Obstetrics and Gynecology. The acquisition of the uterine records and the research were approved by the National Medical Ethics Committee of the Republic of Slovenia (No. 32/01/97 and No. 108/09/09). The recording protocol, including the position of the electrodes, was the same as the one used to collect the records of the Term-Preterm EHG Database (TPEHG DB).

The women participating in the study represented a sample of the general population. The TPEHGT DS contains 31 uterine records of which:

The records of pregnant women were obtained during regular check-ups around the 31st week of pregnancy. The mean recording time of the records of pregnant women was 30.2 weeks of pregnancy (standard deviation ± 2.76 weeks). For the manual annotation procedure, we used a graphical user interface and annotation editor. Besides visualizing the original signals and editing annotations, the graphical user interface also allows calculating and visualizing spectra and spectrograms of the signals. Consensus about the annotated intervals was reached by two annotators.

The EHG signals of the records were collected from the abdominal surface. The electrodes (AgCl2) used to measure the EHG signals were placed in two horizontal rows, symmetrically above and under the navel, spaced 7 cm apart:

The differences in the electrical potentials of the electrodes were recorded, producing three signals:

Prior to sampling, the EHG signals were filtered using an analog three-pole Butterworth filter with a bandwidth from 0.0 Hz to 5.0 Hz. The fourth simultaneous signal was the analog signal corresponding to an external tocogram (TOCO signal) measuring mechanical uterine pressure, acquired using a cardiotocograph (model HP8030) attached at the top of the fundus. The analog TOCO signal was led to one of the amplifiers of the A/D converter (a value of 150 μV corresponds to a pressure of 1 Pa). The sampling frequency for the EHG and TOCO signals was 20 samples per second per signal, with 16-bit resolution.
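
The acquisition parameters above translate into simple arithmetic. The following is a minimal sketch that only restates the figures from the text (20 samples per second, 150 μV per 1 Pa); the function names are illustrative, and the conversion assumes the TOCO value is already expressed in microvolts rather than in raw A/D units.

```python
SAMPLING_RATE_HZ = 20           # samples per second per signal (from the text)
MICROVOLTS_PER_PASCAL = 150.0   # 150 uV corresponds to 1 Pa (from the text)


def samples_per_signal(duration_min: float) -> int:
    """Number of samples in one signal of the given duration."""
    return round(duration_min * 60 * SAMPLING_RATE_HZ)


def toco_microvolts_to_pascals(microvolts: float) -> float:
    """Convert a TOCO value in microvolts to pressure in pascals.

    Assumption: the input is already in microvolts, not raw A/D units.
    """
    return microvolts / MICROVOLTS_PER_PASCAL


n_samples = samples_per_signal(30)            # a nominal 30-min record
pressure = toco_microvolts_to_pascals(300.0)  # 300 uV expressed in Pa
```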

The original EHG and TOCO signals of the records were further filtered using a four-pole band-pass digital Butterworth filter with cut-off frequencies at 0.08 Hz and 5.0 Hz, applied bi-directionally. The records of the TPEHGT DS contain both the original and the filtered signals.
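
For readers who want to apply comparable filtering to their own signals, the sketch below shows one way to build a zero-phase Butterworth band-pass with SciPy. It is not the authors' implementation: it assumes NumPy and SciPy are available, and it interprets "four-pole" as the total order of the band-pass filter, which corresponds to `n=2` in `scipy.signal.butter` (a band-pass design of order `n` has `2*n` poles). The synthetic test signal is made up for illustration.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt

FS_HZ = 20.0          # sampling frequency from the text
LOW_CUT_HZ = 0.08     # lower cut-off from the text
HIGH_CUT_HZ = 5.0     # upper cut-off from the text


def bandpass_zero_phase(x, fs=FS_HZ, low=LOW_CUT_HZ, high=HIGH_CUT_HZ, n=2):
    """Band-pass Butterworth filter applied forward and backward.

    Assumption: n=2 gives a four-pole band-pass in SciPy's convention.
    """
    sos = butter(n, [low, high], btype="bandpass", fs=fs, output="sos")
    return sosfiltfilt(sos, x)  # bi-directional filtering, zero phase


# Synthetic 10-min signal: constant offset plus a 1 Hz sinusoid.
t = np.arange(12000) / FS_HZ
synthetic = 5.0 + np.sin(2 * np.pi * 1.0 * t)
filtered = bandpass_zero_phase(synthetic)
```

Bi-directional filtering squares the magnitude response, so the effective attenuation at the cut-off frequencies is stronger than that of a single pass.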

Files

Record names of the TPEHGT DS are of the following format: tpehgt_TXXX, where capital T refers to the type of record: p - Preterm, t - Term, and n - Non-pregnant, and XXX is the record number of that type.
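
The naming scheme can be checked programmatically. Below is a minimal sketch of a parser for record names; the function and class names are illustrative and not part of the dataset.

```python
import re
from typing import NamedTuple

RECORD_TYPES = {"p": "Preterm", "t": "Term", "n": "Non-pregnant"}
_NAME_PATTERN = re.compile(r"tpehgt_([ptn])(\d{3})")


class RecordName(NamedTuple):
    type_code: str   # "p", "t", or "n"
    type_label: str  # human-readable record type
    number: int      # record number within that type


def parse_record_name(name: str) -> RecordName:
    """Split a name such as 'tpehgt_p001' into its type and number."""
    match = _NAME_PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"not a TPEHGT DS record name: {name!r}")
    code = match.group(1)
    return RecordName(code, RECORD_TYPES[code], int(match.group(2)))


example = parse_record_name("tpehgt_p001")
```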

The records of the dataset are in the WFDB format. Each record consists of three files: a header file (.hea) containing information about the record, a data file (.dat) containing the original and filtered signal data, and an annotation file (.atr) containing annotations for the record.
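
A quick completeness check of a local copy can be sketched as follows. The record counts (13 preterm, 13 term, 5 non-pregnant) come from the description above; the assumption that records are numbered consecutively from 001 within each type, and the helper names, are illustrative.

```python
from pathlib import Path

# Number of records per type, as stated in the dataset description.
RECORD_COUNTS = {"p": 13, "t": 13, "n": 5}
EXTENSIONS = (".hea", ".dat", ".atr")


def record_names():
    """Expected record names, assuming consecutive numbering from 001."""
    return [
        f"tpehgt_{code}{number:03d}"
        for code, count in RECORD_COUNTS.items()
        for number in range(1, count + 1)
    ]


def record_files(name):
    """The three WFDB file names that make up one record."""
    return [name + ext for ext in EXTENSIONS]


def missing_files(directory):
    """File names expected in `directory` that are not present."""
    root = Path(directory)
    return [
        file_name
        for name in record_names()
        for file_name in record_files(name)
        if not (root / file_name).is_file()
    ]
```

Calling `missing_files` on the directory holding the unpacked dataset should return an empty list if every expected file is present.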

The comment section in the header files (.hea) includes clinical information, such as:

If data were not available, the value is marked as None; if data were not applicable, the value is marked as N/A.
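
When processing the clinical values, it can help to keep the two markers distinct rather than collapsing both into a generic "missing" value. The sketch below handles only the value strings; how the values are laid out within the header comments is not assumed here, and all names are illustrative.

```python
from enum import Enum
from typing import Union


class Missing(Enum):
    NOT_AVAILABLE = "None"   # data was not available
    NOT_APPLICABLE = "N/A"   # data was not applicable


def interpret_value(raw: str) -> Union[str, Missing]:
    """Map the two marker strings to Missing members; keep other text."""
    text = raw.strip()
    for marker in Missing:
        if text == marker.value:
            return marker
    return text


unavailable = interpret_value("None")
inapplicable = interpret_value(" N/A ")
```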

The signal data in the data files (.dat) are in the following order:

The annotation files (.atr) contain manual annotations of contraction and dummy intervals:

Contact

For further information, please contact:

Franc Jager
Laboratory of Biomedical Computer Systems and Imaging
Faculty of Computer and Information Science
University of Ljubljana
Večna pot 113
1000 Ljubljana, Slovenia
email: franc.jager@fri.uni-lj.si

References

  1. Jager F, Libenšek S, Geršak K (2018). Characterization and automatic classification of preterm and term uterine records. PLoS ONE 13(8): e0202125. https://doi.org/10.1371/journal.pone.0202125. [PDF]

Directory contents (all files last modified 2021-08-23 12:42):

ANNOTATORS (size 58)
DOI (size 19)
RECORDS (size 372): list of records
journal.pone.0202125.pdf (23M)
tpehgt_ds.zip (12M)
tpehgt_n001 to tpehgt_n005: annotation file (.atr), digitized signals (.dat, 550K to 554K), header file (.hea)
tpehgt_p001 to tpehgt_p013: annotation file (.atr), digitized signals (.dat, 562K to 563K), header file (.hea)
tpehgt_t001 to tpehgt_t013: annotation file (.atr), digitized signals (.dat, 562K to 563K), header file (.hea)