TPEHG DB - the Term-Preterm ElectroHysteroGram DataBase

The Term-Preterm EHG Database (TPEHG DB)

(First Edition, July 2008, revised in November 2010, and in August 2012)

The TPEHG DB is described in:

Gašper Fele-Žorž, Gorazd Kavšek, Živa Novak-Antolič and Franc Jager.
A comparison of various linear and non-linear signal processing techniques to separate uterine EMG records of term and pre-term delivery groups.
Medical & Biological Engineering & Computing, 46(9):911-922 (2008). [PDF]

Please cite this publication when referencing this material.

The TPEHG DB was developed in 2008 in the scope of a research project financed by the Slovenian Research Agency [ARRS]. Project: P3-0124 Metabolic and inborn factors of reproductive health, birth (from 2005-2008, and from 2008-1013, and from 2013-2020).

In 2010 we also published the [TPEHG DB] on Physionet.

The Electrohysterogram records (uterine EMG records) included in the Term-Preterm ElectroHysteroGram DataBase (TPEHG DB) were obtained from 1997 to 2005 at the University Medical Centre Ljubljana, Department of Obstetrics and Gynecology. The records were obtained during regular check-ups either around the 22nd week of gestation or around the 32nd week of gestation. The women participating in the study represented a sample of the general population. In all, almost 1300 records were obtained during these years, and a preliminary database was built and used for studies by Ivan Verdenik, Gorazd Kavšek, Marjan Pajntar and Živa Novak-Antolič [1],[2].

The TPEHG DB posted here contains 300 uterine EMG records from 300 pregnancies (one record per pregnancy) carefully selected from the original database of which:

During the selection of records, all records with apparent recording artifacts,all records from pregnancies where labor was induced, and all records where delivery was performed using a Cesarean section, were rejected.

Each record is composed of three channels, recorded from 4 electrodes:

The differences in the electrical potentials of the electrodes were recorded, producing 3 channels:

The individual records are 30 minutes in duration. Each signal has been digitized at 20 samples per second per channel with 16-bit resolution over a range of ±2.5 millivolts.

Each signal was digitally filtered using 3 different 4-pole digital Butterworth filters with a double-pass filtering scheme. The band-pass cut-off frequencies were:

The records in the database contain both the original and filtered signals. The records are in WFDB format. Each record consists of two files, a header file (.hea) containing information regarding the record and the data file (.dat) containing signal data.

The comment section in the header files (.hea) includes clinical information, such as:

The signal data in the data files (.dat) are in the following order:

When using filtered channels, note that the first and last 180 seconds of the signals should be ignored since these intervals contain transient effects of the filters.

An accompanying file (tpehgdb.smr) summarizes clinical information of each record, describing whether the corresponding pregnancy ended on term (> 37 weeks) or prematurely (≤ 37 weeks), and whether the record was obtained before the 26th week of gestation or during or after the 26th week of gestation.

The columns in the tpehgdb.smr file represent:

The TPEHG DB as posted here was the database used during a study of separating uterine EMG records of term and pre-term delivery groups using various linear and non-linear signal processing techniques [3]. Please cite reference [3] if using the records of the TPEHG.

During a study on comparison of various linear and non-linear signal processing techniques to separate uterine EMG records of term and pre-term delivery groups [3], among the other, following non-linear signal processing techniques to calculate following features for the records were used:

The feature values for all filtered signals of the records of the database are included in the feature values files (.fvl) of the database. For further details on how the features were calculated, see [3].

The .fvl files are organised according to the filter used:

The columns in the .fvl files represent:

For further information, please contact:

Prof. Franc Jager, head
Laboratory of Biomedical Computer Systems and Imaging
University of Ljubljana
Faculty of Computer and Information Science
Trzaska 25
1000 Ljubljana, Slovenia
email: franc.jager@fri.uni-lj.si

References:

  1. Ivan Verdenik. Multilayer prediction model for preterm delivery. PhD thesis. University of Ljubljana, Medical faculty, Ljubljana, 2002.
  2. Gorazd Kavšek. Electromyographic activity of the uterus in threatened preterm delivery. MsC thesis. University of Ljubljana, Medical faculty, Ljubljana, 2001.
  3. Gašper Fele-Žorž, Gorazd Kavšek, Živa Novak-Antolič and Franc Jager. A comparison of various linear and non-linear signal processing techniques to separate uterine EMG records of term and pre-term delivery groups. Medical & Biological Engineering & Computing, 46(9):911-922 (2008). [PDF]

Icon  Name                                        Last modified      Size  Description
[   ] RECORDS 2021-08-23 12:43 2.8K list of records [   ] tpehgdb.pdf 2021-08-23 12:43 918K records [   ] tpehgdb.smr 2021-08-23 12:43 19K notes on records [   ] tpehgdb.zip 2021-08-23 12:43 248M records [   ] tpehgdb_features__filter_0.08_Hz-4.0_Hz.fvl 2021-08-23 12:43 98K records [   ] tpehgdb_features__filter_0.3_Hz-3.0_Hz.fvl 2021-08-23 12:43 98K records [   ] tpehgdb_features__filter_0.3_Hz-4.0_Hz.fvl 2021-08-23 12:43 98K records [DIR] tpehgdb/ 2021-08-23 12:43 - records