The FIMMG_obs dataset is a subset of the FIMMG dataset: only some of the patients and EHR fields were selected for this study.
The FIMMG_obs dataset contains a total of 968 patients and 3 main EHR fields.
All patients with type 2 diabetes were excluded from the FIMMG_obs dataset.
The number of features in each main field is given in square brackets:
- Demographic (Gender, Age) [2]
- Monitoring (Systolic and diastolic blood pressure, Height, Weight, BMI) [5]
- Clinical (Laboratory exams) [73]
The date of each laboratory exam and blood pressure measurement is reported for each patient. These dates matter because they make it possible to trace each patient's longitudinal clinical history from 2010 to 2018. In total, the dataset collects 2276 observations, corresponding to the triglyceride-glucose (TyG) index measurements for each patient.
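
For readers unfamiliar with the TyG index, the snippet below is a minimal sketch of how it is commonly computed in the literature: the natural logarithm of fasting triglycerides (mg/dL) times fasting glucose (mg/dL), divided by 2. The function name `tyg_index` and its parameter names are illustrative assumptions, not part of the dataset or of the released code, and the units are assumed to be mg/dL.

```python
import math


def tyg_index(triglycerides_mg_dl: float, glucose_mg_dl: float) -> float:
    """Return the triglyceride-glucose (TyG) index.

    Assumes the commonly used definition
        TyG = ln(triglycerides [mg/dL] * glucose [mg/dL] / 2)
    with both values taken from fasting measurements.
    The function and parameter names are hypothetical.
    """
    # The logarithm is only defined for strictly positive inputs.
    if triglycerides_mg_dl <= 0 or glucose_mg_dl <= 0:
        raise ValueError("Both measurements must be strictly positive.")
    return math.log(triglycerides_mg_dl * glucose_mg_dl / 2.0)


if __name__ == "__main__":
    # Example values chosen for illustration only (not taken from the dataset).
    print(round(tyg_index(150.0, 100.0), 2))
```

If the measurements are stored in other units (for example mmol/L), they would need to be converted to mg/dL before applying this formula.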

To request the FIMMG_obs dataset:
- Send an email to vrai@dii.univpm.it (Note: use an email address linked to your research institution or university)
- You will receive a form to fill out, followed by a download link

Please cite our work using the following BibTeX entry:
@article{bernardini2019discovering,
  title={TyG-er: An Ensemble Regression Forest Approach for Identification of Clinical Factors Related to Insulin Resistance Condition Using Electronic Health Records},
  author={Bernardini, Michele and Morettini, Micaela and Romeo, Luca and Frontoni, Emanuele and Burattini, Laura},
  journal={Computers in Biology and Medicine},
  year={2019},
  publisher={Elsevier}
}
The code associated with our work can be found here
