Next Article in Journal
An Open GMNS Dataset of a Dynamic Multi-Modal Transportation Network Model of Melbourne, Australia
Previous Article in Journal
High-to-Low (Regional) Fertility Transitions in a Peripheral European Country: The Contribution of Exploratory Time Series Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Data Descriptor

Dataset of Two-Dimensional Gel Electrophoresis Images of Acute Myeloid Leukemia Patients before and after Induction Therapy

by
Juan E. Urrea
1,
Luisa F. Restrepo
2,
Jeanette Prada-Arismendy
2,
Erwing Castillo
3,
Manuel M. Goez
4,
Maria C. Torres-Madronero
4,
Edilson Delgado-Trejos
1 and
Sarah Röthlisberger
2,*
1
Measurement Analysis and Decision-Making Support Laboratory (AMYSOD Lab), Quality, Metrology and Production (CM&P) Research Group, Instituto Tecnologico Metropolitano (ITM), 76A354 Medellin, Colombia
2
Biomedical Research and Innovation Group (GI2B), Instituto Tecnologico Metropolitano ITM, 76A354 Medellin, Colombia
3
Hematology and Oncology Unit, Hospital Manuel Uribe Angel, 38S57 Envigado, Colombia
4
Smart Machines and Pattern Recognition (MIRP) Research Group, Instituto Tecnologico Metropolitano ITM, 76A354 Medellin, Colombia
*
Author to whom correspondence should be addressed.
Submission received: 3 December 2020 / Revised: 18 December 2020 / Accepted: 19 December 2020 / Published: 18 February 2021

Abstract

:
Acute myeloid leukemia (AML) is a malignant disorder of the hematopoietic stem and progenitor cells, which results in the build-up of immature blasts in the bone marrow and eventually in the peripheral blood of affected patients. Accurately assessing a patient´s prognosis is very important for clinical management of the disease, which is why there are several prognostic factors such as age, performance status at diagnosis, platelet count, serum creatinine and albumin that are taken into account by the clinician when deciding the course of treatment. However, proteomic changes related to treatment response in this patient group have not been widely explored. Here, we make available a set of 22 two-dimensional gel electrophoresis (2DGE) images obtained from the peripheral blood samples of 11 patients with AML, taken at the time of diagnosis and after induction therapy (approximately 21–28 days after starting treatment). The same set of 2DGE images is also made available after a preprocessing stage (an additional 22 2DGE pre-processed images), which was performed using algorithms developed in Python, in order to improve the visualization of characteristic spots and facilitate proteomic analysis of this type of images.
Dataset: The dataset will be published as a supplement to this paper, so this field will be filled by the editors of the journal.
Dataset License: CC-BY 4.0

1. Summary

According to the Global Cancer Observatory (Globocan 2018), each year, 437,033 patients worldwide are diagnosed with some type of leukemia, and 309,006 people die from this disease. Acute myeloid leukemia (AML) is a type of leukemia that mainly occurs in older adults; 42% of Americans diagnosed with AML are over 65 years of age, and their diagnosis is rarely made before 40 years of age, although cases have progressively increased over time [1]. AML is the result of an accumulation of acquired genetic alterations in the DNA of hematopoietic progenitor cells, and accurately assessing a patient’s prognosis is very important for clinical management of the disease. The patient’s cytogenetic profile is currently the strongest prognostic factor. For example, a complex karyotype, monosomy 5 or 7, t(6;9), inv(3), or 11q changes, other than t(9;11), have all been associated with a significantly lower response to treatment and overall survival [2]. It is clear that genetic studies are very valuable; however, isolated from a context in which thousands of proteins mediate cellular function, this prognostic model is not complete.
The images of this dataset were obtained by two-dimensional gel electrophoresis (2DGE), a technique that separates proteins according to their isoelectric point and molecular weight [3], followed by protein staining and image capture. Often, 2DGE images include anomalies [4,5] such as vertical lines, horizontal lines, diffuse points, and noise, among others, which make it difficult to identify spots that contain valuable information. Therefore, a preprocessing stage is often necessary in order to discriminate stains and noise from real protein spots [6]. Omitting this stage can affect the interpretation of the data, as noise could be identified as false protein spots [7]. Image preprocessing is responsible for reducing or correcting these irregularities in 2DGE images. The authors have implemented an approach that integrates the techniques of image normalization, noise reduction by non-linear techniques, and background correction [4,8], sequentially applying the following structure: adaptive piecewise histogram equalization for image normalization, a geometric nonlinear diffusion filter (GNDF) for filtering, and multilevel thresholding for background correction, obtaining favorable results [9].

2. Data Description

The database consists of a set of 22 2DGE images obtained from the peripheral blood samples of 11 patients with acute myeloid leukemia. Of these, 11 images correspond to samples taken at the time of diagnosis, and the other 11 correspond to samples taken from the same patients after induction therapy (approximately 21–28 days after starting treatment). Images named with the suffix BEFORE refer to 2DGE images of samples taken at the time of diagnosis (before treatment), while images named with the suffix AFTER correspond to 2DGE images of samples taken after treatment. These 22 images are also made available with the preprocessing stage applied, to which the prefix PREPROC has been applied. Each image in the database is in tagged image file format (TIFF) format with a resolution of 300 dots per inch (DPI). In total, the database, which can be found in the Supplementary Materials, contains 44 images (22 raw 2DGE images and 22 pre-processed 2DGE images). The characteristics corresponding to each image are summarized in Table 1.

3. Methods

3.1. Patients

Peripheral blood was obtained from 11 newly diagnosed patients with de novo AML at Hospital Manuel Uribe Angel in Colombia. Two blood samples were taken from each patient: at the time of diagnosis (before the start of chemotherapy) and once again after completion of the first round of induction therapy, which was typically 2–3 weeks after induction or when neutrophil and platelet recovery was achieved. Relevant clinical information of the patients involved in this study is summarized in Table 2.

3.2. Protein Extraction

Peripheral blood mononuclear cells (PBMCs) were isolated from the blood samples by standard density gradient centrifugation with a Ficoll Histopaque®-1077 (Sigma-Aldrich, St. Louis, MO, USA). In order to extract proteins from the PBMCs, the cells were lysed (0.5% Triton x-100, 50 mM Tris-HCL pH 8.0, 150 mM NaCL, 1 mM Ethylenediaminetetraacetic acid (EDTA), protease inhibitors) and the proteins precipitated in a 20% (v/v) final concentration of trichloroacetic acid. The protein pellet was resuspended in a rehydration buffer (7 M urea, 2% 3-cholamidopropyl dimethylammonio 1-propanesulfonate (CHAPS), 0.5% carrier ampholytes) and stored at −70 °C.

3.3. Two-Dimensional Gel Electrophoresis

Proteins (50 µg) were loaded by passive rehydration onto 7 cm ZOOM® immobilized pH gradient (IPG) strips with a pH of 3–10 NL (ThermoFisher Scientific, Waltham, MA, USA) at room temperature. Isoelectric focusing was carried out using the following voltage ramp: 100 V for 1 h, 150 V for 1 h, 200 V for 5 min, 450 V for 5 min, 600 V for 5 min, 750 V for 5 min, 950 V for 5 min, 1200 V for 10 min, 1400 V for 10 min, 1600 V for 10 min, and 2000 V for 45 min. The IPG strips were then reduced with 100 mM Dithiothreitol (DTT) and alkylated with 2.5% iodacetamide, according to the manufacturer’s recommended protocol. After this, the IPG strips were loaded onto SDS-PAGE NuPAGE Novex 4–12% Bis-Tris protein gels 1.5 mm in size (ThermoFisher Scientific) and run at 200 V for 45 min. After electrophoresis, these were stained with SYPRO® Ruby (Invitrogen, ThermoFisher Scientific), and the gel images were acquired using the ChemiDoc MP System (Biorad).

3.4. Image Pre-Processing

This step was performed in order to mitigate anomalies due to the acquisition routines and improve spot detection. The approach proposed in [9] was applied, integrating the following techniques for image normalization, noise reduction, and background correction: adaptive piecewise histogram equalization, a geometric nonlinear diffusion filter (GNDF), and multilevel thresholding. The algorithm was executed in Python, which is an open-source programming language, with free access to permanent online support through a considerable number of available libraries, accelerating the creation of multi-stage structure codes with the aim of obtaining consistent, reliable, and potentially integrable results.

Supplementary Materials

The following are available online at https://www.mdpi.com/2306-5729/6/2/20/s1.

Author Contributions

Conceptualization, S.R., J.P.-A., M.C.T.-M. and E.D.-T.; methodology, S.R., M.C.T.-M. and E.D.-T.; software, J.E.U., M.M.G., M.C.T.-M. and E.D.-T.; validation, S.R., M.C.T.-M. and E.D.-T.; formal analysis, M.M.G., M.C.T.-M. and E.D.-T.; investigation, L.F.R.; resources, E.C.; data curation, S.R.; writing—original draft preparation, J.E.U., S.R. and E.D.-T.; writing—review and editing, J.E.U., S.R., J.P.-A., M.C.T.-M. and E.D.-T.; visualization, J.E.U., S.R. and E.D.-T.; supervision, S.R., M.C.T.-M. and E.D.-T.; project administration, S.R.; funding acquisition, S.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Instituto Tecnologico Metropolitano ITM, grant number P17215. J.E.U. was recipient of the Jovenes Investigadores e Innovadores ITM 2020 program of the Instituto Tecnologico Metropolitano ITM of Medellin-Colombia.

Institutional Review Board Statement

This study was conducted according to the guidelines of the Declaration of Helsinki, and approved both by the Research Ethics Committee of the INSTITUTO TECNOLOGICO METROPOLITANO (6 June 2014, project code P17215) and the Ethics Committee of the HOSPITAL MANUEL URIBE ANGEL (17 April 2015).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available in the Supplementary Materials.

Acknowledgments

The authors would like to thank the E.S.E. Hospital Manuel Uribe Angel, the Biomedical Sciences Laboratory, the Smart Machine and Pattern Recognition Laboratory (MIRP Lab), and the Measurement Analysis and Decision Support Laboratory (AMYSOD Lab) of Parque i, Medellin, Colombia.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Deschler, B.; Lubbert, M. Acute myeloid leukemia: Epidemiology and etiology. Cancer 2006, 107, 2099–2107. [Google Scholar] [CrossRef] [PubMed]
  2. Estey, E.H. Acute myeloid leukemia: 2019 update on risk-stratification and management. Am. J. Hematol. 2018, 93, 1267–1291. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Beranova-Giorgianni, S. Proteome analysis by two-dimensional gel electrophoresis and mass spectrometry: Strengths and limitations. Trac. Trends Anal. Chem. 2003, 22, 273–281. [Google Scholar] [CrossRef]
  4. Kaczmarek, K.; Walczak, B.; De Jong, S.; Vandeginste, B.G. Preprocessing of two-dimensional gel electrophoresis images. Proteomics 2004, 4, 2377–2389. [Google Scholar] [CrossRef] [PubMed]
  5. Nugues, P.M. Two-dimensional electrophoresis image interpretation. IEEE Trans. Biomed. Eng. 1993, 40, 760–770. [Google Scholar] [CrossRef] [PubMed]
  6. Villegas-Rivera, G.A.; Torres-Madronero, M.C.; Röthlisberger Booth, S.; Delgado-Trejos, E. Procesamiento de imágenes de electroforesis bidimensional: Una revisión. Sci. Tech. 2019, 24, 76–84. [Google Scholar] [CrossRef]
  7. Tsakanikas, P.; Manolakos, E.S. Improving 2-DE gel image denoising using contourlets. Proteomics 2009, 9, 3877–3888. [Google Scholar] [CrossRef] [PubMed]
  8. Goez, M.M.; Torres-Madronero, M.C.; Rothlisberger, S.; Delgado-Trejos, E. Preprocessing of 2-Dimensional Gel Electrophoresis Images Applied to Proteomic Analysis: A Review. Genom. Proteom. Bioinform. 2018, 16, 63–72. [Google Scholar] [CrossRef] [PubMed]
  9. Goez, M.M.; Torres-Madronero, M.C.; Rothlisberger, S.; Delgado-Trejos, E. Joint pre-processing framework for two-dimensional gel electrophoresis images based on nonlinear filtering, background correction and normalization techniques. BMC Bioinform. 2020, 21, 376. [Google Scholar] [CrossRef] [PubMed]
Table 1. Image data description.
Table 1. Image data description.
2DGE Image2DGE Image Size2DGE Pre-Processed
Image
2DGE Pre-Processed Image SizeWidth
(Pixels)
Height
(Pixels)
HMUA02_BEFORE564 KBPREPROC_HMUA02_BEFORE1.63 MB965757
HMUA02_AFTER530 KBPREPROC_HMUA02_AFTER1.54 MB1006803
HMUA03_BEFORE607 KBPREPROC_HMUA03_BEFORE1.94 MB972820
HMUA03_AFTER554 KBPREPROC_HMUA03_AFTER1.73 MB974798
HMUA04_BEFORE589 KBPREPROC_HMUA04_BEFORE1.65 MB1021842
HMUA04_AFTER607 KBPREPROC_HMUA04_AFTER1.88 MB1035866
HMUA05_BEFORE556 KBPREPROC_HMUA05_BEFORE1.90 MB1018810
HMUA05_AFTER558 KBPREPROC_HMUA05_AFTER1.65 MB1012788
HMUA010_BEFORE584 KBPREPROC_HMUA010_BEFORE2.21 MB1021838
HMUA010_AFTER585 KBPREPROC_HMUA010_AFTER2.01 MB1012828
HMUA011_BEFORE527 KBPREPROC_HMUA011_BEFORE1.70 MB985777
HMUA011_AFTER506 KBPREPROC_HMUA011_AFTER1.78 MB974781
HMUA012_BEFORE603 KBPREPROC_HMUA012_BEFORE2.14 MB971798
HMUA012_AFTER430 KBPREPROC_HMUA012_AFTER1.75 MB969775
HMUA013_BEFORE498 KBPREPROC_HMUA013_BEFORE1.73 MB1024803
HMUA013_AFTER561 KBPREPROC_HMUA013_AFTER1.85 MB1012820
HMUA015_BEFORE552 KBPREPROC_HMUA015_BEFORE1.67 MB1054857
HMUA015_AFTER576 KBPREPROC_HMUA015_AFTER1.88 MB1046870
HMUA017_BEFORE501 KBPREPROC_HMUA017_BEFORE1.91 MB983757
HMUA017_AFTER476 KBPREPROC_HMUA017_AFTER1.84 MB1102921
HMUA018_BEFORE432 KBPREPROC_HMUA018_BEFORE1.38 MB1111858
HMUA018_AFTER526 KBPREPROC_HMUA018_AFTER1.79 MB1036795
Table 2. Clinical Information.
Table 2. Clinical Information.
PatientAgeSexKaryotypeAML Subtype 1Blasts (i) 2Induction ProtocolBlasts (f) 3Response to Induction
HMUA_0235MNormalM386%PETHEMA1%CR 4
HMUA_0356MNormalM465%7 + 34.5%CR
HMUA_0467MNormalM414%FLUGA35%Resistant
HMUA_0542FT(8:21)M385%PETHEMA1.2%CR
HMUA_1018FMissingM1/M274%7 + 30.5%CR
HMUA_1165FCCR 5M442%7 + 311%PR 6
HMUA_1253FT(8:21)M220%7+31.3%CR
HMUA_1346MNormalM268%7+335.4%Resistant 7
HMUA_1550MCCRM1/M247%7+38.4%PR
HMUA_1767MT(8:21)M466%7+30.27%CR
HMUA_1818FNormalM1/M228%7+30.6%CR
1 According to the French–American–British (FAB) classification system. 2 (i): initial blast count, before induction therapy. 3 (f): final blast count, after induction therapy. 4 CR: complete remission, defined as <5% blasts in bone marrow after induction therapy. 5 CCR: complex chromosome rearrangement. 6 PR: partial response, defined as 5–20% blasts in bone marrow after induction therapy. 7 Resistance to therapy, defined by >20% blasts in bone marrow after induction therapy.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Urrea, J.E.; Restrepo, L.F.; Prada-Arismendy, J.; Castillo, E.; Goez, M.M.; Torres-Madronero, M.C.; Delgado-Trejos, E.; Röthlisberger, S. Dataset of Two-Dimensional Gel Electrophoresis Images of Acute Myeloid Leukemia Patients before and after Induction Therapy. Data 2021, 6, 20. https://doi.org/10.3390/data6020020

AMA Style

Urrea JE, Restrepo LF, Prada-Arismendy J, Castillo E, Goez MM, Torres-Madronero MC, Delgado-Trejos E, Röthlisberger S. Dataset of Two-Dimensional Gel Electrophoresis Images of Acute Myeloid Leukemia Patients before and after Induction Therapy. Data. 2021; 6(2):20. https://doi.org/10.3390/data6020020

Chicago/Turabian Style

Urrea, Juan E., Luisa F. Restrepo, Jeanette Prada-Arismendy, Erwing Castillo, Manuel M. Goez, Maria C. Torres-Madronero, Edilson Delgado-Trejos, and Sarah Röthlisberger. 2021. "Dataset of Two-Dimensional Gel Electrophoresis Images of Acute Myeloid Leukemia Patients before and after Induction Therapy" Data 6, no. 2: 20. https://doi.org/10.3390/data6020020

Article Metrics

Back to TopTop