CIBION   24492
CENTRO DE INVESTIGACIONES EN BIONANOCIENCIAS "ELIZABETH JARES ERIJMAN"
Unidad Ejecutora - UE
congresos y reuniones científicas
Título:
IMPROVING IDENTIFICATION OF COMPOUNDS IN METABOLOMIC STUDIES THROUGH CORRELATION AND STATISTICS
Autor/es:
HOIJEMBERG, PABLO ARIEL; PELCZER, ISTVÁN
Lugar:
Rosario
Reunión:
Simposio; Second Latin American Metabolic Profiling Symposium (LAMPS); 2016
Institución organizadora:
IBR (UNR-CONICET)
Resumen:
In an untargeted metabolomic study the search for biomarker molecules serves to answer many questions, for which there is a need to find out the identity of these compounds of interest. Several dozen metabolites are normally detected by NMR analysis of a biofluid in measurable quantities, where the spectra can show about a few hundred peaks.The identification of compounds is normally done with the aid of commercial software packages containing their own databases, by literature search, and/or by searches in public databases by lists of chemical shifts. The input for the database query can be improved using STOCSY, Statistical TOtal Correlation SpectroscopY [1], which is attainable given the amount of data collected for the multivariate data analysis and finds highly correlated peaks that would belong to a same molecule.Despite its usefulness, the STOCSY analysis is tedious and cumbersome, normally obtaining a trace by selecting a driver peak to find the peaks highly correlated to it, and it is performed in a trace-by-trace fashion over all peaks of interest. Here we present a methodology developed to reduce the analysis time by increasing information recovery applying further statistical analysis on the ?information redundant? STOCSY correlation matrix (all peaks included at once), yielding lists of grouped peaks for database queries that produce more reliable hits during the identification step.The methodology adds an automated step after the correlation matrix calculation to group traces from different driver peaks based on their similarity. As the STOCSY tool suffers from overlapped peaks (that abound in some regions of the 1D 1H spectrum of biofluids), good alternatives are its use on 1D 13C spectra, 1D projections from 1H 2D J-Resolved spectra, and small size data matrices like in a ?spectrum-to-spreadsheet? procedure. Examples of the application of this methodology on biological samples and synthetic mixtures will be addressed.REFERENCES1.Cloarec, O. et al. Anal. Chem., 2005, 77, 128