Machine learning-based classification to improve Gas Chromatography-Mass spectrometry data processing. - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Poster De Conférence Année : 2020

Machine learning-based classification to improve Gas Chromatography-Mass spectrometry data processing.

Emilie Sicard
  • Fonction : Auteur
  • PersonId : 1066183
Stéphanie Durand
Carole Migné
Mélanie Pétéra
Franck Giacomoni

Résumé

Introduction Lack of reliable peak detection impedes automated analysis of large-scale gas chromatography-mass spectrometry (GCMS) metabolomics datasets. Performance and outcome of individual peak-picking algorithms can differ widely depending on both algorithmic approach and parameters, as well as data acquisition method. Therefore, comparing and contrasting between algorithms is difficult. Technological and methodological innovation We present part of the work published in [1] and implemented in our workflow for improved peak picking (WiPP), focusing on the use of machine learning-based classification to optimize and improve different steps of the common GC-MS metabolomics data processing workflow. Our approach evaluates the quality of detected peaks using a machine learning based classification scheme based on seven peak classes. The quality information returned by the classifier for each individual peak is merged with results from different peak detection algorithms to create one final high-quality peak set for immediate down-stream analysis. Results and impact We benchmarked our workflow to standard compound mixes and a complex biological dataset, demonstrating that peak detection is improved. Furthermore, the approach can provide an impartial performance comparison of different peak picking algorithms. We also discuss the applicability of the approach to liquid chromatography-mass spectrometry data. References [1] Gloaguen, Y.; Borgsmüller, N. et al. WiPP: Workflow for Improved Peak Picking for Gas Chromatography-Mass Spectrometry (GC-MS) Data. Metabolites 2019, 9, 171.
Fichier principal
Vignette du fichier
2020_BookOfAbstract_Metabomeeting_Toulouse_1.pdf (9.35 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-02505901 , version 1 (02-06-2020)

Licence

Paternité

Identifiants

  • HAL Id : hal-02505901 , version 1
  • PRODINRA : 496662

Citer

Yoann Gloaguen, Nico Borgsmüller, Tobias Opialla, Eric Blanc, Emilie Sicard, et al.. Machine learning-based classification to improve Gas Chromatography-Mass spectrometry data processing.. European RFMF Metabomeeting 2020, Jan 2020, Toulouse, France. 263 p., 2020, Oral and poster abstracts European RFMF Metabomeeting 2020. ⟨hal-02505901⟩
462 Consultations
185 Téléchargements

Partager

Gmail Facebook X LinkedIn More