A model-based approach to gene clustering with missing observation reconstruction in a Markov random field framework - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue Journal of Computational Biology Année : 2009

A model-based approach to gene clustering with missing observation reconstruction in a Markov random field framework

Résumé

The different measurement techniques that interrogate biological systems provide means for monitoring the behavior of virtually all cell components at different scales and from complementary angles. However, data generated in these experiments are difficult to interpret. A first difficulty arises from high-dimensionality and inherent noise of such data. Organizing them into meaningful groups is then highly desirable to improve our knowledge of biological mechanisms. A more accurate picture can be obtained when accounting for dependencies between components (e.g., genes) under study. A second difficulty arises from the fact that biological experiments often produce missing values. When it is not ignored, the latter issue has been solved by imputing the expression matrix prior to applying traditional analysis methods. Although helpful, this practice can lead to unsound results. We propose in this paper a statistical methodology that integrates individual dependencies in a missing data framework. More explicitly, we present a clustering algorithm dealing with incomplete data in a Hidden Markov Random Field context. This tackles the missing value issue in a probabilistic framework and still allows us to reconstruct missing observations a posteriori without imposing any pre-processing of the data. Experiments on synthetic data validate the gain in using our method, and analysis of real biological data shows its potential to extract biological knowledge.
Fichier principal
Vignette du fichier
A Model-Based Approach to Gene Clustering with Missing Observation Reconstruction in a Markov Random Field Framework_1.pdf (392.63 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02665301 , version 1 (31-05-2020)

Identifiants

Citer

Juliette Blanchet, Matthieu Vignes. A model-based approach to gene clustering with missing observation reconstruction in a Markov random field framework. Journal of Computational Biology, 2009, 16 (3), pp.475-486. ⟨10.1089/cmb.2008.0078⟩. ⟨hal-02665301⟩
22 Consultations
49 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More