Skip to Main content Skip to Navigation
Journal articles

Towards omics-based predictions of planktonic functional composition from environmental data

Abstract : Marine microbes play a crucial role in climate regulation, biogeochemical cycles, and trophic networks. Unprecedented amounts of data on planktonic communities were recently collected, sparking a need for innovative data-driven methodologies to quantify and predict their ecosystemic functions. We reanalyze 885 marine metagenome-assembled genomes through a network-based approach and detect 233,756 protein functional clusters, from which 15% are functionally unannotated. We investigate all clusters' distributions across the global ocean through machine learning, identifying biogeographical provinces as the best predictors of protein functional clusters' abundance. The abundances of 14,585 clusters are predictable from the environmental context, including 1347 functionally unannotated clusters. We analyze the biogeography of these 14,585 clusters, identifying the Mediterranean Sea as an outlier in terms of protein functional clusters composition. Applicable to any set of sequences, our approach constitutes a step towards quantitative predictions of functional composition from the environmental context.
Complete list of metadata
Contributor : Gestionnaire Hal-Su <>
Submitted on : Monday, July 19, 2021 - 11:38:12 AM
Last modification on : Friday, July 23, 2021 - 8:19:52 AM


Publication funded by an institution



Emile Faure, Sakina-Dorothée Ayata, Lucie Bittner. Towards omics-based predictions of planktonic functional composition from environmental data. Nature Communications, Nature Publishing Group, 2021, 12, pp.4361. ⟨10.1038/s41467-021-24547-1⟩. ⟨hal-03290250⟩



Record views


Files downloads