Abstract
Despite being information rich, the vast majority of untargeted mass spectrometry data are underutilized; most analytes are not used for downstream interpretation or reanalysis after publication. The inability to dive into these rich raw mass spectrometry datasets is due to the limited flexibility and scalability of existing software tools. Here we introduce a new language, the Mass Spectrometry Query Language (MassQL), and an accompanying software ecosystem that addresses these issues by enabling the community to directly query mass spectrometry data with an expressive set of user-defined mass spectrometry patterns. Illustrated by real-world examples, MassQL provides a data-driven definition of chemical diversity by enabling the reanalysis of all public untargeted metabolomics data, empowering scientists across many disciplines to make new discoveries. MassQL has been widely implemented in multiple open-source and commercial mass spectrometry analysis tools, which enhances the ability, interoperability and reproducibility of mining of mass spectrometry data for the research community.
Original language | English |
---|---|
Article number | 113 |
Number of pages | 8 |
Journal | Nature Methods |
Early online date | 12 May 2025 |
DOIs | |
State | Published - Jun 2025 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive licence to Springer Nature America, Inc. 2025.
ASJC Scopus subject areas
- Biotechnology
- Biochemistry
- Molecular Biology
- Cell Biology