Discoveries

See the impact of SIRIUS firsthand! This archive showcases exciting discoveries and major breakthroughs made by research groups worldwide. Explore how the power of SIRIUS is elevating small molecule data analysis across diverse fields, including drug discovery, human health, diagnostics, food industry, monitoring, microbiomics, environmental toxicology, and materials science. For an extensive list of publications by independent research groups using the SIRIUS software framework, click here.

SIRIUS Background

Why Training Data Matters: Exploring Coverage Bias in Small Molecule Machine Learning

Machine learning is transforming analytical chemistry by enabling predictions of small molecule properties, crucial for drug development and other applications. However, ensuring reliable results requires careful selection of training data to avoid biases that can mislead models. Here, we explain why it was important to prepare high-quality training datasets for the machine learning methods in SIRIUS, especially given that many widely used datasets fail to evenly represent the diversity of biomolecular structures.

Read More »