Scientific Background

SIRIUS is setting new standards in molecular identification, enabling the elucidation of previously uncharted compounds. In this archive, you’ll find insightful articles about the science behind SIRIUS, offering a deeper understanding of its capabilities. For further information, explore the comprehensive SIRIUS Documentation, browse our company brochure, or visit the SIRIUS Wikipedia page. Or you can check our YouTube Playlist for tutorials and valuable learning resources.

SIRIUS Background

Why Training Data Matters: Exploring Coverage Bias in Small Molecule Machine Learning

Machine learning is transforming analytical chemistry by enabling predictions of small molecule properties, crucial for drug development and other applications. However, ensuring reliable results requires careful selection of training data to avoid biases that can mislead models. Here, we explain why it was important to prepare high-quality training datasets for the machine learning methods in SIRIUS, especially given that many widely used datasets fail to evenly represent the diversity of biomolecular structures.

Read More »