Machine Learning Provides Unprecedented View of Small Molecules

2022-12-19 Source: Aalto University

A new machine learning model will help scientists identify small molecules, with applications in medicine, drug discovery and environmental chemistry. Developed by researchers at Aalto University and the University of Luxembourg, the model was trained with data from dozens of laboratories to become one of the most accurate tools for identifying small molecules.

Metabolites are extremely small: the diameter of a human hair is 100,000 nanometers, while that of a glucose molecule is approximately one nanometer. (Source: Matti Ahlgren / Aalto University)

Thousands of different small molecules, known as metabolites, transport energy and transmit cellular information throughout the human body. Because they are so small, metabolites are difficult to distinguish from each other in a blood sample analysis — but identifying these molecules is important to understand how exercise, nutrition, alcohol use and metabolic disorders affect wellbeing.

Metabolites are normally identified by their retention time and mass, using liquid chromatography followed by mass spectrometry. Liquid chromatography first separates the metabolites by running the sample through a column; different molecules take different lengths of time, known as retention times, to pass through it. Mass spectrometry then refines the identification by sorting metabolites according to their mass. Researchers can also break metabolites into smaller pieces and analyse their composition, a technique called tandem mass spectrometry. “Even the best methods can’t identify more than 40 percent of the molecules in samples without making some additional assumptions about the candidate molecules,” says Professor Juho Rousu of Aalto University.
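The mass-matching step can be illustrated with a minimal Python sketch. It is not the researchers' software: the candidate list, the function names and the 10 ppm tolerance are assumptions chosen for illustration, and the measured value is treated as a neutral monoisotopic mass, whereas a real instrument reports the mass-to-charge ratio of ions.

```python
# Monoisotopic masses of the most abundant isotopes, in daltons.
ATOM_MASS = {"C": 12.0, "H": 1.007825, "O": 15.994915}

# Hypothetical candidate list: name -> elemental composition.
CANDIDATES = {
    "glucose": {"C": 6, "H": 12, "O": 6},
    "fructose": {"C": 6, "H": 12, "O": 6},
    "lactic acid": {"C": 3, "H": 6, "O": 3},
    "citric acid": {"C": 6, "H": 8, "O": 7},
}


def monoisotopic_mass(composition):
    """Sum the isotope masses for an elemental composition."""
    return sum(ATOM_MASS[element] * count for element, count in composition.items())


def match_by_mass(measured_mass, candidates, tol_ppm=10.0):
    """Return the names of candidates whose mass lies within tol_ppm of the measurement."""
    matches = []
    for name, composition in candidates.items():
        mass = monoisotopic_mass(composition)
        error_ppm = abs(mass - measured_mass) / mass * 1e6
        if error_ppm <= tol_ppm:
            matches.append(name)
    return sorted(matches)


# Glucose and fructose share the formula C6H12O6, so mass alone cannot separate them.
print(match_by_mass(180.0634, CANDIDATES))
```

In this toy list, a measured mass of about 180.0634 matches both glucose and fructose, which is why additional evidence such as retention behaviour or fragmentation data is needed to narrow down the candidates.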

Now, Rousu’s group has developed a novel machine learning model to identify small molecules. It was recently published in Nature Machine Intelligence. “This new open-source model offers the whole research community an enriched view of small molecules. It will help research into methods to identify metabolic disorders, such as diabetes, or even cancer,” says Rousu.

The new approach elegantly sidesteps one of the challenges facing conventional methods. Because the retention times of molecules vary from lab to lab, data cannot be compared between labs. Eric Bach, a doctoral student at Aalto, came up with an alternative during his PhD research that solved the problem. “Our research shows that while absolute retention times may vary, the retention order is stable across measurements by different labs,” Bach explains. “This allowed us to merge all publicly available data on metabolites for the first time ever and feed it into our machine learning model.”
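Bach's observation can be illustrated with a small Python sketch. The retention times below are made up for illustration, and the function names are hypothetical; this is a toy comparison of elution order, not the published model.

```python
from itertools import combinations

# Made-up retention times (minutes) for the same four molecules in two labs.
# The absolute values differ, as they would with different columns and settings.
lab_a = {"mol_1": 1.2, "mol_2": 3.5, "mol_3": 4.1, "mol_4": 7.8}
lab_b = {"mol_1": 2.0, "mol_2": 6.3, "mol_3": 6.9, "mol_4": 12.4}


def retention_order(times):
    """Return molecule names sorted from earliest to latest retention time."""
    return sorted(times, key=times.get)


def pairwise_order_agreement(times_x, times_y):
    """Fraction of molecule pairs that elute in the same order in both data sets."""
    shared = sorted(set(times_x) & set(times_y))
    pairs = list(combinations(shared, 2))
    if not pairs:
        raise ValueError("need at least two molecules measured in both data sets")
    agreeing = sum(
        (times_x[a] < times_x[b]) == (times_y[a] < times_y[b])
        for a, b in pairs
    )
    return agreeing / len(pairs)


print(retention_order(lab_a) == retention_order(lab_b))
print(pairwise_order_agreement(lab_a, lab_b))
```

Comparing which molecule of a pair elutes first, rather than the raw times, is what makes measurements from differently configured labs comparable in this sketch.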

With the incorporation of data from dozens of laboratories around the globe, the machine learning model is accurate enough to distinguish between mirror image molecules, known as stereochemical variants. So far, identification tools have not been able to tell stereochemical variants apart, and the new capability is expected to open up new avenues in drug design and other fields.

“The fact that using stereochemistry improved the identification performance is a revelation for all developers of metabolite identification methods,” says Emma Schymanski, associate professor at the Luxembourg Centre for Systems Biomedicine (LCSB) of the University of Luxembourg. “This method could also be used to help identify and trace micropollutants in the environment or characterise new metabolites in plant cells.”

References: “Joint structural annotation of small molecules using liquid chromatography retention order and tandem mass spectrometry data”, Nature Machine Intelligence; DOI: 10.1038/s42256-022-00577-2
