Designing molecules is one of chemistry’s most complex challenges. From life-saving drugs to advanced materials, each compound requires a precise sequence of reactions. Planning these steps demands both technical knowledge and strategic insight, making it a task that often relies on years of experience.
A concept illustration of the development of Synthegy.
(Source: Ella Maru Studio)
Two problems plague much of modern chemistry. The first is retrosynthesis: Chemists start from a target molecule and work backward to identify simpler building blocks and viable reaction pathways. Retrosynthesis involves countless decisions, from choosing starting materials to determining when to form rings or protect sensitive functional groups. While computers can explore vast “chemical spaces", they often struggle to capture the strategic reasoning used by human experts.
The second problem is reaction mechanisms. These describe how chemical reactions unfold step by step through movements of electrons. Mechanistic insight helps scientists predict new reactions, improve efficiency, and reduce costly trial and error. Existing computational methods can generate many possible pathways, but often lack the chemical intuition needed to identify the most plausible ones.
A study led by Philippe Schwaller at EPFL has now developed a new approach that uses large language models (LLMs) as reasoning engines for chemistry. Instead of generating chemical structures directly, the models evaluate and guide traditional computational tools.
The framework, called Synthegy, combines established search algorithms with artificial intelligence capable of interpreting chemical strategies expressed in natural language.
“When making tools for chemists, the user interface matters a lot, and previous tools relied on cumbersome filters and rules,” says Andres M Bran, the first author of the Synthegy paper published in Matter. “With Synthegy, we're giving chemists the power to just talk, allowing them to iterate much faster and navigate more complex synthetic ideas.”
Synthegy on Retrosynthesis
Synthegy begins with a target molecule and a plain-language instruction from the user. For example, a chemist may request early formation of a specific ring or ask to avoid unnecessary protecting groups. Traditional retrosynthesis software then generates numerous potential synthetic routes. Each route is translated into text and analyzed by a language model.
Synthegy evaluates how well each pathway aligns with the user’s goals, assigns scores, and explains its reasoning. This process allows researchers to rank and filter candidate routes efficiently. By steering computational searches with natural-language queries, chemists can focus on strategies that best meet their objectives.
Synthegy addresses reaction mechanisms in a similar way: It breaks reactions into elementary electron movements and explores multiple possibilities. The LLM assesses each step, guiding the search toward chemically plausible mechanisms. Additional information, such as reaction conditions or expert hypotheses, can also be incorporated as text.
In synthesis planning, Synthgey successfully identified routes that match complex strategic requests. In a double-blind expert study, 36 chemists provided 368 valid evaluations, and their judgments aligned with the system’s assessments 71.2% of the time on average. The framework can detect unnecessary protecting steps, assess reaction feasibility, and prioritize efficient pathways.
Synthegy shows that LLMs can analyze chemistry across multiple levels. They can interpret functional groups, assess individual reactions, and evaluate complete synthetic pathways. Larger and more advanced models demonstrate the strongest performance, while smaller models show limited capability.
The work redefines how AI can support chemistry. By positioning LLMs as evaluators rather than generators, the Synthegy approach allows chemists to express their goals in plain language and receive strategically relevant solutions. The technology could accelerate drug discovery, improve reaction design, and make advanced computational tools more accessible to researchers.
“The connection between synthesis planning and mechanisms is very exciting: we usually use mechanisms to discover new reactions that enable us to synthesize new molecules,” says Andres M Bran. “Our work is bridging that gap computationally through a unified natural language interface.”
Original Article: Chemical reasoning in LLMs unlocks strategy-aware synthesis planning and reaction mechanism elucidation, Matter; DOI:10.1016/j.matt.2026.102812
Date: 08.12.2025
Naturally, we always handle your personal data responsibly. Any personal data we receive from you is processed in accordance with applicable data protection legislation. For detailed information please see our privacy policy.
Consent to the use of data for promotional purposes
I hereby consent to Vogel Communications Group GmbH & Co. KG, Max-Planck-Str. 7-9, 97082 Würzburg including any affiliated companies according to §§ 15 et seq. AktG (hereafter: Vogel Communications Group) using my e-mail address to send editorial newsletters. A list of all affiliated companies can be found here
Newsletter content may include all products and services of any companies mentioned above, including for example specialist journals and books, events and fairs as well as event-related products and services, print and digital media offers and services such as additional (editorial) newsletters, raffles, lead campaigns, market research both online and offline, specialist webportals and e-learning offers. In case my personal telephone number has also been collected, it may be used for offers of aforementioned products, for services of the companies mentioned above, and market research purposes.
Additionally, my consent also includes the processing of my email address and telephone number for data matching for marketing purposes with select advertising partners such as LinkedIn, Google, and Meta. For this, Vogel Communications Group may transmit said data in hashed form to the advertising partners who then use said data to determine whether I am also a member of the mentioned advertising partner portals. Vogel Communications Group uses this feature for the purposes of re-targeting (up-selling, cross-selling, and customer loyalty), generating so-called look-alike audiences for acquisition of new customers, and as basis for exclusion for on-going advertising campaigns. Further information can be found in section “data matching for marketing purposes”.
In case I access protected data on Internet portals of Vogel Communications Group including any affiliated companies according to §§ 15 et seq. AktG, I need to provide further data in order to register for the access to such content. In return for this free access to editorial content, my data may be used in accordance with this consent for the purposes stated here. This does not apply to data matching for marketing purposes.
Right of revocation
I understand that I can revoke my consent at will. My revocation does not change the lawfulness of data processing that was conducted based on my consent leading up to my revocation. One option to declare my revocation is to use the contact form found at https://contact.vogel.de. In case I no longer wish to receive certain newsletters, I have subscribed to, I can also click on the unsubscribe link included at the end of a newsletter. Further information regarding my right of revocation and the implementation of it as well as the consequences of my revocation can be found in the data protection declaration, section editorial newsletter.