BLOG

Published 18 June 2025 by Andrei Mihai

How Is AI Changing the Chemistry Game?

AI tools are making their way into laboratories. Photo/Credit: phuttaphat tipsana/iStockphoto

The 2024 Nobel Prize in Chemistry was awarded to David Baker, Demis Hassabis, and John M. Jumper for their groundbreaking contributions to understanding and designing protein structures. It was, to some, a rather surprising decision. For instance, Hassabis isn’t even a chemist; he is an artificial intelligence researcher. David Baker’s work also lies in computational chemistry. With John M. Jumper attending the 2025 Chemistry Meeting, this topic will be highlighted in his Lecture and by one of the programme-shaping Panel Discussions (1 July, 15:00 CEST, also available via livestream). The 2024 Nobel Laureate will be joined by the Laureates Joachim Frank and Michael Levitt, Lindau Alumna Danielle Belgrave, and Anima Anandkumar, Caltech.

For anyone paying attention, however, this was hardly a surprise. A new era is dawning in chemistry, one where chemistry isn’t shaped by classic reactions but rather by GPUs and algorithms. This transformation, recognized by the International Union of Pure and Applied Chemistry (IUPAC) as one of the “Top 10 Emerging Technologies in Chemistry 2023,” is ushering in an era of unprecedented speed, efficiency, and discovery.

The Algorithmic Chemist

Until not that long ago, chemistry was a quintessential hands-on science. However, machine learning has barged into the lab, changing everything. By now, we’ve all heard about AlphaFold and its remarkable potential, but AlphaFold is just one of several promising innovations mixing AI and chemistry. Just take a look at DreaMS, a self-supervised transformer model recently presented in a new study. DreaMS learns molecular representations directly from millions of raw, unannotated mass spectra. By training in a self-supervised way, it creates molecular representations that can be fine-tuned for tasks like identifying unknown compounds, predicting chemical properties and constructing large-scale molecular networks.

Researchers had the ability to do this before, but deep learning tools (ranging from Graph Neural Networks to algorithms using reinforcement learning) are helping them do more, faster. What used to take months can now be done in a few hours. Machine learning models can, for instance, assess molecular characteristics and help identify potential drug candidates faster than traditional methods, which saves considerable time and reduces costs associated with synthesizing and testing physical compounds.

This expansion in the scale of exploration is not just finding materials faster, but it is enabling the systematic exploration of previously uncharted swaths of the vast chemical space. In materials science, for instance, Google’s GNoME used Graph Neural Networks to discover 380,000 new crystal structures. That is more than the total known in human history up to that point. In virtual screening, AI-powered tools can rapidly analyze vast chemical databases to predict compounds most likely to exhibit desirable biological properties. There is even a paradigm shift in drug design called “De novo drug design” where generative models enable the creation of novel molecular structures with desired pharmacological properties from scratch.

Synthetic Algorithms, Real Results

Meanwhile, in analytical chemistry, AI doesn’t just parse data—it perfects it. It refines noisy NMR, IR, and mass spectrometry data, predicts structures, and enables real-time monitoring of industrial chemical processes. Smart factories are emerging, where AI detects impurities, adjusts parameters, and even halts production if something goes wrong.

But AI’s ambitions don’t stop at planning reactions. It’s also mastering the “art” of catalysis. AI-designed catalysts are already outperforming some traditional approaches, speeding up reactions, reducing waste, and enabling cleaner industrial processes. One model designed hydrogen fuel cell catalysts without relying on expensive platinum. Another optimized CO2 reduction pathways, making sustainable chemistry more viable than ever.

In another sustainability approach, researchers from Microsoft and Pacific Northwest National Laboratory identified 23 viable battery material candidates with reduced lithium content in just 80 hours, a process that would be unfeasible for humans to perform manually.

Molecule on black background
C60 molecule with isosurface of ground-state electron density as calculated with density functional theory. Credit: CC BY 3.0 (Wiki Commons).

These approaches are different from “traditional” computational chemistry, which uses powerful computers and accurate physics-based simulations. The problem with this approach is computational cost, which increases dramatically with larger molecules and complex systems. This makes it extremely challenging to model real-world complexity.

A machine learning approach to reduce this computational cost is called Machine Learned Interatomic Potentials (MLIPs). This approach can offer results with thousands of times less computation, unlocking the ability to simulate large atomic systems that have been out of reach. This massive speed up and reduced computational requirement mean that high-fidelity atomistic simulations, once confined to supercomputing centers, are now accessible to a much broader range of researchers and institutions. This democratization accelerates research across the board.

Algorithms Are Not Magic Wands

Machine learning algorithms are the backbone of this seismic shift in chemistry. But algorithms alone are not enough. Instead, it’s the convergence of these algorithms with massive datasets and powerful computers that makes it all possible. Decades of experimental results, reaction pathways, and molecular structures have finally become machine-readable. The painstaking work by chemists worldwide can now be used to train algorithms, while GPUs and cloud platforms support the necessary algorithms.

Essentially, this convergence offers a new way of approaching chemistry. Traditional ways of finding new molecule candidates, for instance, were slower and required a greater number of researchers with great expertise. AI models can evaluate millions of potential compounds in hours, ranking them by predicted effectiveness, safety, and novelty. They automate the early trial-and-error stage, flagging promising leads before a single chemical is synthesized. What once required years of painstaking lab work can now happen in a weekend—guided by probability, not just expertise.

And these models are getting paired with robotic labs. AI doesn’t just think; it acts (when supported by a robotic body). Systems like MIT’s Chemputer or the University of Amsterdam’s RoboChem can plan and execute multi-step syntheses autonomously. We’re watching the rise of the self-driving chemistry lab.

However, this is not without challenges. As with all AI applications, we are faced with the “black box” problem. Many advanced AI techniques, particularly deep learning, operate as “black boxes,” making it difficult to understand how they arrive at their predictions. This lack of interpretability is a significant barrier to scientific understanding, trust, and regulatory acceptance.

Another issue is data. Machine learning algorithms can only be as good as the data they use. But this data has not been standardized and is often biased. For decades, researchers have rarely published negative results, focusing only on successful attempts. This makes it difficult for AI (and humans) to know what doesn’t work. Incorporating “negative data” is crucial for improving prediction accuracy; other biases can also distort predictions. Data scarcity and incomprehensibility also remain challenges in areas like drug discovery.

Managing AI in chemistry requires a delicate balance between leveraging AI’s immense potential with its power demands and opaqueness. For such technologies to be put to use optimally, we need ensuring rigorous human oversight, ethical deployment, and transparency to safeguard public well-being.

However, ongoing advancements in data infrastructure, multimodal AI, and the emerging field of quantum machine learning promise a future where AI-driven chemistry will continue to push the boundaries of innovation, addressing global challenges from healthcare to sustainability, and even pushing forth fundamental discoveries.

Plenty of topics to be debated, don’t miss the livestream of the referring Panel Discussion on 1 July 2025!

AI in the #LINO25 Programme

Andrei Mihai

Andrei is a science communicator and a PhD candidate in geophysics. He co-founded ZME Science, where he tries to make science accessible and interesting to everyone and has written over 2,000 pieces on various topics – though he generally prefers writing about physics and the environment. Andrei tries to blend two of the things he loves (science and good stories) to make the world a better place – one article at a time.

Latest Posts