亚色影库app

An open access publication of the 亚色影库app & Sciences
Winter/Spring 2026

Geometry-Informed AI for Scientific Discovery

Author
Melanie Weber
View PDF

Melanie Weber is an Assistant Professor of Applied Mathematics and of Computer Science at Harvard University, where she leads the Geometric Machine Learning Group. She is also a 2025 Early Career Fellow with the Schmidt Sciences AI2050 Project and a 2024 Research Fellow in Mathematics with the Alfred P. Sloan Foundation. Her research studies geometric structure in data and models and how to leverage such information for the design of new, efficient machine-learning methods with provable guarantees.

Artificial intelligence offers tremendous opportunities to accelerate scientific discovery. Already, AI models have driven major breakthroughs in areas such as protein folding and weather forecasting. Going forward, AI may hold the keys to the development of treatments for presently incurable diseases, to the design of innovative new materials, and to resolving decades-long open problems in mathematics. Currently, the model landscape is dominated by transformers; however, their extreme data and compute requirements and limited interpretability present fundamental challenges for the road ahead. Progress toward the next generation of AI for science will require a shift toward smaller, structured models that are efficient, interpretable, and capable of acting as genuine scientific collaborators: guiding experiments, revealing hidden patterns in data, and helping to chart new scientific frontiers.

One possible avenue for developing such next-generation models, which I am pursuing in my research, is geometry-informed AI.1 This is motivated by the observation that encoding data geometry as inductive bias into models can mitigate high resource demands by reducing the model鈥檚 data and computing needs, as demonstrated by recent work from my group and others.2 To illustrate this, consider the example of image classification, in which labels are typically assigned based on the objects shown in the image, irrespective of their location. One can encode such structure into a model by only considering model architectures whose outputs are agnostic to shifts of objects within the image. Convolutional neural networks, which achieved a major breakthrough in image classification in the 2010s, are an early example of a model architecture that has this property.3 More broadly, models can encode a variety of geometric structures, such as symmetries arising from fundamental laws of physics or low-dimensional structure reflecting inherent correlations in high-dimensional data.

Why does this matter for the sciences? Take state-of-the-art weather prediction models, which require training on millions of examples to provide accurate forecasts. By encoding known geometric structure, we can prevent models from expending resources on implausible scenarios. This is accomplished by constraining model outputs to remain consistent with established domain knowledge, such as physical laws that enforce specific symmetries.

Geometric models are already driving progress across scientific domains: Equivariant models accelerate materials discovery and physics-informed architectures achieve high-resolution weather forecasting.4 In my own interdisciplinary work, we have developed geometric approaches for representing large-scale single-cell data in a way that makes it easier for researchers to trace how cells mature and branch into different types in developmental processes.5 For astronomy, we developed a method that computes interpretable representations of galaxies that preserve information about their surrounding cosmic environment, uncovering patterns tied to galaxy mass and star formation.6 In structural biology, we introduced a geometry-aware framework for heterogeneous cryo-EM reconstruction that predicts atomic backbone conformations, allowing researchers to recover protein structural variability from single-particle images using geometric priors.7 

State-of-the-art geometric models can already rival transformer-based architectures in specialized tasks, yet they represent only an early glimpse of what geometry-informed AI could enable. To move beyond specialized solutions and toward more versatile AI assistants, both mathematics and engineering advances are needed. Recent theoretical work by my group and others has begun to clarify when and why geometric priors yield provable advantages, but deeper mathematical foundations, and their incorporation into broadly applicable foundation models, will be key to accelerating progress in geometry-informed AI.8 Equally important is the development of architectures that encode richer and more heterogeneous geometric structure, along with the software and hardware infrastructure needed to scale them efficiently.9 Together, these efforts could enable geometric models capable of driving major advances in AI-assisted scientific discovery.

While I have emphasized that mathematical insights can advance artificial intelligence, the converse is increasingly true as well. AI-assisted reasoning tools are being introduced into mathematical research, with early successes showing that they are particularly effective for 鈥渘eedle in a haystack鈥 problems, where a correct construction, counterexample, or identity exists but is extremely difficult to find with classical approaches.10 Since mathematics is fundamentally concerned with identifying and exploiting structure, I expect that structured models, such as geometry-informed AI, will play an important role going forward. Such approaches could mitigate some of the growing pains of current methods: They rely heavily on large language models that require time-consuming human formalization of mathematical data as text or draw on unreliable informal sources. Their outputs are also difficult for humans to interpret, limiting the extraction of proof strategies and mathematical intuition. Encoding mathematical principles directly into models could enable them to reason in ways that more closely align with human intuition, strengthening human-AI collaboration in research mathematics. In my own research, we have already seen early successes with geometric models in combinatorics.11

Looking ahead, I envision a scientific ecosystem in which AI is not merely a tool but a collaborator. While current models primarily assist with data analysis, future models could help researchers interrogate existing literature, act as sounding boards for hypothesis generation, and collaborate in the conception of new methodologies. Such human-AI collaborations have the potential to accelerate progress toward answering the great scientific questions of our time, and those yet to emerge.

Endnotes

  • 1

    Melanie Weber, 鈥,鈥 AI Magazine, January 10, 2025.

  • 2

    Alberto Bietti, Luca Venturi, and Joan Bruna, 鈥淥n the Sample Complexity of Learning Under Invariance and Geometric Stability,鈥 in NeurIPS 鈥21: Proceedings of the 35th International Conference on Neural Information Processing Systems, ed. Marc鈥橝urelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, et al. (Curran Associates, Inc., 2021); Song Mei, Theodor Misiakiewicz, and Andrea Montanari, 鈥淟earning with Invariances in Random Features and Kernel Models,鈥 Proceedings of Machine Learning Research 134 (2021); Bobak Kiani, Thien Le, Hannah Lawrence, et al., 鈥,鈥 paper presented at the Twelfth International Conference on Learning Representations, Vienna, Austria, May 7, 2024; Bobak T. Kiani, Jason Wang, and Melanie Weber, 鈥淥n the Hardness of Learning Neural Networks Under the Manifold Hypothesis,鈥 in NeurIPS 鈥24: Proceedings of the 38th International Conference on Neural Information Processing Systems, ed. Amir Globerson, Lester Mackey, Danielle Belgrave, et al. (Curran Associates, Inc., 2024); and Johann Brehmer, S枚nke Behrends, Pim De Haan, and Taco Cohen, 鈥溾 Transactions on Machine Learning Research (2025). 

  • 3

    Yann LeCun, L茅on Bottou, Yoshua Bengio, and Patrick Haffner, 鈥淕radient-Based Learning Applied to Document Recognition,鈥 Proceedings of the IEEE 86 (11) (1998).

  • 4

    Simon Batzner, Albert Musaelian, Lixin Sun, et al., 鈥淓(3)-Equivariant Graph Neural Networks for Data-Efficient and Accurate Interatomic Potentials,鈥 Nature Communications 13 (2022); and Jaideep Pathak, Shashank Subramanian, Peter Harrington, et al., 鈥,鈥 arXiv (2022).

  • 5

    Nithya Bhasker, Hattie Chung, Louis Boucherie, et al., 鈥,鈥 bioRxiv (2025).

  • 6

    Ana Sofia Uzoy, Claire Lamman, and Melanie Weber, 鈥,鈥 paper presented at the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025), San Diego, California, December 1, 2025.

  • 7

    Jonathan Krook, Axel Janson, Joakim And茅n, et al., 鈥,鈥 paper presented at the IEEE International Conference on Image Processing, Tampere, Finland, September 13鈥17, 2026.

  • 8

    Bietti, Venturi, and Bruna, 鈥淥n the Sample Complexity of Learning Under Invariance and Geometric Stability鈥; Mei, Misiakiewicz, and Montanari, 鈥淟earning with Invariances in Random Features and Kernel Models鈥; Kiani, Le, Lawrence, et al., 鈥淥n the Hardness of Learning Under Symmetries鈥; and Kiani, Wang, and Weber, 鈥淥n the Hardness of Learning Neural Networks Under the Manifold Hypothesis.鈥

  • 9

    Neil He, Jiahong Liu, Buze Zhang, et al., 鈥,鈥 paper presented at Learning on Graphs Conference 2025, Arizona State University, December 10, 2025.

  • 10

    Fran莽ois Charton, Jordan S. Ellenberg, Adam Zsolt Wagner, and Geordie Williamson, 鈥,鈥 arXiv (2024); Alex Davies, Petar Veli膷kovi膰, Lars Buesing, et al., 鈥淎dvancing Mathematics by Guiding Human Intuition with AI,鈥 Nature 600 (7887) (2021); and Alexander Novikov, Ng芒n V农, Marvin Eisenberger, et al., 鈥,鈥 arXiv (2025).

  • 11

    Knut Vanderbush and Melanie Weber, 鈥,鈥 arXiv (2026).