A roadmap for an atomistic machine learning software ecosystem
        Event details
| Date | 19.01.2026 › 21.01.2026 | 
| Location | |
| Category | Conferences - Seminars | 
| Event Language | English | 
You can apply to participate and find all the relevant information (speakers, abstracts, program,...) on the event website: https://www.cecam.org/workshop-details/a-roadmap-for-an-atomistic-machine-learning-software-ecosystem-1376.
Registration is required to attend the full event, take part in the social activities and present a poster at the poster session (if any).  However, the EPFL community is welcome to attend specific lectures without registration if the topic is of interest to their research. Do not hesitate to contact the CECAM Event Manager if you have any question.
Description
The learning of accurate and efficient atomic interaction potentials from quantum mechanical calculations has been one of the first applications of machine learning in the physical sciences [1], and it is arguably one of the most successful. In the last 15 years, the development of machine-learned interaction potentials (MLIP) has evolved significantly and many different models have been proposed and applied to a broad set of systems in material science, chemistry, and biophysics [2-4], expanding also beyond the prediction of interatomic potentials to include any quantity accessible to electronic-structure calculations. The mathematical and conceptual framework underlying ML architectures for atomistic simulations is now relatively well understood [5-7] and different models share many common features. For instance, equivariance is a common theme as well as the use of message passing mechanisms between local atomic environments - even though schemes to incorporate long-range physics [8,9] and unrestricted models that relax some of the symmetry constraints are also actively developed [10].
Despite the many common ideas, the software ecosystem is currently very fragmented: each model usually comes with its own monolithic implementation, and there is little shared infrastructure, even though lately some efforts have started to appear that aim to provide basic functionalities in a more general setting (e.g. e3nn [11], dscribe [12], sphericart [13]). This is due in part to the very fast development of the field, in part to the fact that rapid prototyping is possible using general-purpose ML libraries such as Pytorch and Jax, in part to the difficulty in producing efficient domain-specific libraries that can fully exploit accelerated hardware platforms. 
Another open question is how to provide easy-to-use interfaces between the ML core and different types of traditional modelling software. Most machine-learning interatomic potentials provide a library interface to call them from a molecular dynamics software such as LAMMPS or OpenMM, but this requires substantial effort for each target code. When one considers the prediction of properties such as electronic densities or Hamiltonians that involve a interfacing with quantum chemistry software, the situation is even more problematic, as one has to handle quantities with a considerably more complicated structure, and performing a machine learning task is more intimately coupled to the technical choices in the host code, such as the choice of basis set used to discretize the electronic wavefunction.[14-16].
References
[1] J. Behler, M. Parrinello, Phys. Rev. Lett., 98, 146401 (2007)
[2] J. Smith, O. Isayev, A. Roitberg, Chem. Sci., 8, 3192-3203 (2017)
[3] Y. Zhou, W. Zhang, E. Ma, V. Deringer, Nat. Electron., 6, 746-754 (2023)
[4] J. Abramson, J. Adler, J. Dunger, R. Evans, T. Green, A. Pritzel, O. Ronneberger, L. Willmore, A. Ballard, J. Bambrick, S. Bodenstein, D. Evans, C. Hung, M. O’Neill, D. Reiman, K. Tunyasuvunakool, Z. Wu, A. Žemgulytė, E. Arvaniti, C. Beattie, O. Bertolli, A. Bridgland, A. Cherepanov, M. Congreve, A. Cowen-Rivers, A. Cowie, M. Figurnov, F. Fuchs, H. Gladman, R. Jain, Y. Khan, C. Low, K. Perlin, A. Potapenko, P. Savy, S. Singh, A. Stecula, A. Thillaisundaram, C. Tong, S. Yakneen, E. Zhong, M. Zielinski, A. Žídek, V. Bapst, P. Kohli, M. Jaderberg, D. Hassabis, J. Jumper, Nature, 630, 493-500 (2024)
[5] F. Musil, A. Grisafi, A. Bartók, C. Ortner, G. Csányi, M. Ceriotti, Chem. Rev., 121, 9759-9815 (2021)
[6] J. Nigam, S. Pozdnyakov, G. Fraux, M. Ceriotti, The Journal of Chemical Physics, 156, (2022)
[7] I. Batatia, S. Batzner, D. P. Kovács, A. Musaelian, G. N. C. Simm, R. Drautz, C. Ortner, B. Kozinsky, and G. Csányi, "The design space of E(3)-equivariant atom-centered interatomic potentials," arxiv:2205.06643 (2022)
[8] T. Ko, J. Finkler, S. Goedecker, J. Behler, Nat. Commun., 12, 398 (2021)
[9] K. Huguenin-Dumittan, P. Loche, N. Haoran, M. Ceriotti, J. Phys. Chem. Lett., 14, 9612-9618 (2023)
[10] S. Pozdnyakov and M. Ceriotti, "Smooth, exact rotational symmetrization for deep learning on point clouds," in Advances in Neural Information Processing Systems (Curran Associates, Inc., 2023), 36, pp. 79469–79501.
[11] M. Geiger and T. Smidt, "e3nn: Euclidean neural networks," arxiv:2207.09453 (2022).
[12] L. Himanen, M. Jäger, E. Morooka, F. Federici Canova, Y. Ranawat, D. Gao, P. Rinke, A. Foster, Computer Physics Communications, 247, 106949 (2020)
[13] F. Bigi, G. Fraux, N. Browning, M. Ceriotti, The Journal of Chemical Physics, 159, (2023)
[14] J. VandeVondele, M. Krack, F. Mohamed, M. Parrinello, T. Chassaing, J. Hutter, Computer Physics Communications, 167, 103-128 (2005)
[15] P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, G. Chiarotti, M. Cococcioni, I. Dabo, A. Dal Corso, S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann, C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari, F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto, C. Sbraccia, S. Scandolo, G. Sclauzero, A. Seitsonen, A. Smogunov, P. Umari, R. Wentzcovitch, J. Phys.: Condens. Matter, 21, 395502 (2009)
[16] V. Blum, R. Gehrke, F. Hanke, P. Havu, V. Havu, X. Ren, K. Reuter, M. Scheffler, Computer Physics Communications, 180, 2175-2196 (2009)
Practical information
- Informed public
 - Registration required
 
Organizer
- Jörg Behler, Ruhr-Universität Bochum; Michele Ceriotti, EPFL ; Cecilia Clementi, Freie Universität Berlin ; Gabor Csanyi, University of Cambridge
 
Contact
- Cornelia Bujenita, CECAM Events and Operations Manager