Leonardo Guerreiro Azevedo, Julio Cesar Cardoso Tesolin, et al.
ACS Fall 2023
Designing stable crystal structures is central to accelerating the discovery of new materials, yet most generative approaches remain limited to reproducing known patterns rather than exploring novel possibilities. We present a method that trains large language models with reinforcement learning guided by verifiable energy-based rewards, optimizing toward physically grounded stability objectives. Compared to supervised finetuning and base models, our reinforcement learning–trained model generates crystals with higher predicted stability and a greater proportion of previously unreported structures. These results suggest that combining verifiable energy rewards and reinforcement learning provides a powerful path toward automated discovery of novel, stable materials.
Leonardo Guerreiro Azevedo, Julio Cesar Cardoso Tesolin, et al.
ACS Fall 2023
Viviane T. Silva, Rodrigo Neumann Barros Ferreira, et al.
ACS Fall 2024
Amol Thakkar, Andrea Antonia Byekwaso, et al.
ACS Fall 2022
Philippe Schwaller, Benjamin Hoover, et al.
Science Advances