Understanding the behaviour of electrons in strongly correlated materials presents a fundamental challenge in modern physics, and now, Huan Ma, Bowen Kan from the Chinese Academy of Sciences, and Honghui Shang and Jinlong Yang from the University of Science and Technology of China, demonstrate a significant advance in tackling this problem. The team successfully adapts Transformer neural networks, originally developed for natural language processing, to model the complex interactions between electrons in these materials. This innovative approach, termed Transformer-based neural network backflow, accurately predicts the ground-state energy and magnetic properties of challenging iron-sulfur clusters, surpassing the performance of established methods like DMRG and CCSD(T). By achieving both chemical accuracy in energy calculations and improved predictions of magnetic exchange coupling, this work establishes a powerful new variational ansatz for exploring strongly correlated electronic structure and opens avenues for studying larger, previously inaccessible systems.
Transformer Networks Solve Quantum Chemistry Problems
Scientists have developed a new approach to solving the quantum many-body problem, a significant challenge in quantum chemistry. They are exploring the use of Transformer-based neural networks as a way to approximate solutions to the Schrödinger equation for complex molecular systems. This research details the network architecture, optimization techniques, and performance, comparing it to established methods like quantum Monte Carlo and tensor networks. The work represents a highly technical advancement aimed at researchers in computational chemistry, physics, and machine learning. The quantum many-body problem arises when attempting to accurately describe systems with many interacting particles, such as electrons in molecules.
Exact solutions are generally impossible, necessitating the use of approximations. Researchers employ variational wavefunctions, trial solutions to the Schrödinger equation, seeking those that minimize the system’s energy and provide the best possible approximation to the true ground state. This study utilizes Neural Network Quantum States, representing the wavefunction using neural networks, allowing for flexible and accurate representations of complex quantum states. Transformers, a type of neural network originally developed for natural language processing, are adapted to represent these quantum wavefunctions, leveraging their ability to capture long-range dependencies.
A backflow transformation further improves the flexibility and accuracy of the neural network wavefunctions by allowing the coordinates of the particles to be transformed. The team also investigated various optimization algorithms to minimize the system’s energy, improving optimization stability and efficiency. This research introduces a Transformer-based wavefunction, a novel approach departing from more traditional Neural Network Quantum State methods. The scientists also explored combining Transformer-based Neural Network Quantum States with tensor networks, leveraging the strengths of both methods.
They present techniques to improve the optimization process, making it more stable and efficient for large-scale systems, and emphasize the scalability of their approach, aiming to tackle systems that challenge traditional methods. The method was applied to various molecular systems, including iron-sulfur clusters and other complex materials. The results demonstrate that their Transformer-based Neural Network Quantum State approach achieves competitive or superior performance compared to existing methods on a range of benchmark problems. They accurately calculate ground-state energies and other properties of complex molecules, and the scalability of their approach allows it to handle larger systems than some traditional methods. This research represents a significant step forward in developing machine learning-based methods for solving the quantum many-body problem, potentially overcoming limitations of existing methods and enabling the study of increasingly complex molecular systems, leading to new insights in chemistry, physics, and materials science.
Transformers Model Electronic Correlations in Clusters
Scientists have achieved a breakthrough in solving the electronic Schrödinger equation for strongly correlated systems by adapting Transformer architectures and integrating neural network backflow. This work demonstrates a new method for capturing complex electronic correlations by processing electronic configurations as token sequences, where attention layers learn non-local orbital correlations. The method learns contextual representations and maps them into refined orbitals, offering a powerful new approach to understanding material properties. The team validated this approach by applying it to iron-sulfur clusters, notoriously difficult systems for traditional computational methods.
Results show the method achieves ground-state energies with chemical accuracy, comparable to, and often exceeding the precision of, established techniques like Density Matrix Renormalization Group and Coupled Cluster Singles and Doubles with perturbative triples. Importantly, the method accurately predicts magnetic exchange coupling constants, aligning more closely with experimental values than existing computational approaches. The approach scales favorably to large active spaces, exceeding the capabilities of exact methods, and benefits from stable convergence enabled by distributed Variational Monte Carlo optimization. These results establish Transformer-based backflow as a powerful variational ansatz for strongly correlated electronic structure, achieving superior magnetic property predictions while maintaining chemical accuracy in total energies. The team’s framework dynamically re-defines the single-particle orbital basis as a function of the many-electron configuration, allowing each electron’s effective orbital to respond to the positions and occupancies of all others. This backflow transformation introduces nonlinearity and correlation, enabling the wavefunction to adapt to electron interactions and accurately model complex systems beyond the reach of conventional methods.
👉 More information
🗞 Transformer-Based Neural Networks Backflow for Strongly Correlated Electronic Structure
🧠 ArXiv: https://arxiv.org/abs/2509.25720
