Reinforcement Learning Optimizes CNOT Quantum Circuits, Achieving High Efficiency for Matrices up to Size 8

Quantum circuits, the building blocks of quantum computation, rely on fundamental gates to create entanglement, a key resource for powerful algorithms, and minimising the number of these gates is a critical challenge. Riccardo Romanello from Ca’ Foscari University of Venice, Daniele Lizzio Bosco, and Jacopo Cossio from the University of Udine, along with their colleagues, now demonstrate a significant advance in this area by introducing a novel reinforcement learning approach to circuit minimisation. Unlike previous methods that require separate training for different circuit sizes, their system employs a single agent capable of handling circuits up to a fixed size, utilising embedding or Gaussian striping to process matrices of varying dimensions. The team trained this agent on circuits of size eight and then tested its performance on larger circuits, revealing that their method consistently outperforms existing state-of-the-art algorithms as circuit complexity increases, representing a substantial step towards more efficient quantum computation.

Reducing CNOT gate count directly impacts circuit complexity and feasibility, and is essential for achieving fault-tolerant quantum computation. Researchers are exploring a variety of approaches, from traditional algorithms like the Solovay-Kitaev algorithm and Gaussian elimination, to more modern machine learning techniques. Other methods include Boolean satisfiability solvers and Answer Set Programming, offering alternative routes to circuit optimization. Researchers are also incorporating topological constraints, reflecting the physical layout of qubits on a quantum device, and exploring hybrid approaches that combine different techniques to achieve better results.

Reinforcement Learning Minimizes CNOT Gate Count

Scientists have developed a novel reinforcement learning approach to minimize CNOT gates in quantum circuits, a critical step towards efficient quantum computation. Recognizing that fewer CNOT gates reduce noise and improve the feasibility of quantum error correction, the team focused on finding the shortest sequence of operations to transform a given matrix into the identity matrix. This problem was simplified by leveraging the relationship between quantum circuits and binary invertible matrices. The study pioneered a method where a single reinforcement learning agent is trained to operate on matrices up to a fixed size, a departure from traditional approaches that require separate agents for different circuit sizes. Matrices exceeding this size are preprocessed using techniques like embedding or Gaussian striping, ensuring compatibility with the trained agent. Rigorous evaluation demonstrated that this method consistently outperforms existing algorithms as the matrix size increases, indicating a significant advancement in scalability.

Reinforcement Learning Optimizes Quantum Circuit Gate Count

Scientists have achieved a breakthrough in minimizing CNOT gates in quantum circuits using a novel reinforcement learning approach. Reducing gate counts directly impacts circuit performance and feasibility, and is a key challenge in quantum computing. The team developed an agent trained on matrices of a specific size, and then successfully applied this agent to larger matrices, demonstrating a significant advancement in scalability. The core of this work lies in a reinforcement learning technique where an agent learns to transform a given matrix into the identity matrix using a sequence of CNOT operations.

Each successful transformation, bringing the matrix closer to the identity, is rewarded, effectively training the agent to optimize gate usage. The team overcame a common limitation of reinforcement learning, the need for retraining with every change in problem setting, by combining the RL agent with matrix manipulation techniques. Specifically, they employed sub-matrix embeddings and a greedy algorithm to pre-process matrices, allowing the agent to effectively solve problems with varying dimensions. Experiments demonstrate the efficacy of this combined approach, with the agent consistently outperforming existing algorithms as the matrix size increases. This breakthrough delivers a significant step towards building practical and scalable quantum computers by minimizing a key source of noise and complexity in quantum circuits.

Scalable Quantum Circuit Optimisation via Reinforcement Learning

This research presents a novel reinforcement learning approach to minimize CNOT gates in quantum circuits. By training a single agent to handle circuits up to a certain size and then applying it to larger circuits through preprocessing techniques, the team demonstrates improved performance compared to existing algorithms, particularly as circuit size increases. This advancement contributes to the ongoing effort to optimize quantum circuits, a crucial step towards mitigating the effects of decoherence and reducing the number of qubits required for computation. The method’s efficacy stems from its ability to generalise beyond the specific circuits used during training, offering a scalable solution to a problem often limited by the need for individual agents per circuit size. While acknowledging the constraints of the specific problem variant addressed, unconstrained CNOT minimization, the authors highlight the potential for extending this approach to more complex scenarios. Future work could explore adapting the agent to handle constrained minimization problems or applying the methodology to optimize other types of quantum gates, further contributing to the development of efficient and practical quantum computing technologies.

👉 More information
🗞 CNOT Minimal Circuit Synthesis: A Reinforcement Learning Approach
🧠 ArXiv: https://arxiv.org/abs/2510.23304

Rohail T.

Rohail T.

As a quantum scientist exploring the frontiers of physics and technology. My work focuses on uncovering how quantum mechanics, computing, and emerging technologies are transforming our understanding of reality. I share research-driven insights that make complex ideas in quantum science clear, engaging, and relevant to the modern world.

Latest Posts by Rohail T.:

Spiking Neural Networks Enable 80% Accurate UWB Channel Estimation at the Edge

Advances LLM Capabilities: Self-Critique Boosts Planning Performance on Mini-grid and Logistics by a Substantial Margin

January 9, 2026
Ruo2 Altermagnet Enables Field-Free Switching with Chiral Dual Spin Currents

Edge AI Inference Enabled by TYTAN, Delivering 56% Improved Energy Efficiency

January 9, 2026
Bayesian Transformers Achieve Diverse Intelligence with Sampling from a Single Model

Rogue-wave Statistics Achieve Robust Spiking Neural Networks for Low-Power AI

January 9, 2026