Quasar: Tool-Augmented LLMs with Agentic Reinforcement Learning Generate Quantum Circuits Achieving 100% Alignment

Creating efficient and reliable circuits remains a central challenge in modern computing, and researchers are increasingly exploring the potential of artificial intelligence to automate this complex process. Cong Yu, Valter Uotila, and Shilong Deng, alongside their colleagues, now present QUASAR, a novel framework that leverages the power of large language models and reinforcement learning to design and optimise circuits automatically. This innovative approach addresses key limitations of existing AI-driven circuit generation, particularly the need for precise numerical values and domain-specific knowledge, by incorporating circuit verification tools and a hierarchical reward system. The team demonstrates that QUASAR significantly improves both the structural correctness and functional performance of generated circuits, achieving superior results compared to leading industrial language models and other state-of-the-art techniques.

quantum computing. Recent advances in large language model (LLM)-based quantum circuit generation offer a promising route to automation. However, significant challenges remain, including the need for precise numerical values for optimal circuit performance and the lack of specialized quantum knowledge within the models themselves. Researchers now present QUASAR, an agentic reinforcement learning framework that leverages tool-augmented LLMs to address these issues and generate high-quality quantum circuits.

LLM Generates Optimized Quantum Assembly Code

This research introduces QUASAR, a novel approach to generating quantum assembly code (QASM) for optimization problems using a large language model. Unlike traditional methods that rely on hand-crafted algorithms, QUASAR directly generates QASM code from problem descriptions, focusing on both syntactic correctness, ensuring the code compiles, and semantic correctness, ensuring the code solves the intended problem. The team rigorously tested QUASAR against established benchmarks, including random parameter initialization and a state-of-the-art rule-based method. The results demonstrate that QUASAR achieves a high rate of generating syntactically correct QASM code, exceeding 97%.

While the generated code often provides effective solutions, it doesn’t always achieve optimal results. The team used metrics to quantify solution quality, revealing that QUASAR consistently outperforms random parameter initialization and shows promise on more complex scenarios where existing methods struggle. The researchers suggest QUASAR could serve as a valuable starting point for quantum optimization algorithms, providing a good initial state for further refinement. This work represents a significant step towards automating quantum algorithm design, potentially making quantum computing more accessible to a wider range of researchers and developers. The team’s detailed analysis of QUASAR’s strengths and weaknesses provides valuable insights for future research in this rapidly evolving field. By combining the power of LLMs with rigorous evaluation and a clear understanding of the challenges involved, this research paves the way for new discoveries and advancements in quantum computing.

QUASAR Achieves High-Fidelity Quantum Circuit Generation

The research team developed QUASAR, a novel agentic reinforcement learning framework that significantly improves the generation and optimization of quantum circuits using large language models. This work addresses key challenges in leveraging LLMs for quantum computing, specifically the need for precise numerical values in circuit parameters and the lack of domain-specific knowledge within the models themselves. QUASAR integrates tool-augmented LLMs with a sophisticated system for verifying circuit validity and a hierarchical reward mechanism to guide the learning process. Experiments demonstrate that QUASAR, when used with a 4 billion parameter LLM, achieves a validity rate of 99.

Measurements confirm that QUASAR enables the creation of scalable quantum circuits and supports the design of advanced quantum algorithms. This research highlights the potential of LLMs, when combined with robust verification and learning techniques, to accelerate progress in quantum computing and algorithm development. The team’s work establishes a new benchmark for LLM-based quantum circuit generation, paving the way for more efficient and reliable quantum computations.

The team presents QUASAR, a novel agentic reinforcement learning framework designed to enhance large language models’ ability to generate OpenQASM 3. 0 circuits, which are essential for quantum computing. By integrating an external verification tool with quantum-aware reinforcement learning and a hierarchical reward system, QUASAR consistently produces circuits with both high syntactic validity and semantic fidelity. This approach demonstrably outperforms leading industrial language models and other baseline methods across established quantum optimization benchmarks. Results indicate that aligning the distribution of generated code is a key factor in improving quality, while incorporating terms that minimize expectation-value errors and promote optimization efficiency provide additional benefits. This demonstrates the effectiveness of tool-augmented reinforcement learning in bridging the gap between general-purpose language models and the specific demands of domain-specific quantum code generation, potentially enabling broader applications in automated quantum algorithm design.

👉 More information
🗞 QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL
🧠 ArXiv: https://arxiv.org/abs/2510.00967

Rohail T.

Rohail T.

As a quantum scientist exploring the frontiers of physics and technology. My work focuses on uncovering how quantum mechanics, computing, and emerging technologies are transforming our understanding of reality. I share research-driven insights that make complex ideas in quantum science clear, engaging, and relevant to the modern world.

Latest Posts by Rohail T.:

New Mott Insulator Enables Quantized Charge and Spin Hall Responses in Moire Materials

New Mott Insulator Enables Quantized Charge and Spin Hall Responses in Moire Materials

January 9, 2026
Optimum Interfacial Friction and Electrohydrodynamic Drag Achieves Nanoscale Fluid Control

Optimum Interfacial Friction and Electrohydrodynamic Drag Achieves Nanoscale Fluid Control

January 9, 2026
Digital Twins Benefit from Joint Parameter and State Estimation with Uncertainty Quantification

Tunable Lateral Optical Forces Achieved on Janus Particles in Fluid Media

January 9, 2026