Bina Nusantara University Team Introduces Dqmw-Sample for Classically Hard Online Learning

A new online-learning primitive, Dissipative Quantum Multiplicative Weights with Sampling Feedback (DQMW-Sample), prepares a Gibbs state for loss feedback by using engineered open quantum-system dynamics. Agung Trisetyarso of Bina Nusantara University and colleagues from Institut Teknologi Bandun transform the computational hardness of constant-temperature Gibbs sampling into a physically realisable process, achieving asymptotically sublinear regret, a sharp improvement over the constant average regret experienced by all efficient classical learners on a specific instance. The engineered dissipator contracts hardware noise, resulting in sublinear noise-induced regret and strengthening the single-round hardness to encompass the full adaptive interaction, potentially collapsing the polynomial hierarchy if efficiently simulated classically. Initial hardware characterisation on the IBM Heron r2 processor positions DQMW-Sample as a promising pathway towards achieving computational advantage in online learning, grounded in complexity theory and compatible with near-term superconducting hardware.

Guiding quantum systems to complex probability distributions using engineered dissipation

Engineered dissipation, a mechanism analogous to applying a controlled damping force to a quantum system, is central to this novel approach. Unlike traditional methods that rely on preserving quantum coherence, engineered dissipation intentionally introduces interactions with the environment, guiding the quantum system towards a desired, stable state known as a Gibbs state. This Gibbs state represents a probability distribution over all possible computational basis states, effectively encoding a complex landscape of probabilities. Imagine a scenario akin to drawing marbles from a bag containing multiple colours; the Gibbs state defines the frequency with which each colour is drawn. However, accurately simulating this probability distribution becomes exponentially more difficult for classical computers as the number of possible outcomes, or ‘colours’, increases. This computational bottleneck is precisely what DQMW-Sample aims to circumvent. The team demonstrated feasibility by implementing DQMW-Sample on the IBM Heron r2 processor, a superconducting quantum computer, showcasing compatibility with current hardware capabilities. The fundamental motivation behind this approach lies in the inherent difficulty classical computers face when sampling from a Gibbs state, contrasting with their relative ease in calculating average values. This hardness is not merely a technical hurdle; it is the cornerstone of potentially achieving a computational advantage.

The engineered dissipator, a carefully crafted quantum operator, acts as the ‘brake’ on the quantum system, driving it towards the target Gibbs state. This is achieved through a Davies-type dissipator, a specific mathematical formulation describing the interaction between the quantum system and its environment. The dissipator’s design is crucial, as it not only steers the system towards the desired state but also actively mitigates the effects of hardware noise, a pervasive challenge in quantum computing. Noise introduces errors into the quantum computation, potentially corrupting the learning process. By strategically contracting these noise sources, the dissipator enhances the robustness of the algorithm, ensuring more reliable performance even in the presence of imperfections. This noise mitigation is a significant practical consideration for near-term quantum devices.

Quantum dissipation enables sublinear regret in online learning

A regret rate of O(√T log d) has been achieved, representing a substantial improvement over the constant average regret inherent in all efficient classical online-learning methods. Here, ‘T’ represents the number of rounds of learning, and ‘d’ denotes the number of possible actions the algorithm can take. Regret, in the context of online learning, quantifies the cumulative difference between the performance of the algorithm and the optimal strategy in hindsight. A lower regret rate indicates a more efficient learning process. This breakthrough, accomplished with DQMW-Sample, arises from translating the computational difficulty of constant-temperature Gibbs sampling into a physically viable learning process. Engineered dissipation, a process removing energy from a quantum system, was used to create a Gibbs state, utilising its measurement as loss feedback, effectively transforming a complex computational problem into an online-learning primitive. The measurement outcome provides a signal indicating the ‘loss’ or error associated with the algorithm’s current action, guiding it towards better choices in subsequent rounds.

Furthermore, a rigorous mathematical proof demonstrates that simulating the entire learning process, encompassing all T rounds of feedback, would be computationally intractable for classical computers unless the polynomial hierarchy collapses. The polynomial hierarchy is a fundamental concept in computational complexity theory, classifying problems based on their difficulty. Its collapse would imply that problems currently considered intractable could be solved efficiently, a highly unlikely scenario. This result suggests that DQMW-Sample may offer a genuine quantum advantage, meaning it can solve certain problems faster than any classical algorithm. Initial characterisation of the method was performed on the IBM Heron r2 processor, confirming compatibility with near-term superconducting technology. However, current results rely on a specific, explicit realizability condition, namely, the ability to accurately implement the engineered dissipator. The team acknowledges that demonstrating this condition in full generality, and scaling the system to handle more complex problems with a larger number of actions ‘d’, remains a significant challenge before practical applications become feasible. The engineered dissipator actively contracts hardware noise, resulting in sublinear regret even with imperfect quantum systems, meaning the learning process remains effective despite realistic imperfections in the quantum hardware. This durability is key for implementation on existing quantum devices and extends the algorithm’s practical applicability.

Quantum algorithms demonstrate promise but require broader validation across diverse learning tasks

DQMW-Sample’s development offers a potential route to improved online learning, tackling problems where algorithms must continuously adapt to new information, such as financial trading, resource allocation, or robotic control. However, the current demonstration relies on a “suitably constructed instance”, a specific problem carefully designed to highlight the method’s strengths; this raises a key tension, as it remains unclear whether this advantage extends to all real-world online learning challenges. Messier and less predictable real-world data often differs significantly from this tailored problem, potentially diminishing the observed performance gains. Further research is needed to assess the algorithm’s robustness and generalizability across a wider range of problem domains.

This work also establishes a key link between quantum dynamics and a fundamental challenge in machine learning, offering a pathway to algorithms that could, in principle, outperform classical methods when faced with continuous streams of information. A new method establishes a connection between quantum systems and online learning, where algorithms continuously adapt to incoming data. This approach transforms constant-temperature Gibbs sampling, a computationally difficult problem, into a physically realisable learning process, achieving a learning speed exceeding that of traditional methods. The process involves utilising mechanisms that remove energy from a quantum system to create a specific quantum state, and measurement of this state provides important feedback for the learning process. Future research will focus on exploring alternative dissipator designs, investigating the algorithm’s performance on more complex datasets, and developing techniques to further mitigate the effects of hardware noise, ultimately paving the way for practical quantum-enhanced online learning systems.

The research demonstrated a new online learning method, Dissipative Quantum Multiplicative Weights with Sampling Feedback (DQMW-Sample), which achieved asymptotically sublinear regret while classical learners experienced constant average regret on a specific instance. This suggests a potential advantage for quantum systems in continuously adapting to new information, as the method leverages engineered quantum dynamics to prepare a state used for learning feedback. The study also proved the engineered dissipator contracts hardware noise, yielding sublinear noise-induced regret. Researchers intend to explore alternative designs and assess performance on more complex datasets to further refine the algorithm.

👉 More information
🗞 Dissipative Quantum Multiplicative Weights with Sampling Feedback: A Classically Hard Primitive Realized via Engineered Open-System Dynamics
✍️ Agung Trisetyarso, Lenny Putri Yulianti and Kridanto Surendro
🧠 ArXiv: https://arxiv.org/abs/2606.26162

Stay current. See today’s quantum computing news on Quantum Zeitgeist for the latest breakthroughs in qubits, hardware, algorithms, and industry deals.
Avatar photo

Latest Posts by Muhammad Rohail T.: