Researchers at Kosuke Ito and collaborators from Keio University, EPFL, and partner institutions, have developed a new unsupervised domain adaptation framework that allows machine learning models to effectively learn from imperfect quantum data, addressing a significant obstacle in the advancement of quantum machine learning. This work presents a technique utilising classical representations of quantum states obtained via classical shadows, performing the adaptation entirely within a classical computational pipeline following quantum measurements. Numerical evaluations on quantum phases of matter and entanglement classification tasks demonstrate improved performance compared to existing methods, highlighting the potential for practical applications in realistic quantum data learning.
Classical shadows enable domain adaptation for enhanced quantum data classification
Predictive performance on quantum data improves substantially with the use of classical shadows, exceeding source-only baselines and target-only unsupervised learning by a sharp margin across both quantum phase and entanglement classification tasks. This advancement overcomes a long-standing limitation, as adapting machine learning models to imperfect quantum data previously required fully labelled datasets, which are rarely obtainable in practical quantum experiments. The core innovation lies in the creation of simplified classical representations of quantum states through measurement, enabling adaptation to occur entirely on conventional computers, streamlining the process and reducing reliance on quantum processing power. Classical shadows are probabilistic representations of quantum states constructed from the results of numerous measurements; they allow for the estimation of expectation values of observables without requiring complete state tomography, significantly reducing the computational burden.
The framework’s success highlights a practical route towards robust learning from realistic, imperfect quantum data, paving the way for more reliable quantum machine learning applications. It effectively bridges the gap between labelled source data, typically obtained under idealised conditions, and unlabelled, imperfect target quantum data, representative of real-world experimental limitations. Experiments involving quantum phases of matter and entanglement classification revealed that this unsupervised approach consistently outperformed source-only baselines and target-only unsupervised learning. Specifically, varying Hamiltonian parameters, ground-state preparation methods, and introducing hardware noise created the domain shift in the quantum phase benchmarks, simulating the discrepancies between training and deployment environments. Entanglement benchmarks utilised differing system sizes and state-generation procedures to further challenge the model’s adaptability. This adaptation occurs entirely on conventional computers, avoiding the need for extensive quantum processing, and opens possibilities for applying the technique to more complex quantum systems and datasets, such as those arising from quantum simulations or near-term quantum devices. The use of classical computation for adaptation is particularly advantageous given the current limitations in scalable quantum processing.
Balancing quantum measurement cost against accuracy gains in domain adapted learning
While this new framework demonstrably outperforms simpler methods, its reliance on classical shadows introduces a potential bottleneck; generating these representations requires numerous quantum measurements, a resource often constrained in real-world experiments. The number of measurements required to accurately construct classical shadows scales with the size of the quantum system and the desired fidelity of the representation. Existing unsupervised learning approaches, focused solely on the target data, may offer computational advantages in scenarios where measurement costs are prohibitive. This raises a key tension: does the improved accuracy gained through domain adaptation justify the increased demand on quantum resources, or are there situations where a less accurate, but more efficient, method is preferable. The trade-off between measurement cost and accuracy is a critical consideration for practical implementation.
Generating classical shadows demands substantial quantum measurement resources, representing a clear trade-off between accuracy and experimental cost. The precise number of measurements needed depends on the complexity of the quantum state and the desired accuracy of the classical shadow. However, a valuable methodology for improving quantum machine learning models when labelled data is scarce is now available. Performance gains across different quantum tasks validate the approach’s broad applicability, even if resource limitations present challenges for some implementations. Future research focusing on reducing measurement overhead will be vital for wider adoption, potentially through optimised measurement strategies, such as compressed sensing techniques, or alternative shadow construction techniques that require fewer measurements. Exploring methods to efficiently estimate the required number of measurements for a given level of accuracy is also crucial. Furthermore, investigating the use of partially labelled data, combining the benefits of both supervised and unsupervised learning, could offer a promising avenue for future research.
A new technique for training machine learning models on quantum data without requiring pristine, fully labelled datasets is now established. By employing ‘classical shadows’, the framework adapts models to imperfect data entirely on standard computers, circumventing a key limitation of previous methods that relied on idealised conditions rarely found in practical quantum experiments. The approach demonstrably improves performance on tasks such as identifying quantum phases and entanglement, offering a significant step towards practical quantum machine learning. The ability to leverage classical computational resources for adaptation is particularly important given the current limitations of quantum hardware. This work contributes to the growing field of near-term quantum machine learning, where algorithms are designed to run on noisy intermediate-scale quantum (NISQ) devices. The framework’s success suggests that classical-quantum hybrid approaches, combining the strengths of both classical and quantum computation, are a promising path towards realising the potential of quantum machine learning soon. Further investigation into the robustness of this framework against different types of noise and imperfections in quantum devices is warranted.
The research successfully demonstrated a new unsupervised domain adaptation framework for learning from imperfect quantum data using classical computational pipelines. This is important because it addresses the practical challenge of training machine learning models when clean, labelled quantum data is unavailable, a common issue in real-world quantum experiments. By utilising classical representations of quantum states obtained via classical shadows, the method outperformed existing approaches on quantum phase and entanglement classification tasks. The authors suggest future work will focus on reducing measurement overhead to facilitate wider adoption of the technique.
👉 More information
🗞 Learning from imperfect quantum data via unsupervised domain adaptation with classical shadows
🧠ArXiv: https://arxiv.org/abs/2603.28294
