Quantum Data Classification via Geometric Modeling

A new quantum-inspired classification framework combining geometric principles with variational quantum computation improves data analysis. Nishikanta Mohanty and colleagues at University of Technology Sydney detail a method utilising Correlation Group Structures and compact overlap estimation to create key, correlation-weighted representations of data. The geometry-first approach achieves competitive and stable performance on datasets including Heart Disease, Breast Cancer, and Wine Quality, with test accuracies reaching 0.9556. Furthermore, the method enables effective rare-event detection, achieving approximately 0.85 minority recall on the highly imbalanced Credit Card Fraud dataset, highlighting a flexible and interpretable set of tools for heterogeneous data classification.

Enhanced classification across diverse datasets utilising hybrid quantum and classical techniques

A classification accuracy of 0.9556 on the Wine Quality dataset signifies a major advancement over traditional classification methods previously limited by complex, high-dimensional data. The new framework successfully adapts to varying data conditions, employing a fusion-score classifier for smaller datasets and a Delta + Variational Quantum Classifier (VQC) pipeline for larger, imbalanced ones. This dual approach overcomes challenges in both data volume and distribution, delivering robust performance.

The Delta + VQC pipeline attained approximately 0.85 minority recall on the Credit Card Fraud dataset, a vital metric in scenarios where identifying rare events is important, and where previous systems struggled with the 0.17% prevalence of fraudulent transactions. Test accuracy of 0.8478 was achieved on the Heart Disease dataset and 0.8881 on Breast Cancer, alongside macro-F1 scores of 0.8463 and 0.8703 respectively, demonstrating consistent performance across varied medical datasets. The framework’s geometry-driven approach utilises Correlation Group Structures, which organise features into correlation-based neighbourhoods, enhancing durability in complex data. Furthermore, the system employs a compact SWAP-test, a quantum-inspired technique, to efficiently estimate overlap and derive Euclidean-like and angular similarity channels, allowing for subtle comparison of data points. While these results are promising, a PR-AUC score of 0.3251 on the Credit Card Fraud dataset indicates that improvements are still needed to effectively balance detection rates and false positives in real-world, high-stakes scenarios.

Prioritising model interpretability and scalability for imbalanced fraud detection datasets

Classification systems capable of handling complex, real-world data are increasingly the focus of attention, moving beyond simple probability calculations to understand underlying relationships. Using concepts from quantum computing, a geometry-driven approach creates more strong and interpretable models; however, the precision-recall area under the curve of 0.3251 on the Credit Card Fraud dataset does not diminish the value of this approach. Fraud detection inherently involves imbalanced datasets where identifying genuine positive cases is important, and therefore, focusing solely on overall accuracy can be misleading.

This work prioritises interpretability and scalability, offering a viable alternative to complex ‘black box’ models, particularly for smaller datasets where computational resources are limited. Geometric signals prove flexible and interpretable for data analysis, especially with limited data. A new classification framework blending geometric data organisation with quantum-inspired computation is established, moving beyond traditional probability-based methods. By initially structuring data into Correlation Group Structures, neighbourhoods defined by feature similarity, the system creates durable representations suitable for diverse datasets. A key new feature lies in the use of geometric signals derived from these structures and SWAP-test overlap estimation, enabling effective classification, particularly with limited data, via a fusion-score approach.

The research demonstrated a new classification framework achieving test accuracies of 0.8478, 0.8881, and 0.9556 on the Heart Disease, Breast Cancer, and Wine Quality datasets respectively. This geometry-driven method organises data using Correlation Group Structures and incorporates quantum-inspired computation to create robust and interpretable models. It offers a potential alternative to complex classification systems, particularly when working with small-to-moderate datasets. The authors noted that further improvements are needed to balance detection rates and false positives when applied to highly imbalanced datasets like credit card fraud.

👉 More information
🗞 Quantum-Inspired Geometric Classification with Correlation Group Structures and VQC Decision Modeling
🧠 ArXiv: https://arxiv.org/abs/2604.01930

Muhammad Rohail T.

Latest Posts by Muhammad Rohail T.: