High-Performance Recursive TRMM/TRSM Implementation in Julia for GPUs Across Architectures

On April 18, 2025, a team led by Vicki Carrica and Maxwell Onyango published "Toward Portable GPU Performance: Julia Recursive Implementation of TRMM and TRSM," which details an efficient Julia-based approach to triangular matrix operations on NVIDIA, AMD, and Apple Silicon GPUs.

The paper presents a recursive GPU implementation in Julia of triangular matrix-matrix multiplication (TRMM) and triangular solve (TRSM). The recursion restructures both operations so that most of the work is done by general matrix-matrix multiplication (GEMM), which makes better use of the GPU memory hierarchy.
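To illustrate the general idea of a recursive triangular solve built on GEMM, here is a minimal CPU sketch in plain Python. It is not the authors' Julia code: the function names `matmul` and `trsm_lower`, the `base` cutoff, and the list-of-rows matrix layout are all assumptions made for illustration. The sketch solves L X = B for a lower-triangular L by splitting L into four blocks, solving the top half, applying one GEMM update to the bottom half, and then solving the bottom half.

```python
def matmul(a, b):
    """Plain GEMM stand-in: returns the product a * b for lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def trsm_lower(l, b, base=2):
    """Solve L X = B for X, where L is lower triangular (hypothetical helper).

    The matrix is split as L = [[L11, 0], [L21, L22]] and B = [B1; B2]:
      1. X1 = solve(L11, B1)      (recursive call)
      2. B2 = B2 - L21 * X1       (the GEMM update)
      3. X2 = solve(L22, B2)      (recursive call)
    """
    n = len(l)
    if n <= max(base, 1):
        # Base case: ordinary forward substitution on a small block.
        x = []
        for i in range(n):
            row = [
                (b[i][j] - sum(l[i][k] * x[k][j] for k in range(i))) / l[i][i]
                for j in range(len(b[0]))
            ]
            x.append(row)
        return x

    h = n // 2
    l11 = [r[:h] for r in l[:h]]
    l21 = [r[:h] for r in l[h:]]
    l22 = [r[h:] for r in l[h:]]

    x1 = trsm_lower(l11, b[:h], base)
    update = matmul(l21, x1)
    b2 = [[bv - uv for bv, uv in zip(br, ur)] for br, ur in zip(b[h:], update)]
    x2 = trsm_lower(l22, b2, base)
    return x1 + x2  # stack the two halves of the solution
```

In this sketch the off-diagonal work all passes through `matmul`, which is the step a GPU implementation would hand to a tuned GEMM routine. A recursive TRMM can be organized with the same block split, multiplying instead of solving.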

Using Julia’s multiple dispatch, metaprogramming, and frameworks like GPUArrays and KernelAbstractions, the authors developed a hardware-agnostic API supporting NVIDIA, AMD, and Apple Silicon GPUs. For large matrices, the implementation achieves throughput comparable to vendor libraries like cuBLAS and rocBLAS while providing TRMM/TRSM routines for Apple Silicon for the first time. The concise codebase demonstrates Julia’s ability to deliver near-vendor performance across heterogeneous architectures.

NVIDIA is at the forefront of advancing GPU technology, significantly impacting fields such as artificial intelligence, scientific research, and high-performance computing (HPC). Their innovations are strategically aimed at enhancing efficiency, scalability, and adaptability across diverse applications.

👉 More information
🗞 Toward Portable GPU Performance: Julia Recursive Implementation of TRMM and TRSM
🧠 DOI: https://doi.org/10.48550/arXiv.2504.13821

Quantum News
