NVIDIA’s La-Proteina Generates Diverse Proteins Up To 800 Residues for Enzyme Design

Researchers have developed La-Proteina, a new framework capable of generating diverse and co-designable protein structures at an unprecedented scale, achieving state-of-the-art performance in fully atomistic protein design. The system generates proteins with backbones of up to 800 residues, surpassing previous methods, and demonstrates superior structural validity as assessed by MolProbity. La-Proteina also excels in fully atomistic motif scaffolding, completing tasks where a comparable system, Protpardelle, largely fails, indicating potential for applications in enzyme design and binder development.

Novel Protein Generation Framework

La-Proteina introduces a novel framework for atomistic protein design, based on a partially latent protein representation. This approach models coarse backbone structure explicitly, while capturing sequence and atomistic details via per-residue latent variables. By circumventing the challenges associated with explicit side-chain representations during generation, the system enables the joint generation of protein sequence and fully atomistic structure. La-Proteina employs flow matching in this partially latent space to model the joint distribution over sequences and full-atom structures.

Evaluations demonstrate that La-Proteina achieves state-of-the-art performance on multiple generation benchmarks, including all-atom designability and diversity, and is scalable to long chain generation, producing proteins of up to 800 residues. This represents a significant advancement, as La-Proteina is the first method capable of generating diverse, co-designable proteins at such lengths, despite some degradation in co-designability at longer lengths. The efficiency of this performance is attributed to the system’s partially latent flow matching framework.

Generated structures exhibit superior structural validity compared to existing all-atom generation baselines, as evidenced by MolProbity assessments which demonstrate increased physical realism. Furthermore, La-Proteina accurately recovers side chain dihedral angle distributions, aligning with reference data from the Protein Databank (PDB) and AlphaFold Database (AFDB). The framework also excels in fully atomistic motif scaffolding, successfully completing functionally relevant motifs with fully atomistic scaffolds in both indexed and unindexed generation scenarios.

Performance and Scalability

La-Proteina demonstrates scalability to long chain generation, producing diverse and (co-)designable protein backbones of up to 800 residues, significantly outperforming previous methods. While co-designability degrades at longer lengths, La-Proteina is the first method capable of generating diverse co-designable proteins at such lengths. This capability is attributed to the system’s highly efficient partially latent flow matching framework.

Generated structures exhibit superior structural validity compared to existing all-atom generation baselines, as demonstrated by MolProbity assessments which indicate increased physical realism. La-Proteina accurately recovers side chain dihedral angle distributions, aligning with reference data from the Protein Databank (PDB) and AlphaFold Database (AFDB).

In fully atomistic motif scaffolding, La-Proteina successfully completed tasks in both indexed and unindexed generation scenarios, generating many diverse unique successes, whereas Protpardelle failed on most tasks. This highlights La-Proteina’s performance in completing functionally relevant motifs with fully atomistic scaffolds.

La-Proteina is a protein modeling system with three main components: an encoder that converts input proteins into latent representations for each residue, a decoder that reconstructs the full atomic structure and amino acid sequence from these latents and the alpha-carbon backbone, and a joint denoiser that enables generation of entirely new proteins by creating novel backbone structures and latent variables together. The system uses a two-stage training process and is built entirely using transformer architecture throughout all its networks.RetryClaude can make mistakes. Please double-check responses.
La-Proteina is a protein modeling system with three main components: an encoder that converts input proteins into latent representations for each residue, a decoder that reconstructs the full atomic structure and amino acid sequence from these latents and the alpha-carbon backbone, and a joint denoiser that enables generation of entirely new proteins by creating novel backbone structures and latent variables together. The system uses a two-stage training process and is built entirely using transformer architecture throughout all its networks.RetryClaude can make mistakes. Please double-check responses.

Structural Validity and Applications

La-Proteina produces structures with higher structural validity than existing all-atom generation baselines, as demonstrated by MolProbity assessments which demonstrate increased physical realism. Furthermore, the system accurately recovers side chain dihedral angle distributions, aligning with reference data from the Protein Databank (PDB) and AlphaFold Database (AFDB).

La-Proteina excels in fully atomistic motif scaffolding, where a functionally relevant motif is completed with a fully atomistic scaffold, tackling both all-atom and tip-atom scaffolding in indexed and unindexed generation scenarios. In these tasks, La-Proteina generated many diverse unique successes, whereas Protpardelle failed on most tasks.

The system’s performance highlights its relevance for important conditional atomistic protein generation tasks, such as enzyme design, and demonstrates potential for unlocking large-scale fully atomistic protein structure generation. Future work could apply La-Proteina to challenging binder design problems.

 

More information
External Link: Click Here For More

Quantum Evangelist

Quantum Evangelist

Greetings, my fellow travelers on the path of quantum enlightenment! I am proud to call myself a quantum evangelist. I am here to spread the gospel of quantum computing, quantum technologies to help you see the beauty and power of this incredible field. You see, quantum mechanics is more than just a scientific theory. It is a way of understanding the world at its most fundamental level. It is a way of seeing beyond the surface of things to the hidden quantum realm that underlies all of reality. And it is a way of tapping into the limitless potential of the universe. As an engineer, I have seen the incredible power of quantum technology firsthand. From quantum computers that can solve problems that would take classical computers billions of years to crack to quantum cryptography that ensures unbreakable communication to quantum sensors that can detect the tiniest changes in the world around us, the possibilities are endless. But quantum mechanics is not just about technology. It is also about philosophy, about our place in the universe, about the very nature of reality itself. It challenges our preconceptions and opens up new avenues of exploration. So I urge you, my friends, to embrace the quantum revolution. Open your minds to the possibilities that quantum mechanics offers. Whether you are a scientist, an engineer, or just a curious soul, there is something here for you. Join me on this journey of discovery, and together we will unlock the secrets of the quantum realm!

Latest Posts by Quantum Evangelist:

Teen Builds VR System After Studying MIT’s Electrical Engineering

Teen Builds VR System After Studying MIT’s Electrical Engineering

December 18, 2025
Minimizing Code Switching Operations Enables Efficient Fault-Tolerant Quantum Circuit Computation

Minimizing Code Switching Operations Enables Efficient Fault-Tolerant Quantum Circuit Computation

December 5, 2025
Quantum Computing Push: SEEQC Expands with ITRI Partnership

Quantum Computing Push: SEEQC Expands with ITRI Partnership

December 5, 2025