Pacific Northwest National Laboratory (PNNL) scientists are highlighting the potential of artificial intelligence and exascale computing to revolutionize biological research, based on findings from the 2025 Workshop on Envisioning Frontiers in AI and Computing for Biological Research. Researchers detailed how these advanced technologies can enhance biological studies, identifying four priority research directions including multimodal data assembly and AI-enabled experimental systems. Kirsten Hofmockel, who leads the Soil Microbiome Science Focus Area project at PNNL, notes a key challenge: “One of the big challenges I routinely run into collaborating across domains of science is integrating diverse data across multiple scales to establish genotype to phenotype relationships.” This report underscores the importance of combining biological data collection with leading computational capabilities to accelerate scientific discovery.
Workshop Highlights AI for Multiscale Biological Research
Participants focused on applying techniques like multiscale modeling to accelerate discovery. A central theme emerging from the workshop was the challenge of connecting genotype to phenotype, a task complicated by the sheer diversity and volume of biological data. She further emphasized that data arrives “in various formats, from images to genetic sequences,” necessitating meaningful integration for effective AI applications. Neeraj Kumar, Chief Data Scientist at PNNL, highlighted the transformative potential of this interdisciplinary approach, stating, “AI and advanced computing hold immense promise to unlocking breakthroughs in biological research.” He envisions a future where “close collaboration between computer scientists and domain scientists” leads to co-designed systems for next-generation scientific discovery, driving innovation in biotechnology and biomanufacturing.
Integrating BER Data with ASCR Computational Capabilities
The convergence of biological data generation and advanced computing is rapidly reshaping scientific inquiry, with the Department of Energy’s Biological and Environmental Research (BER) program and Advanced Scientific Computing Research (ASCR) program at the forefront. A recent workshop detailed how combining BER’s data collection with ASCR’s computational power—including exascale architectures—represents “an important path to progress.” This synergy isn’t merely about processing power; it’s about enabling entirely new research avenues previously constrained by data integration challenges. Researchers identified four key areas for advancement: multimodal data assembly, multiscale biosystems simulation, AI-enabled experimental systems, and novel algorithms for genomics. Both Kumar and Hofmockel concur that progress hinges on algorithmic innovation and collaborative efforts incorporating biology, computing, and automation to unlock biological mechanisms and support biotechnology.
Genotype-Phenotype Relationships Drive Algorithm Innovation
Neeraj Kumar, chief data scientist at Pacific Northwest National Laboratory (PNNL), is spearheading efforts to unify infrastructure for autonomous discovery across biology, chemistry, and critical materials. He’s integrating the Transformational AI Models Consortium with the American Science Cloud, aiming to bridge computational and domain sciences for national impact. Both Kumar and Hofmockel agree that innovation in algorithms and AI is crucial for interpreting diverse biological data, with Hofmockel stating, “We need to innovate algorithms and leverage AI to integrate and interpret diverse biological data.” New collaborations incorporating biology, advanced computing, and automation are seen as vital for advancing biotechnology and biomanufacturing.
“We need to innovate algorithms and leverage AI to integrate and interpret diverse biological data.”
