A cornerstone of PacBio's long-read technology is its high accuracy, quality, and coverage of genomes. This manifests within its high-fidelity (HiFi) long-read sequencing, a powerful tool used to investigate large genomic or transcriptomic features at a single DNA or RNA molecule level. An essential aspect of generating long-read data is the process of base calling, which is crucial for determining nucleotide sequences of complex, long DNA molecules. However, this requires substantial computational resources, given the need to generate a consensus sequence for each molecule—a process that’s then executed across millions of molecules.
PacBio’s Sequel IIe long-read sequencer was designed with CPU-based computation. While functional, it reached a performance threshold that limited optimal throughout and, therefore, its utility for commercial customers. To address this limitation, PacBio introduced the Revio System featuring NVIDIA A100 GPUs. This advancement allowed for a significant increase in computational power within the same device footprint. As a result of this transition to NVIDIA GPUs, coupled with NVIDIA® CUDA® for code optimization, PacBio was able to accelerate base calling, resulting in overall throughput and efficiency of the sequencing process.
These technologies also significantly accelerated circular consensus sequencing (CCS) on the Revio System. The repeated sequencing of circularized DNA molecules to generate high-accuracy reads required substantial processing power and time, limiting the overall throughput and efficiency of the sequencer. WIth Revio using NVIDIA GPUS, PacBio was able to reduce the CCS process from over 15 hours to 2.5 hours, translating to time savings, enhanced productivity, and increased commercial viability of the Revio sequence to customers.