In this lab, accomplish natural language processing (NLP) tasks, such as question answering (QA) and language inference, with NVIDIA Base Command™. Leverage multiple NVIDIA DGX™ A100 640GB systems to pretrain your own highly accurate BERT model on publicly available plain text.
In this lab, train large transformer-based language models on multi-GPU, multi-node NVIDIA DGX™ systems. You’ll find that all components of the stack, from silicon to network to software, are optimized and GPU-accelerated, promising you the fastest time to train. Bootstrap your enterprise’s large language model (LLM) journey using pretuned hyperparameter configurations for GPT-3 models.