The Lambda Deep Learning Blog

BERT is Google's state-of-the-art method for pre-training language representations. This post is about running BERT on multiple GPUs. Specifically, we will use the Horovod framework to parallelize the tasks. We will list all the changes to the original BERT implementation and highlight a few places that will make or break performance.

Multi-GPU enabled BERT using Horovod