This blog will walk you through the steps of setting up a Horovod (https://github.com/horovod/horovod) + Keras (https://keras.io/) environment for multi-GPU training.

Prerequisites:
* Hardware: a machine with at least two GPUs
* Basic software: Ubuntu (18.04 or 16.04), NVIDIA driver (418.43), CUDA (10.0)
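As a quick sanity check of the hardware prerequisite, a snippet along these lines (assuming a TensorFlow build matching the CUDA 10.0 toolkit above is already installed) confirms that at least two GPUs are visible:

```python
# Sanity check: confirm TensorFlow can see at least two GPUs.
# Assumes TensorFlow is installed against the CUDA 10.0 toolkit listed above.
from tensorflow.python.client import device_lib

gpus = [d for d in device_lib.list_local_devices() if d.device_type == "GPU"]
print("Visible GPUs:", len(gpus))
assert len(gpus) >= 2, "multi-GPU training needs at least two GPUs"
```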
This presentation is a high-level overview of the different types of training regimes you'll encounter as you move from single-GPU to multi-GPU to multi-node distributed training. It briefly describes where the computation happens, how the gradients are communicated, and how the model copies are updated and kept in sync.
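As a toy illustration of the synchronous data-parallel regime (the one Horovod implements), the NumPy sketch below mimics one training step: every worker computes a gradient on its own data shard, the gradients are averaged (the job ring-allreduce does in practice), and every replica applies the identical update. The worker count, loss, and data here are made up for illustration:

```python
# Toy sketch of synchronous data-parallel SGD: compute gradients locally,
# average them across workers ("allreduce"), apply the same update everywhere.
import numpy as np

rng = np.random.default_rng(0)
n_workers = 4
w = np.zeros(3)  # the model, replicated on every worker

# Each worker holds its own shard of the training data.
shards = [(rng.standard_normal((8, 3)), rng.standard_normal(8))
          for _ in range(n_workers)]

def local_gradient(w, x, y):
    # Gradient of mean squared error on this worker's mini-batch.
    return 2.0 * x.T @ (x @ w - y) / len(y)

for step in range(100):
    grads = [local_gradient(w, x, y) for x, y in shards]  # done in parallel
    g = np.mean(grads, axis=0)   # stand-in for the ring-allreduce average
    w -= 0.05 * g                # every replica applies the identical update
```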
BERT is Google's state-of-the-art method for pre-training language representations. This blog is about running BERT with multiple GPUs. Specifically, we will use the Horovod framework to parallelize the training. We will list all the changes to the original BERT implementation and highlight a few places that will make or break the performance.
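For orientation before the full list of changes, the sketch below shows the standard set of Horovod modifications applied to a generic tf.keras training script. This is not the exact BERT diff from the post; the model and checkpoint names are placeholders:

```python
# The canonical Horovod changes to a single-GPU tf.keras script
# (illustrative; the actual BERT code uses the TF Estimator API).
import tensorflow as tf
import horovod.tensorflow.keras as hvd

hvd.init()  # 1. initialize Horovod (one process per GPU)

# 2. pin each process to a single local GPU
config = tf.compat.v1.ConfigProto()
config.gpu_options.visible_device_list = str(hvd.local_rank())
tf.compat.v1.keras.backend.set_session(tf.compat.v1.Session(config=config))

# placeholder model standing in for BERT
model = tf.keras.Sequential(
    [tf.keras.layers.Dense(10, activation="softmax", input_shape=(784,))]
)

# 3. scale the learning rate by the number of workers
opt = tf.keras.optimizers.Adam(0.001 * hvd.size())

# 4. wrap the optimizer so gradients are averaged via ring-allreduce
opt = hvd.DistributedOptimizer(opt)
model.compile(optimizer=opt, loss="sparse_categorical_crossentropy")

callbacks = [
    # 5. broadcast initial weights from rank 0 so all replicas start identical
    hvd.callbacks.BroadcastGlobalVariablesCallback(0),
]
# 6. write checkpoints only on rank 0 to avoid clobbering
if hvd.rank() == 0:
    callbacks.append(tf.keras.callbacks.ModelCheckpoint("ckpt-{epoch}.h5"))

# model.fit(..., callbacks=callbacks) then runs under `horovodrun -np 2 ...`
```

Getting steps 3, 5, and 6 right is exactly the kind of detail the post flags as making or breaking performance and correctness.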