The Lambda Deep Learning Blog

Featured Posts

Recent Posts

OpenAI's GPT-3 Language Model: A Technical Overview

Chuan Li, PhD reviews GPT-3, the new NLP model from OpenAI. This paper empirically shows that language model performance scales as a power-law with model size, datataset size, and the amount of computation.

Published 06/03/2020 by Chuan Li

How to Run OpenAI's GPT-2 Text Generator on Your Computer

Update June 5th 2020: OpenAI has announced a successor to GPT-2 in a newly published paper. Checkout our GPT-3 model overview []. OpenAI recently published a blog post on their GPT-2 language model []. This tutorial shows you how to run the

Published 02/16/2019 by Stephen Balaban

Multi-GPU enabled BERT using Horovod

BERT is Google's SOTA pre-training language representations. This blog is about running BERT with multiple GPUs. Specifically, we will use the Horovod framework to parrallelize the tasks. We ill list all the changes to the original BERT implementation and highlight a few places that will make or break the performance.

Published 02/06/2019 by Chuan Li


Next page