
Putting the NVIDIA GH200 Grace Hopper Superchip to good use: superior inference performance and economics for larger models
When it comes to large language model (LLM) inference, cost and performance go hand in hand. Single-GPU instances are practical and economical; however, models ...