The ultimate GPU server for deep learning

Now available with NVIDIA H100 NVL Tensor Core GPUs

microsoft logo
amazon logo
anthem logo
raytheon logo
argonne logo
sony logo
ibm logo
google logo
caltech logo
berkeley logo
netflix logo

10,000+ research teams trust Lambda


Lambda Scalar powered by NVIDIA H100 NVL GPUs

Lambda Scalar servers come with the new NVIDIA H100 NVL Tensor Core GPUs and deliver unprecedented performance, scalability, and security for every workload. NVIDIA H100 NVL GPUs feature fourth-generation Tensor Cores and the Transformer Engine with FP8 precision, further extending NVIDIA’s market-leading AI leadership with faster training and inference speedup on large language models.

Spec Highlights
lambda 4U

Engineered for your workload

Tell us about your research and we'll design a machine that's perfectly tailored to your needs.

Up to
Up to
and 384 threads
Up to
GB of
Up to
TB of
Echelon Clusters

Easily scale from server to cluster

As your team's compute needs grow, Lambda's in-house HPC engineers and AI researchers can help you integrate Scalar and Hyperplane servers into GPU clusters designed for deep learning.

  • Compute
    Scaling to 1000s of GPUs for distributed training or hyperparameter optimization.
  • Storage
    High-performance parallel file systems optimized for ML.
  • Networking
    Compute and storage fabrics for GPUDirect RDMA and GPUDirect Storage.
  • Software
    Fully integrated software stack for MLOps and cluster management.
Premium Support

Service and support by technical experts who specialize in machine learning

Lambda Premium Support includes:

  • Up to 5 year extended warranty with advanced parts replacement
  • Live technical support from Lambda's team of ML engineers
  • Support for ML software included in Lambda Stack: PyTorch®, Tensorflow, CUDA, CuDNN, and NVIDIA Drivers
Lambda Stack
lambda stack illustration

Plug in. Start training.

Our servers include Lambda Stack, which manages frameworks like PyTorch® and TensorFlow. With Lambda Stack, you can stop worrying about broken GPU drivers and focus on your research.

  • Zero configuration required
    All your favorite frameworks come pre-installed.
  • Easily upgrade PyTorch® and TensorFlow
    When a new version is released, just run a simple upgrade command.
  • No more broken GPU drivers
    Drivers will "just work" and keep compatible with popular frameworks.

Your servers. Our datacenter.

Lambda Colocation makes it easy to deploy and scale your machine learning infrastructure. We'll manage racking, networking, power, cooling, hardware failures, and physical security. Your servers will run in a Tier 3 data center with state-of-the-art cooling that's designed for GPUs. You'll get remote access to your servers, just like a public cloud.

Fast support
If hardware fails, our on-premise data center engineers can quickly debug and replace parts.
Optimal performance
Our state-of-the-art cooling keeps your GPUs cool to maximize performance and longevity.
High availability
Our Tier 3 data center has redundant power and cooling to ensure your servers stay online.
No network set up
We handle all network configuration and provide you with remote access to your servers.
Tech Specs

Technical Specifications

scalar 4u