Reserve thousands of cloud NVIDIA H100s for just $1.89/hr

Train Foundation Models and LLMs with First AI Cloud Clusters featuring NVIDIA H100 + 3200 Gbps Infiniband

Finally, cloud computing designed for large scale model training

First AI Cloud Clusters are designed for machine learning engineers who need high-performance networking and enterprise-grade GPUs.

Our network architecture is designed for non-blocking, which allows your ML team to spin up one large model across 255 NVIDIA H100 servers with no disruption in networking speed.

Reserved

Fixed no-negotiation pricing. Secure the #1 GPU Cluster architecture at the lowest public price in the world.

# of GPUs: 64 to 60,000

$1.89: 100% upfront payment

$2.04: 33% upfront, 67% monthly payment

Minimum Term: 3 Years

Sprint

Get access to an NVIDIA H100 Cloud Cluster designed to do one thing: train an LLM or Foundation Model in record time.

# of GPUs: 248

$4.85/H100/Hour

Maximum Term: 3 Months

First AI Cloud Clusters powered by NVIDIA H100 GPUs

NOW AVAILABLE

First AI Cloud Clusters come with NVIDIA H100 Tensor Core GPUs and deliver unprecedented performance, scalability, and security for every workload. NVIDIA H100 uses breakthrough innovations in the NVIDIA Hopper™ architecture to deliver industry-leading conversational AI and speeds up large language models.

“Leading enterprises recognize the incredible capabilities of AI and are building it into their operations to transform customer service, sales, operations, and many other key functions. First AI’s deep expertise, combined with cutting-edge NVIDIA technology, is helping customers create flexible, scalable AI deployments on premises, in the cloud, or at a colocation data center.”

Craig Weinstein, Vice President of the Americas Partner Organization

The fastest network for distributed training of LLMs, foundation models & generative AI

Train large generative models and LLMs with the fastest networking available by any cloud provider. Our 3200 Gbps Infiniband networking is purpose built for GPU Direct inter-node bandwith, RDMA and distributed training.

Skip the CPU and take advantage of GPUDirect RDMA for the fastest distributed training

A direct communication path between NVIDIA GPUs across all nodes in your cluster using InfiniBand.

GPUDirect RDMAprovides a significant decrease in GPU-GPU communication latency and completely offloads the CPU, removing it from all GPU-GPU communications across the network.

	Instance type	GPU	GPU Memory	vCPUs	Storage	Network Bandwidth (Gbps)	Per Hour Price	Term	# of GPUs
Reserved	8x NVIDIA H100	H100 SXM	80 GB	200	20 TB NVMe SSD local storage minimum	3200	$1.89/H100/hour	3-years	64 - 60,000
Sprint	8x NVIDIA H100	H100 SXM	80 GB	224	27 TB NVMe SSD local storage minimum	3200	$4.85/H100/hour	3-months	248

Pre-configured for machine learning

Start training your models immediately with pre-configured software, shared storage, and networking for deep learning. All you have to do is choose your GPU nodes and CPU nodes.

First AI Premium Support for Cloud Clusters includes PyTorch, TensorFlow, CUDA, cudNN, Keras and Jupyter. Kubernetes is not included.

Reserve thousands of cloud NVIDIA H100s for just $1.89/hr

Get a Quote

Fill out the form below and we'll be in touch shortly

Finally, cloud computing designed for large scale model training

NVIDIA H100 SXM

3200 Gbps

Non-Blocking InfiniBand

Trusted by world-renowned AI engineers

The only cloud prioritizing flexibility and value for ML teams

Reserved

Get a Quote

Fill out the form below and we'll be in touch shortly

Sprint

Get a Quote

Fill out the form below and we'll be in touch shortly

First AI Cloud Clusters powered by NVIDIA H100 GPUs

NOW AVAILABLE

Get a Quote

Fill out the form below and we'll be in touch shortly

First AI is aiming to be an NVIDIA Elite Cloud Solutions Provider

The fastest network for distributed training of LLMs, foundation models & generative AI

Skip the CPU and take advantage of GPUDirect RDMA for the fastest distributed training

A direct communication path between NVIDIA GPUs across all nodes in your cluster using InfiniBand.

The best prices and value for NVIDIA H100 clusters in the industry

Pre-configured for machine learning

First AI On-Demand Cloud powered by NVIDIA H100 GPUs

NOW AVAILABLE