Average Ratings 0 Ratings
Description
Amazon EC2 G4 instances are designed to accelerate machine learning inference and graphics-intensive applications. Users can choose between NVIDIA T4 GPUs (G4dn) and AMD Radeon Pro V520 GPUs (G4ad). G4dn instances pair NVIDIA T4 GPUs with custom Intel Cascade Lake CPUs, balancing compute, memory, and networking bandwidth; they are well suited to deploying machine learning models, video transcoding, game streaming, and graphics rendering. G4ad instances pair AMD Radeon Pro V520 GPUs with 2nd-generation AMD EPYC processors and offer a lower-cost option for graphics-intensive workloads. Amazon Elastic Inference, a separate AWS service, lets users attach low-cost GPU-powered inference acceleration to Amazon EC2 instances to reduce the cost of deep learning inference. G4 instances come in a range of sizes to meet different performance needs and integrate with AWS services including Amazon SageMaker, Amazon ECS, and Amazon EKS.
Description
NVIDIA DGX Cloud Lepton is an AI platform that connects developers to a global network of GPU compute across multiple cloud providers through a single interface. It offers a unified experience for discovering and using GPU capacity, along with integrated AI services that streamline the deployment lifecycle across clouds. Developers can start with immediate access to NVIDIA's accelerated APIs, using serverless endpoints, prebuilt NVIDIA Blueprints, and GPU-backed compute. When it is time to scale, DGX Cloud Lepton supports customization and deployment across its global network of GPU cloud providers. AI applications can be deployed across any GPU cloud and run in multi-cloud and hybrid environments with reduced operational complexity, using integrated services for inference, testing, and training workloads.
API Access
Has API
Integrations
AMD Radeon ProRender
Amazon EC2
Amazon EKS
Amazon Elastic Inference
Amazon SageMaker
Amazon Web Services (AWS)
CUDA
CoreWeave
Crusoe
Fluidstack
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name: Amazon
Founded: 1994
Country: United States
Website: aws.amazon.com/ec2/instance-types/g4/
Vendor Details
Company Name: NVIDIA
Founded: 1993
Country: United States
Website: www.nvidia.com/en-us/data-center/dgx-cloud-lepton/
Product Features
Deep Learning
Convolutional Neural Networks
Document Classification
Image Segmentation
ML Algorithm Library
Model Training
Neural Network Modeling
Self-Learning
Visualization