Serve custom AI models at scale
Add one line of code to run any function in the cloud. Get instant autoscaling for ML inference, data jobs, and more.
Add one line of code to run any function in the cloud. Get instant autoscaling for ML inference, data jobs, and more.
We built a Rust-based container stack from scratch so you can iterate as quickly in the cloud as you can locally.
Bring your own image or build one in Python, scale resources as needed, and leverage state-of-the-art GPUs like H100s & A100s for high-performance computing.
Export function logs to Datadog or any OpenTelemetry-compatible provider, and easily mount cloud storage from major providers (S3, R2 etc.).
Manage data effortlessly with storage solutions (network volumes, key-value stores and queues). Provision storage types and interact with them using familiar Python syntax.
Take control of your workloads with powerful scheduling. Set up cron jobs, retries, and timeouts, or use batching to optimize resource usage.
Deploy and manage web services with ease. Create custom domains, set up streaming and websockets, and serve functions as secure HTTPS endpoints.
Troubleshoot efficiently with built-in debugging tools. Use the modal shell for interactive debugging and set breakpoints to pinpoint issues quickly.
Load gigabytes of weights in seconds with our optimized container file system.
Deploy anything from custom models to popular frameworks.
Handle bursty and unpredictable load by scaling to thousands of GPUs and back down to zero.
Provision Nvidia A100 and H100 GPUs in seconds. Your drivers and custom packages are already there.
Run as many experiments as you need to, in parallel. Stop paying for idle GPUs when you're done.
Mount weights and data in distributed volumes, then access them wherever they're needed.
Serverless, but for high-performance compute. Run things on massive amounts of CPU and memory.
Pay only for resources consumed, by the second, as you spin up containers.
Simple fan-out parallelism that scales to thousands of containers, with a single line of Python.
Security
and governance
The secure application kernel for containers, providing top-tier isolation in multi-tenant setups.
Fully compliant with SOC 2. Run HIPAA-compliant workloads. We have industry-standard security, availability, and confidentiality.
Deploy globally with enhanced compliance across geographic regions.
Enterprise-grade SSO for transparent, streamlined access management.
Wendy Shang, AI Scientist
Ship your first app in
minutes.
$30 / month free compute