High-performance
AI infrastructure

Serverless cloud for AI, ML, and data applications – built for developers
Get Started

Cloud development made frictionless

Run generative AI models, large-scale batch jobs, job queues, and much more. Bring your own code — we run the infrastructure.

View Docs

Iterate at the speed of thought

Make code changes and watch your app rebuild instantly. Never write a single line of YAML again.

View Docs

Built for large-scale workloads

Engineered in Rust, our custom container stack allows you to scale to hundreds of GPUs and then back down to zero in seconds. Pay only while it's running.

View Docs

Use Cases

Generative AI Inference that scales with you




View Examples

Fine-tuning and training without managing infrastructure

Fine-tuning graphic



View Examples

Batch processing optimized for high-volume workloads

Batch processing graphic



View Examples

Features







Only pay when your
code is running
Scale up to hundreds of nodes and down to zero within seconds. Pay for actual compute, by the CPU cycle. With $30 of compute on us, every month.

Compute costs


GPU Tasks

Nvidia H100

$0.001267 / sec

Nvidia A100, 80 GB

$0.000944 / sec

Nvidia A100, 40 GB

$0.000772 / sec

Nvidia L40S

$0.000542 / sec

Nvidia A10G

$0.000306 / sec

Nvidia L4

$0.000222 / sec

Nvidia T4

$0.000164 / sec


CPU

Physical core
(2 vCPU)

$0.000038 / core / sec

*minimum of 0.125 cores per container


Memory

$0.00000667 / GiB / sec

For teams
of all scales
Starter
For small teams and independent developers looking to level up.
Team
For startups and larger organizations looking to scale quickly.
Enterprise
For organizations prioritizing security, support, and reliability.

Security and governance





Learn More

Built with Modal

“Modal makes it easy to write code that runs on 100s of GPUs in parallel, transcribing podcasts in a fraction of the time.”

Mike Cohen, Head of Data

“Tasks that would have taken days to complete take minutes instead. We’ve saved thousands of dollars deploying LLMs on Modal.”

Rahul Sengottuvelu, Head of Applied AI

“The beauty of Modal is that all you need to know is that you can scale your function calls in the cloud with a few lines of Python.”

Georg Kucsko, Co-founder and CTO

Case Study
Join Modal's developer
community
Modal Community Slack

If you building AI stuff with Python and haven't tried @modal_labs you are missing out big time

@modal_labs continues to be magical... 10 minutes of effort and the `joblib`-based parallelism I use to test on my local machine can trivially scale out on the cloud. Makes life so easy!

This tool is awesome. So empowering to have your infra needs met with just a couple decorators. Good people, too!

Recently built an app on Lambda and just started to use @modal_labs, the difference is insane! Modal is amazing, virtually no cold start time, onboarding experience is great 🚀

Probably one of the best piece of software I'm using this year: modal.com

feels weird at this point to use anything else than @modal_labs for this — absolutely the GOAT of dynamic sandboxes

Nothing beats @modal_labs when it comes to deploying a quick POC

Late to the party, but finally playing with @modal_labs to run some backend jobs. DX is sooo nice (compared to Docker, Cloud Run, Lambda, etc). Just decorate a Python function and deploy. And it's fast! Love it.

If you building AI stuff with Python and haven't tried @modal_labs you are missing out big time

@modal_labs continues to be magical... 10 minutes of effort and the `joblib`-based parallelism I use to test on my local machine can trivially scale out on the cloud. Makes life so easy!

This tool is awesome. So empowering to have your infra needs met with just a couple decorators. Good people, too!

Recently built an app on Lambda and just started to use @modal_labs, the difference is insane! Modal is amazing, virtually no cold start time, onboarding experience is great 🚀

Probably one of the best piece of software I'm using this year: modal.com

feels weird at this point to use anything else than @modal_labs for this — absolutely the GOAT of dynamic sandboxes

Nothing beats @modal_labs when it comes to deploying a quick POC

Late to the party, but finally playing with @modal_labs to run some backend jobs. DX is sooo nice (compared to Docker, Cloud Run, Lambda, etc). Just decorate a Python function and deploy. And it's fast! Love it.

Ship your first app in minutes.

Get Started

$30 / month free compute