Try DeepSeek-R1 on Modal! View example

High-performance
AI infrastructure

Serverless cloud for AI, ML, and data applications – built for developers
Get Started

Sub-second container starts

We built a Rust-based container stack from scratch so you can iterate as quickly in the cloud as you can locally.

View Docs

Zero config files

Easily define hardware and container requirements next to your Python functions.

View Docs

Scale to hundreds of GPUs in seconds

Never worry about hitting rate limits again. We autoscale containers for your functions instantly.

View Docs

Use Cases

Generative AI Inference that scales with you




View Examples

Fine-tuning and training without managing infrastructure

Fine-tuning graphic



View Examples

Batch processing optimized for high-volume workloads

Batch processing graphic



View Examples

Features







Only pay when your
code is running
Scale up to hundreds of nodes and down to zero within seconds. Pay for actual compute, by the CPU cycle. With $30 of compute on us, every month.

Compute costs


GPU Tasks

Nvidia H100

$0.001267 / sec

Nvidia A100, 80 GB

$0.000944 / sec

Nvidia A100, 40 GB

$0.000772 / sec

Nvidia L40S

$0.000542 / sec

Nvidia A10G

$0.000306 / sec

Nvidia L4

$0.000222 / sec

Nvidia T4

$0.000164 / sec


CPU

Physical core
(2 vCPU)

$0.000038 / core / sec

*minimum of 0.125 cores per container


Memory

$0.00000667 / GiB / sec

For teams
of all scales
Starter
For small teams and independent developers looking to level up.
Team
For startups and larger organizations looking to scale quickly.
Enterprise
For organizations prioritizing security, support, and reliability.

Security and governance





Learn More

Built with Modal

“Modal Sandboxes enable us to execute generated code securely and flexibly. With Modal's support, we expedited the development of our code interpreter feature and successfully integrated it into our chat platform, Le Chat, to better assist our users.”

Wendy Shang, AI Scientist

“Modal makes it easy to write code that runs on 100s of GPUs in parallel, transcribing podcasts in a fraction of the time.”

Mike Cohen, Head of Data

“Tasks that would have taken days to complete take minutes instead. We’ve saved thousands of dollars deploying LLMs on Modal.”

Rahul Sengottuvelu, Head of Applied AI

“The beauty of Modal is that all you need to know is that you can scale your function calls in the cloud with a few lines of Python.”

Georg Kucsko, Co-founder and CTO

Join Modal's developer
community
Modal Community Slack

If you building AI stuff with Python and haven't tried @modal_labs you are missing out big time

@modal_labs continues to be magical... 10 minutes of effort and the `joblib`-based parallelism I use to test on my local machine can trivially scale out on the cloud. Makes life so easy!

This tool is awesome. So empowering to have your infra needs met with just a couple decorators. Good people, too!

Recently built an app on Lambda and just started to use @modal_labs, the difference is insane! Modal is amazing, virtually no cold start time, onboarding experience is great 🚀

Probably one of the best piece of software I'm using this year: modal.com

feels weird at this point to use anything else than @modal_labs for this — absolutely the GOAT of dynamic sandboxes

Nothing beats @modal_labs when it comes to deploying a quick POC

Late to the party, but finally playing with @modal_labs to run some backend jobs. DX is sooo nice (compared to Docker, Cloud Run, Lambda, etc). Just decorate a Python function and deploy. And it's fast! Love it.

If you building AI stuff with Python and haven't tried @modal_labs you are missing out big time

@modal_labs continues to be magical... 10 minutes of effort and the `joblib`-based parallelism I use to test on my local machine can trivially scale out on the cloud. Makes life so easy!

This tool is awesome. So empowering to have your infra needs met with just a couple decorators. Good people, too!

Recently built an app on Lambda and just started to use @modal_labs, the difference is insane! Modal is amazing, virtually no cold start time, onboarding experience is great 🚀

Probably one of the best piece of software I'm using this year: modal.com

feels weird at this point to use anything else than @modal_labs for this — absolutely the GOAT of dynamic sandboxes

Nothing beats @modal_labs when it comes to deploying a quick POC

Late to the party, but finally playing with @modal_labs to run some backend jobs. DX is sooo nice (compared to Docker, Cloud Run, Lambda, etc). Just decorate a Python function and deploy. And it's fast! Love it.

Ship your first app in minutes.

Get Started

$30 / month free compute