Shared volumes have been renamed
Please see the docs for network file systems.