
Miscellaneous examples

Looking for how to make a popular model or library work with Modal? There’s a guide for that:

  • Benchmark the Tokasaurus high-throughput LLM server
  • Deploy ControlNet demos with Gradio
  • Deploy Ollama
  • FastHTML: Deploy 100,000 multiplayer checkboxes
  • Generate Slack-ready GIFs with the Claude SDK
  • Generate video clips with Mochi
  • MultiOn: Create an agent for AI news
  • Override Modal resource options (GPU, scaling) at runtime
  • Profile PyTorch code
  • Real-time object detection with webcam input
  • Run batched Whisper transcription
  • Run Claude Code in a Modal Sandbox
  • Run continuous integration (CI) tests on Modal
  • Run OpenAI’s gpt-oss model
  • Run OpenCV to detect faces
  • Run SAM 2 video segmentation model
  • Run Text Embedding Inference (TEI)
  • Run Vision-Language Models with SGLang
  • Serve TensorRT-LLM at maximum throughput

You can find even more examples in the modal-examples GitHub repository, and larger projects built by Modal users in the awesome-modal GitHub repository.

© Modal 2026