Featured examples
Serving Diffusion models
Serve Stable Diffusion XL on Modal with a number of optimizations for blazingly fast inference
Serverless TensorRT-LLM
Run large language models at SOTA performance by building a TRT-LLM engine on Modal
Stable Diffusion fine-tuning with Dreambooth
Fine-tune a Stable Diffusion model on images of your pet using Dreambooth
Voice chat with LLMs
Build a real-time voice chat app by combining speech-to-text, an LLM, and text-to-speech
Fast podcast transcriptions
Build an end-to-end podcast transcription app that leverages dozens of containers for super-fast processing
Document OCR job queue
Use Modal as an infinitely scalable job queue that can service async tasks from a web app
Parallel processing of Parquet files on S3
Analyze data from the Taxi and Limousine Commission of NYC in parallel
Retrieval-Augmented Generation for Q&A
Build a question-answering web endpoint that can cite its sources
Hacker News Slackbot
Use Modal to deploy a cron job that periodically queries Hacker News for new posts, and posts the results to Slack
Real-time Object Detection
Create a web endpoint that leverages HuggingFace models to do object detection in real-time
ControlNet playgrounds
Play with all 10 demo Gradio apps from the ControlNet project