Build intelligent applications with Modal's serverless infrastructure and MongoDB Atlas's data platform.
Learn how Contextual AI accelerated their developer iteration speed by using Modal to run tests on GPUs.
Learn how we used our new dynamic batching feature to improve throughput and reduce inference costs for the Whisper model with a single line of code!
A step-by-step guide to building a scalable analytics stack using Modal, dlt, and dbt for efficient data loading, transformation, and deployment.
Welcome to another round of Modal Product Updates! Here's what's new this month.
You can now enter BAAs with Modal to run HIPAA-compliant workloads.
How Modal's autoscaling works when running ComfyUI as an API.
How we built an in-browser code playground using Modal Sandboxes.
...and we're passing the savings to you. 15-30% price cuts on GPUs and CPUs.
Scale up smaller open models with search and evaluation to match frontier capabilities.
Welcome to another round of Modal Product Updates! Here's what's new this month.
Learn how Basis partnered with Modal to bring the spirit of competitive programming to prompt engineering.
Isolate your tasks with Modal containers while using Airflow for orchestration.
See how Modal combats cryptomining abuse with syscall-based program analysis to secure GPUs for legitimate users.
Find out how Hunch uses Modal to run AI code even its users don't trust.
How we fine-tuned a Stable Diffusion model on the Heroicons library to generate all the icons we could dream of.
Learn how Substack sped up their developer iteration cycles by moving ML training and deployment to Modal from AWS SageMaker.
You can now specify which cloud region you would like to run your Functions in.
Welcome to another round of Modal Product Updates! Here's what's new this month.
Fine-tune on just a few hundred examples and kick off your very own data flywheel.
Easily develop and deploy custom ETL jobs while saving 99% on sync costs.
Celebrating the best in enterprise tech.
This guide shows how to convert a ComfyUI workflow to Python code as an alternative way to productionize it.
Find out how Ramp uses Modal to customize open source LLMs to automate receipt processing.
In this post, we'll talk about how Modal handles real-time HTTP requests and WebSockets in serverless functions.
Modal now supports WebSocket connections, enabling real-time, bidirectional data transfer between client and server.
Find out how Suno uses Modal to scale inference and batch pre-processing to thousands of GPUs.
We've been busy in 2024 so far, bringing you WebSockets, interactive commands, H100s and more. Learn about what's new at Modal.
We’re excited to be making Nvidia H100 GPUs available on Modal starting today!
Leverage Modal’s parallel batch jobs and in-house storage features to quickly generate embeddings for billions of tokens.
An operational guide to fine-tuning an LLM on any dataset in minutes (ft. CodeLlama, Llama 2, Mistral, and more)
An intro to fine-tuning large language models in 2024
Modal officially launches today with no waitlist. And we also raised a Series A!
Modal Labs Announces Series A Financing Round, Securing $16 Million Investment to Launch Cloud-Based Infrastructure Platform, Build Towards End-to-End Enterprise Data Stack
Modal is excited to announce that it has successfully completed a System and Organization Controls (SOC) 2 Type 1 audit.