Blog

Blog post cover
September 24, 2024
Hybrid Search over California Embeddings with Modal, MongoDB, and Clay

Build intelligent applications with Modal's serverless infrastructure and MongoDB Atlas's data platform.

Blog post cover
September 18, 2024
How Contextual AI automated CI with Modal GPUs

Learn how Contextual AI accelerated their developer iteration speed by using Modal to run tests on GPUs.

Blog post cover
September 16, 2024
Boost your throughput with dynamic batching

Learn how we used our new dynamic batching feature to improve throughput and reduce inference costs for the Whisper model with a single line of code!

Blog post cover
September 10, 2024
Building a cost-effective analytics stack with Modal, dlt, and dbt

A step-by-step guide to building a scalable analytics stack using Modal, dlt, and dbt for efficient data loading, transformation, and deployment.

Blog post cover
September 6, 2024
Product Updates: Rollbacks, batching, sandbox tunnels & more

Welcome to another round of Modal Product Updates! Here's what's new this month.

Blog post cover
September 4, 2024
Modal supports HIPAA compliance

You can now enter BAAs with Modal to run HIPAA-compliant workloads.

Blog post cover
August 21, 2024
Scaling ComfyUI

How Modal's autoscaling works when running ComfyUI as an API.

Blog post cover
August 16, 2024
Inside the Modal Code Playground

How we built an in-browser code playground using Modal Sandboxes.

Blog post cover
August 6, 2024
GPU prices are falling...

...and we're passing the savings to you. 15-30% price cuts on GPUs and CPUs.

Blog post cover
August 5, 2024
Beat GPT-4o at Python by searching with 100 dumb LLaMAs

Scale up smaller open models with search and evaluation to match frontier capabilities.

Blog post cover
July 9, 2024
Product Updates: Datadog Integration, lower function latency & more

Welcome to another round of Modal Product Updates! Here's what's new this month.

Blog post cover
July 3, 2024
Competitive prompt engineering

Learn how Basis partnered with Modal to bring the spirit of competitive programming to prompt engineering.

Blog post cover
June 20, 2024
Run GPU jobs from Airflow with Modal

Isolate your tasks with Modal containers while using Airflow for orchestration.

Blog post cover
June 6, 2024
How to catch crypto miners using syscall signatures

See how Modal combats cryptomining abuse with syscall-based program analysis, to secure GPUs for legitimate users.

Blog post cover
May 23, 2024
How Hunch supercharged AI workflows with Modal Sandboxes

Find out how Hunch uses Modal to run AI code even its users don't trust.

Blog post cover
May 21, 2024
Create an infinite icon library by fine-tuning Stable Diffusion

How we fine-tuned a Stable Diffusion model on the Heroicons library to generate all the icons we could dream of.

Blog post cover
May 20, 2024
Why Substack moved their AI and ML pipelines to Modal

Learn how Substack sped up their developer iteration cycles by moving ML training and deployment to Modal from AWS SageMaker.

Blog post cover
May 13, 2024
Introducing: Region selection

You can now specify which cloud region you would like to run your Functions in.

Blog post cover
May 7, 2024
Product Updates: Cloud buckets, Okta SSO & more

Welcome to another round of Modal Product Updates! Here's what's new this month.

Blog post cover
April 26, 2024
Beating Proprietary Models with a Quick Fine-Tune

Fine-tune on just a few hundred examples and kick off your very own data flywheel.

Blog post cover
April 18, 2024
Why you should move your ETL stack to Modal

Easily develop and deploy custom ETL jobs while saving 99% on sync costs.

Blog post cover
April 10, 2024
Modal named to 2024 Enterprise Tech 30 list by Wing

Celebrating the best in enterprise tech.

Blog post cover
April 2, 2024
How to convert a ComfyUI workflow to Python code

This guide shows how to convert a ComfyUI workflow to Python code as an alternative way to productionize a ComfyUI workflow.

Blog post cover
March 26, 2024
How Ramp automated receipt processing with fine-tuned LLMs

Find out how Ramp uses Modal to customize open source LLMs to automate receipt processing.

Blog post cover
March 14, 2024
Lambda on hard mode: Inside Modal's web infrastructure

In this post, we'll talk about how Modal handles real-time HTTP requests and WebSockets in serverless functions.

Blog post cover
February 27, 2024
Introducing: WebSockets on Modal

Modal now supports WebSocket connections, enabling real-time, bidirectional data transfer between client and server.

Blog post cover
February 21, 2024
How Suno shaved 4 months off their launch timeline with Modal

Find out how Suno uses Modal to scale inference and batch pre-processing to thousands of GPUs.

Blog post cover
February 15, 2024
Product Updates: WebSocket support, interactive commands & more

We've been busy in 2024 so far, bringing you WebSockets, interactive commands, H100s and more. Learn about what's new at Modal.

Blog post cover
February 6, 2024
Introducing: H100s on Modal

We’re excited to be making Nvidia H100 GPUs available on Modal starting today!

Blog post cover
January 23, 2024
Embedding English Wikipedia in under 15 minutes

Leverage Modal’s parallel batch jobs and in-house storage features to quickly generate embeddings for billions of tokens.

Blog post cover
December 20, 2023
How to fine-tune an LLM on Modal

An operational guide to fine-tuning an LLM on any dataset in minutes (ft. CodeLlama, Llama 2, Mistral, and more)

Blog post cover
November 7, 2023
What is LLM fine-tuning?

An intro to fine-tuning large language models in 2024

Blog post cover
October 10, 2023
Modal is now generally available

Modal offically launches today with no waitlist. And we also raised a Series A!

Blog post cover
October 10, 2023
Press Release: Modal Labs Announces Series A Financing Round

Modal Labs Announces Series A Financing Round, Securing $16 Million Investment to Launch Cloud-Based Infrastructure Platform, Build Towards End-to-End Enterprise Data Stack

Blog post cover
June 15, 2023
Modal is SOC2 Compliant

Modal is excited to announce that it has successfully completed a System and Organization Controls (SOC) 2 Type 1 audit.

Ship your first app in minutes

with $30 / month free compute