Articles

November 2, 2024
Stable Diffusion 3.5 vs. Flux: top text-to-image models

Learn about top text-to-image models on the Artificial Analysis leaderboard

October 31, 2024
How much is an Nvidia A100?

Learn about the cost of Nvidia A100 GPUs and explore top GPU-on-demand platforms for accessing this powerful hardware.

October 30, 2024
Top embedding models for RAG

Learn how to select an embedding model for your RAG system

October 30, 2024
Top image segmentation models

Learn which models to use to segment out objects in images and videos

October 30, 2024
Top open-source text-to-video AI models

Learn about the top open-source text-to-video AI models

October 17, 2024
What is Flux Dev?

Learn about the most popular text-to-image diffusion model on the market

October 16, 2024
What is Flash Attention?

Learn how to speed up your model training and inference with Flash Attention

October 15, 2024
Glossary: LLM fine-tuning hyperparameters

Confused about what each hyperparameter means when you're doing LLM fine-tuning? Our glossary will help.

October 15, 2024
Fine-tuning vs. RAG

You want to tailor an LLM to your custom dataset. Should you fine-tune or build a RAG system?

October 15, 2024
Build interactive workflows using Kestra and Modal

Learn how to create interactive workflows that dynamically adapt to user inputs with Kestra’s open-source orchestration platform and Modal’s serverless infrastructure.

October 15, 2024
Top embedding models on the MTEB leaderboard

Overview of the top ranking embedding models on the MTEB leaderboard

October 15, 2024
vLLM vs. TGI

Learn how to speed up your model training and inference with vLLM or TGI

September 27, 2024
Top 5 serverless GPU providers

Learn all about top serverless GPU providers

September 25, 2024
AWS Lambda vs. Google Cloud functions: a comprehensive comparison

How do AWS Lambda and Google Cloud Functions compare? This article provides a detailed comparison of these two popular serverless execution environments.

September 25, 2024
Dagster vs. Airflow: a comprehensive comparison

An in-depth look at the differences between Dagster and Airflow for data orchestration

September 25, 2024
Google Cloud Run functions pricing: understanding costs and optimization

A comprehensive guide to the pricing model for Google Cloud Run functions, including differences between 1st and 2nd gen, CPU and memory allocation, and key pricing metrics. Learn how to optimize your serverless costs.

September 25, 2024
Google Cloud Run vs. Cloud Run Functions: understanding Google's serverless offerings

Explore the relationship between Google Cloud Run and Cloud Run Functions, their key differences, and how to choose the right serverless option for your needs.

September 25, 2024
RabbitMQ vs. Kafka: choosing the right messaging system

Learn about the key differences between RabbitMQ and Apache Kafka, their use cases, and how to choose the right messaging system for your needs.

September 25, 2024
Best practices for serverless inference

Learn about gotchas and best practices for serverless inference

September 23, 2024
Open-source AI agents

A roundup of popular open-source AI agents like OpenHands (formerly OpenDevin), SWE-agent, and Devika.

September 18, 2024
How to run Llama 3.1 as an API

Serve Meta's foundational Llama 3.1 models via API

September 15, 2024
How to get GPUs with a Jupyter notebook on Modal

Learn how to launch a Jupyter notebook backed by Modal GPUs with this step-by-step guide.

September 15, 2024
How to run ChatTTS

Learn how to run ChatTTS text-to-speech with this step-by-step guide.

September 15, 2024
How to deploy a Gradio app

Deploying a Gradio app on Modal

September 15, 2024
How to run Llama3-405B

Learn how to run Llama3-405B on Modal with this step-by-step guide.

September 15, 2024
How to run Ollama

Learn how to run Ollama on Modal with this step-by-step guide.

September 15, 2024
How to run XTTS

Learn how to run XTTS text-to-speech with this step-by-step guide.

September 14, 2024
How to deploy code in AWS Lambda: the easy way for beginners

Deploying your first Lambda function using AWS SAM

September 8, 2024
Fast, lazy container loading in Modal.com

Deep dive on Modal's optimizations for fast, lazy container loading

September 4, 2024
Upload files to S3 with AWS Lambda and AWS API Gateway in TypeScript: A Step-by-Step Guide

Learn how to create a serverless solution for uploading JPEG images to Amazon S3 using AWS API Gateway and Lambda with TypeScript

September 4, 2024
Batch processing vs. stream processing by example

Understand the crucial differences between batch processing and stream processing by example

September 1, 2024
How much VRAM do I need for LLM model fine-tuning?

Estimating VRAM requirements for large language model fine-tuning

September 1, 2024
How much VRAM do I need for LLM inference?

Estimating VRAM requirements for large language model inference

August 26, 2024
A10 vs. A100 vs. H100 - Which one should you choose?

Discover the best GPU for your AI workload: Compare A10, A100, and H100 performance, pricing, and use cases to make an informed decision.

August 23, 2024
A1111 vs ComfyUI

Which Stable Diffusion web UI should I use?

August 22, 2024
LoRA vs. QLoRA: Efficient fine-tuning techniques for LLMs

Learn the differences between LoRA and QLoRA, two different efficient fine-tuning techniques for large language models.

August 16, 2024
Top open-source text-to-speech libraries in 2024

Explore the top open-source text-to-speech libraries available in 2024, including TortoiseTTS, XTTS, StyleTTS, MeloTTS, OpenVoice v2, and VITS. Learn about their unique features and potential applications.

August 16, 2024
How Modal speeds up container launches in the cloud

Optimizations for blazing fast container launches

August 15, 2024
How much is an Nvidia H100?

Learn about the cost of Nvidia H100 GPUs and explore top GPU-on-demand platforms for accessing this powerful hardware.

August 15, 2024
All the open-source Whisper variations

WhisperX, Deepgram

August 10, 2024
Best frameworks for fine-tuning LLMs in 2024

Axolotl vs. Unsloth vs. Torchtune

May 6, 2024
Best open-source LLMs

Overview of the best open-source llms

April 30, 2024
How to run cron jobs

A brief explanation of cron jobs, cron syntax, and how to run cron jobs on Modal.

Ship your first app in minutes.

Get Started

$30 / month free compute