Sandbox resources and pricing

This page covers resource configuration and pricing for Modal Sandboxes. For general documentation on CPU and memory options, see Reserving CPU and memory.

Pay for what you use 

Modal Sandboxes are billed by the second based on whichever is higher: your resource request or your actual usage.

Sandboxes can burst beyond their CPU and memory requests when additional resources are available on the underlying host. Your request guarantees a minimum level of resources, but when spare capacity exists, your Sandbox can use more. You pay for max(request, actual).

See Billing in the resource guide for more details.

Configuring resources 

Set CPU and memory requests using the cpu and memory parameters when creating your Sandbox. The cpu parameter specifies physical CPU cores (1 core = 2 vCPUs), and memory specifies MiB:

import modal

app = modal.App.lookup("my-app", create_if_missing=True)

sb = modal.Sandbox.create(
    cpu=0.5,
    memory=512,
    app=app,
)

For details on default values and maximum limits, see Reserving CPU and memory.

Resource limits 

You can set upper limits to cap how much a Sandbox can burst. This is particularly useful when an AI agent controls what runs inside the Sandbox, as it prevents misbehaving or adversarial workloads from consuming unbounded resources:

sb = modal.Sandbox.create(
    cpu=(0.5, 4.0),       # Request 0.5 cores, limit to 4 cores
    memory=(512, 2048),   # Request 512 MiB, limit to 2048 MiB
    app=app,
)

See Resource limits for details on how CPU and memory limits behave.

Tuning your requests 

For maximum cost-efficiency, base your requests on observed usage percentiles rather than peaks: around p50–75 for CPU and p90–95 for memory.

If your observed usage is consistently higher than your request, you may run into resource contention on the host. This can manifest as OOM errors or CPU throttling.

The goal should be to set requests that correspond to your base load while letting bursting handle occasional spikes.

  1. Start with default values for CPU and memory.

  2. Run your typical workload and observe actual resource usage in the Modal dashboard to understand your baseline needs.

  3. Set your request to match the resource level your workload consistently needs.

GPU Sandboxes 

You can also run Sandboxes with GPUs. See GPU acceleration for available GPU types and configuration.

Unlike CPU Sandboxes, GPU Sandboxes are subject to preemption. Design your GPU workloads to handle interruptions gracefully.

Additional resources