December 16, 20245 minute read
How to run Nomic Embed V1.5 on Modal
author
Yiren Lu@YirenLu
Solutions Engineer

Introduction to Nomic Embed V1.5

Nomic Embed V1.5 is the latest version of the Nomic Embed model, a powerful text embedding model that frequently ranks near the top of the MTEB embedding model leaderboard. It is designed for various applications, including retrieval-augmented generation (RAG), clustering, and classication. This model allows you to convert sentences into dense vector representations, making it easier to perform semantic searches and analyze text data effectively.

Note that for each of these use cases, you will need to include a task instruction prefix to the sentences you want to embed as part of the prompt. For more details, see the Nomic Embed documentation.

Example code for running the Nomic Embed V1.5 embedding model on Modal

To run the following code, you will need to:

  1. Create an account at modal.com
  2. Run pip install modal to install the modal Python package
  3. Run modal setup to authenticate (if this doesn’t work, try python -m modal setup)
  4. Copy the code below into a file called app.py
  5. Run modal run app.py
import modal

MODEL_ID = "nomic-ai/nomic-embed-text-v1.5"
MODEL_REVISION = "d802ae16c9caed4d197895d27c6d529434cd8c6d"

image = modal.Image.debian_slim().pip_install(
    "torch", "sentence-transformers", "einops"
)
app = modal.App("example-base-nomic-embed", image=image)

GPU_CONFIG = modal.gpu.H100(count=1)

CACHE_DIR = "/cache"
cache_vol = modal.Volume.from_name("hf-hub-cache", create_if_missing=True)

@app.cls(
    gpu=GPU_CONFIG,
    volumes={CACHE_DIR: cache_vol},
    allow_concurrent_inputs=15,
    container_idle_timeout=60 * 10,
    timeout=60 * 60,
)
class Model:
    @modal.enter()
    def setup(self):
        from sentence_transformers import SentenceTransformer

        self.model = SentenceTransformer(
            MODEL_ID, revision=MODEL_REVISION, cache_dir=CACHE_DIR, trust_remote_code=True
        )

    @modal.method()
    def embed(self, sentences: list):
        return self.model.encode(sentences)


# ## Run the model
@app.local_entrypoint()
def main():
    sentences = [
        "search_document: TSNE is a dimensionality reduction algorithm created by Laurens van Der Maaten"
    ]

    print(Model().embed.remote(sentences))

Ship your first app in minutes.

Get Started

$30 / month free compute