How to run Nomic Embed V1.5 on Modal

All posts

Back

Model Library

December 16, 2024•5 minute read

Yiren Lu@YirenLu

Solutions Engineer

Introduction to Nomic Embed V1.5

Nomic Embed V1.5 is the latest version of the Nomic Embed model, a powerful text embedding model that frequently ranks near the top of the MTEB embedding model leaderboard. It is designed for various applications, including retrieval-augmented generation (RAG), clustering, and classication. This model allows you to convert sentences into dense vector representations, making it easier to perform semantic searches and analyze text data effectively.

Note that for each of these use cases, you will need to include a task instruction prefix to the sentences you want to embed as part of the prompt. For more details, see the Nomic Embed documentation.

To run the following code, you will need to:

Create an account at modal.com
Run pip install modal to install the modal Python package
Run modal setup to authenticate (if this doesn’t work, try python -m modal setup)
Copy the code below into a file called app.py
Run modal run app.py

import modal

MODEL_ID = "nomic-ai/nomic-embed-text-v1.5"
MODEL_REVISION = "d802ae16c9caed4d197895d27c6d529434cd8c6d"

image = modal.Image.debian_slim().pip_install(
    "torch", "sentence-transformers", "einops"
)
app = modal.App("example-base-nomic-embed", image=image)

GPU_CONFIG = modal.gpu.H100(count=1)

CACHE_DIR = "/cache"
cache_vol = modal.Volume.from_name("hf-hub-cache", create_if_missing=True)

@app.cls(
    gpu=GPU_CONFIG,
    volumes={CACHE_DIR: cache_vol},
    allow_concurrent_inputs=15,
    container_idle_timeout=60 * 10,
    timeout=60 * 60,
)
class Model:
    @modal.enter()
    def setup(self):
        from sentence_transformers import SentenceTransformer

        self.model = SentenceTransformer(
            MODEL_ID, revision=MODEL_REVISION, cache_dir=CACHE_DIR, trust_remote_code=True
        )

    @modal.method()
    def embed(self, sentences: list):
        return self.model.encode(sentences)


# ## Run the model
@app.local_entrypoint()
def main():
    sentences = [
        "search_document: TSNE is a dimensionality reduction algorithm created by Laurens van Der Maaten"
    ]

    print(Model().embed.remote(sentences))

Introduction to Nomic Embed V1.5

Example code for running the Nomic Embed V1.5 embedding model on Modal

Ship your first app in minutes.