LanceDB

LanceDB is an open-source vector database for AI that's designed to store, manage, query and retrieve embeddings on large-scale multi-modal data. The core of LanceDB is written in Rust 🦀 and is built on top of Lance, an open-source columnar data format designed for performant ML workloads and fast random access.

Both the database and the underlying data format are designed from the ground up to be easy-to-use, scalable and cost-effective.

Hosted LanceDB

If you want S3 cost-efficiency and local performance via a simple serverless API, check out LanceDB Cloud. For private deployments, high performance at extreme scale, or if you have strict security requirements, talk to us about LanceDB Enterprise.

Truly multi-modal

Most existing vector databases store and query just the embeddings and their metadata. The actual data is stored elsewhere, requiring you to manage its storage and versioning separately.

LanceDB supports storage of the actual data itself, alongside the embeddings and metadata. You can persist your images, videos, text documents, audio files and more in the Lance format, which provides automatic data versioning and blazing fast retrievals and filtering via LanceDB.

Open-source and cloud solutions

LanceDB is available in two flavors: OSS and Cloud.

LanceDB OSS is an open-source, batteries-included embedded vector database that you can run on your own infrastructure. "Embedded" means that it runs in-process, making it incredibly simple to self-host your own AI retrieval workflows for RAG and more. No servers, no hassle.
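To make "in-process" concrete, here is a minimal sketch of a vector table that lives inside the application itself, with no server to start or connect to. The class `TinyVectorTable` and its methods are hypothetical names invented for this illustration; this is a brute-force toy using only the Python standard library, not LanceDB's API or its indexing strategy.

```python
import math


class TinyVectorTable:
    """Toy in-process vector table (hypothetical; not LanceDB's API)."""

    def __init__(self):
        # Each row is a dict with a "vector" field plus arbitrary metadata.
        self.rows = []

    def add(self, rows):
        self.rows.extend(rows)

    def search(self, query, limit=2, where=None):
        # Optional metadata filter, then brute-force Euclidean ranking.
        candidates = [r for r in self.rows if where is None or where(r)]
        ranked = sorted(candidates, key=lambda r: math.dist(r["vector"], query))
        return ranked[:limit]


# Everything below runs in the caller's own process.
table = TinyVectorTable()
table.add([
    {"vector": [3.1, 4.1], "item": "foo", "price": 10.0},
    {"vector": [5.9, 26.5], "item": "bar", "price": 20.0},
    {"vector": [10.0, 10.0], "item": "baz", "price": 30.0},
])

nearest = table.search([9.0, 9.0], limit=2)
cheap = table.search([9.0, 9.0], limit=2, where=lambda r: r["price"] < 25)

print([r["item"] for r in nearest])  # ['baz', 'foo']
print([r["item"] for r in cheap])    # ['foo', 'bar']
```

A real embedded database replaces the linear scan with a disk-based index and persistent storage, but the calling pattern is the same idea: import a library, open a table, and query it from your own code.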

LanceDB Cloud is a SaaS (software-as-a-service) solution that runs serverless in the cloud, with storage clearly separated from compute. It's designed to be cost-effective and highly scalable. LanceDB Cloud is currently in private beta with general availability coming soon, but you can apply for early access with the private beta release by signing up below.

Try out LanceDB Cloud (Public Beta) Now

Why use LanceDB?

  • Embedded (OSS) and serverless (Cloud) - no need to manage servers

  • Fast production-scale vector similarity, full-text & hybrid search and a SQL query interface (via DataFusion)

  • Python, JavaScript/TypeScript, and Rust support

  • Store, query & manage multi-modal data (text, images, videos, point clouds, etc.), not just the embeddings and metadata

  • Tight integration with the Arrow ecosystem, allowing true zero-copy access in shared memory with SIMD and GPU acceleration

  • Automatic data versioning to manage versions of your data without needing extra infrastructure

  • Disk-based index & storage, allowing for massive scalability without breaking the bank

  • Ingest your favorite data formats directly, like pandas DataFrames, Pydantic objects, Polars (coming soon), and more
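As a rough illustration of what "automatic data versioning" in the list above means for a user, the sketch below models a table where every write produces a new version and older versions stay readable. `VersionedTable` and its methods are hypothetical names for this example only; it is a conceptual toy in plain Python and makes no claim about how the Lance format implements versioning internally.

```python
class VersionedTable:
    """Toy append-only table with numbered snapshots (hypothetical)."""

    def __init__(self):
        # Version 0 is the empty table; each snapshot is an immutable tuple.
        self._versions = [()]

    @property
    def version(self):
        return len(self._versions) - 1

    def add(self, rows):
        # A write never mutates an old snapshot; it appends a new one.
        self._versions.append(self._versions[-1] + tuple(rows))
        return self.version

    def rows(self, version=None):
        # Read the latest snapshot by default, or any earlier one by number.
        v = self.version if version is None else version
        return list(self._versions[v])


images = VersionedTable()
v1 = images.add(["cat.png"])
v2 = images.add(["dog.png", "bird.png"])

print(images.rows())            # ['cat.png', 'dog.png', 'bird.png']
print(images.rows(version=v1))  # ['cat.png']
```

The point of the sketch is the user-facing behavior: no separate versioning system is needed to look at the data as it was before a given write.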

Documentation guide

The following pages go deeper into the internals of LanceDB and how to use it.

