Open zfy68 opened 1 year ago
Vector database,也叫矢量数据库,是一种专门用于存储和管理矢量数据的数据库系统。矢量数据是指通过点、线、面等几何对象来描述空间信息的数据,如地图、道路、建筑物、河流等。矢量数据库具有高效性和灵活性,能够处理大量的复杂空间数据,可用于地理信息系统(GIS)、遥感、测绘等领域。常见的矢量数据库系统有Oracle Spatial、PostGIS、Microsoft SQL Server、GeoJSON等。
矢量数据库索引和存储矢量嵌入以进行快速检索和相似性搜索,具有 CRUD 操作、元数据过滤和水平缩放等功能。
1.语义搜索
Vector Indexes for Search and Retrieval
Single-Stage Filtering
Data Sharding
Replication
Hybrid Storage
API
Visit the Pinecone learning center and read more about key concepts, including vector embeddings, vector indexes, and NLP for semantic search. Here are some of the most popular topics:
Sentence Transformers: Meanings in Disguise - This guide discusses core techniques for converting text and documents into vector embeddings and details some of the most popular NLP embedding models.
The Missing WHERE Clause in Vector Search - This article explains two common methods for adding metadata filters to vector search, and explores their limitations. Then, we cover how Single-Stage Filtering bridges some of these gaps.
Nearest Neighbor Indexes for Similarity Search - This article explores the pros and cons of some of the most important indexes including Flat, LSH, HNSW, and IVF. It also gives tips for deciding which to use and the impact of parameters in each index.
了解有关矢量数据库的更多信息 访问 Pinecone 学习中心并阅读有关关键概念的更多信息,包括矢量嵌入、矢量索引和用于语义搜索的 NLP。 以下是一些最热门的话题:
Sentence Transformers: Meanings in Disguise - 本指南讨论了将文本和文档转换为向量嵌入的核心技术,并详细介绍了一些最流行的 NLP 嵌入模型。
The Missing WHERE Clause in Vector Search - 本文解释了将元数据过滤器添加到矢量搜索的两种常用方法,并探讨了它们的局限性。 然后,我们将介绍单级过滤如何弥合其中的一些差距。
Nearest Neighbor Indexes for Similarity Search - 本文探讨了一些最重要的索引的优缺点,包括 Flat、LSH、HNSW 和 IVF。 它还提供了决定使用哪个以及每个索引中参数的影响的提示。
Once you have your vector embeddings, you’ll need a vector database to index, store, and retrieve them.
Create an account and launch your first vector database.
With Pinecone, you can do this in just a few minutes. Pinecone is a fully managed vector database that makes it easy to add vector search to production applications. It combines vector search libraries, capabilities such as filtering, and distributed infrastructure to provide high performance and reliability at any scale.
启动您的第一个矢量数据库 一旦你有了向量嵌入,你就需要一个向量数据库来索引、存储和检索它们。
创建一个帐户并启动您的第一个矢量数据库。
使用 Pinecone,您只需几分钟即可完成此操作。 Pinecone 是一个完全托管的矢量数据库,可以轻松地将矢量搜索添加到生产应用程序中。 它结合了矢量搜索库、过滤等功能和分布式基础架构,可在任何规模下提供高性能和可靠性。
What is a Vector Database?