Vector Database Concepts Flashcards
(49 cards)
What is a vector database?
A specialized database designed to store and query high-dimensional vector embeddings.
What is a vector embedding?
A numeric representation of data (text, images, etc.) in a high-dimensional space.
Why use vector databases?
To perform efficient similarity search across large sets of embeddings.
What is a similarity search?
Finding items that are most similar to a given query vector.
What is cosine similarity?
A metric that measures the cosine of the angle between two vectors.
What is Euclidean distance?
A metric that measures the straight-line distance between two points in space.
What is dot product similarity?
A similarity metric based on the dot product of two vectors.
What is an ANN index?
Approximate Nearest Neighbor index used to speed up similarity search.
What is brute-force search?
A similarity search method that compares every vector — accurate but slow.
What is HNSW?
Hierarchical Navigable Small World — a popular ANN algorithm.
What is IVF in vector search?
Inverted File Index — used to partition vectors for faster search.
What is PQ (Product Quantization)?
A technique to compress and speed up vector similarity searches.
What is Faiss?
A library developed by Facebook for efficient similarity search and clustering of dense vectors.
What is Milvus?
An open-source vector database for scalable similarity search.
What is Weaviate?
A vector search engine with integrated machine learning capabilities.
What is Pinecone?
A fully managed vector database service for production-grade applications.
What is Qdrant?
An open-source vector search engine focused on performance and reliability.
What is Vespa?
A platform for serving vector search and recommendation systems.
What is vector dimensionality?
The number of features or coordinates in each embedding vector.
What is vector normalization?
Scaling vectors to have unit length for consistent similarity metrics.
What is index building?
The process of preparing vectors for fast similarity search.
What is index probing?
Querying an ANN index to find nearest vectors.
What is filtering in vector search?
Restricting search to vectors that meet certain metadata criteria.
What is metadata in vector databases?
Structured attributes associated with each vector for filtering.