09 January 2026

#Vector Databases

#Vector Databases

Key Concepts


S.No Topic Sub-Topics
1Vector DatabasesWhat is a vector, embeddings basics, similarity search, use cases, traditional DB vs vector DB
2Mathematics Behind VectorsLinear algebra basics, cosine similarity, Euclidean distance, dot product, normalization
3Embeddings FundamentalsText embeddings, image embeddings, dimensionality, dense vs sparse vectors, embedding quality
4Embedding ModelsWord2Vec, GloVe, FastText, Sentence Transformers, OpenAI embeddings
5Vector Similarity MetricsCosine similarity, L2 distance, inner product, Hamming distance, trade-offs
6Indexing TechniquesFlat index, inverted index, IVF, HNSW, PQ
7Approximate Nearest Neighbor (ANN)Why ANN, accuracy vs speed, recall, latency, scalability
8HNSW AlgorithmGraph layers, insertion, search process, parameters, performance tuning
9IVF and PQ IndexesClustering, centroids, compression, memory optimization, search flow
10Index OptimizationRe-indexing, shard sizing, dimension reduction, caching, pruning
11Popular Vector DatabasesPinecone, Weaviate, Milvus, Qdrant, FAISS overview
12Pinecone ArchitectureIndexes, namespaces, pods, metadata filtering, scaling
13Weaviate ArchitectureSchema, classes, modules, GraphQL API, hybrid search
14Milvus ArchitectureCollections, partitions, segments, coordinators, storage layers
15Open Source vs Managed Vector DBsCost, scalability, maintenance, performance, use cases
16Data Modeling in Vector DBsVector schema, metadata fields, hybrid models, versioning, updates
17CRUD OperationsInsert vectors, update vectors, delete vectors, batch operations, upserts
18Metadata FilteringStructured filters, boolean logic, range queries, hybrid search, performance impact
19Hybrid SearchKeyword + vector search, BM25, re-ranking, score fusion, use cases
20Vector Search APIsREST APIs, SDKs, query parameters, pagination, result tuning
21Scalability & ShardingHorizontal scaling, replicas, partitioning, load balancing, fault tolerance
22Performance TuningLatency optimization, recall tuning, batch queries, memory usage, caching
23Consistency & DurabilityReplication, WAL, backups, recovery, data integrity
24Security in Vector DatabasesAuthentication, authorization, encryption, network security, compliance
25Monitoring & ObservabilityMetrics, logging, tracing, alerts, capacity planning
26Vector DB + LLM IntegrationRAG pattern, prompt injection, context windowing, retrieval tuning, pipelines
27Semantic Search ApplicationsSearch engines, document retrieval, FAQs, chatbots, recommendation systems
28Multimodal Vector SearchText-image search, audio embeddings, video embeddings, fusion strategies, use cases
29Production Best PracticesSchema evolution, re-embedding, cost optimization, SLA planning, testing
30Advanced & Future TrendsVector + graph DBs, real-time embeddings, edge deployment, AI agents, research trends

Interview question


Related Topics