VecDB API
  • 🌱VecDB
  • 🌱Structure
  • 🌱Glossary
  • 🌱FAQ
  • πŸ’»API Reference
  • Get Started
    • πŸ’»Authentication
    • πŸ’»Build Your First Search
      • πŸ’»Creating your first dataset
      • πŸ’»Encoding + Inserting
      • πŸ’»Your First Text To Image Search!
  • for your understanding
    • 🌱Concepts about vectors
    • 🌱What is vector search?
    • 🌱Vectors for classification
    • 🌱Limitations of vectors
  • Guides
    • πŸ’»Combine keyword search with vector search
  • ADMIN
    • 🌱Project Overview
      • πŸ’»Project Creation
      • πŸ’»List All Datasets
      • Best Practice With Project Management
      • πŸ’»Copy dataset
      • πŸ’»Copy dataset from another project
      • πŸ’»Request an API key
      • πŸ’»Request a read-only API key
  • Services
    • πŸ”Search
      • Text Search
      • Vector Search
      • Hybrid Search
      • Traditional Search
    • πŸ”Predict
      • πŸ’»KNN regression from search results
      • πŸ’»KNN Regression
    • πŸ”Tag
      • πŸ’»Tagging
      • πŸ’»Diversity Tagging
    • πŸ”Cluster
      • Cluster A Dataset Field
  • DATASETS
    • 🌱Datasets Overview
      • 🌱Special Field Types
  • WORKFLOWS
    • 🌱Workflow Overview
Powered by GitBook
On this page
  • What is vector search?
  • Why is vector search important?
  • The difficulties of vector search

Was this helpful?

  1. for your understanding

What is vector search?

An introduction to vector search/nearest neighbors

PreviousConcepts about vectorsNextVectors for classification

Last updated 3 years ago

Was this helpful?

What is vector search?

Vector search is the process of finding the most similar vectors to itself (also known as nearest neighbors of similarity search).

Although we initially used the analogy of vectors as fingerprints in our introduction, vectors actually have additional properties that allow them to be useful in practical applications. These include:

  1. Similar data have similar vectors.

  2. You can measure the similarity of these vectors statistically in a number of different ways.

Why is vector search important?

If you can find similar vectors based on the data - this means, you can provide different ways of linking data in ways individuals may have never considered. Linguistically - you can now link sentences based on semantics as opposed to relying on co-occurrences of words (used in traditional word search). Similarly, for image search, you can use reverse image search and personalised image search -- allowing for better recommendations for searches. If you are interested in vector search applications, we have listed a few below.

Vector Search is reliant largely on index libraries and encoders. Each vector search guide follows a template of:

The steps/materials can be summarised in the following way: Data: Obtain the data in a way that can be processed into a numerical representation and fed through the necessary model. Encode: Feed the data's numerical representation into a model and extract the vector which can be indexed and searched. Index: Indexing the vectors (from which the data has been encoded) in an efficient way that allows for retrieval. Search: Search the vectors that have been indexed by using a variety of Nearest Neighbor algorithms, filters, chunking and queries.

The difficulties of vector search

While the above process appears simple, there are a lot of difficulties with actually using vector search for production. These difficulties include:

  • Deploying your index and search for production

  • Usage of vectors to optimise search results

    • Optimising the way search is being done on the vectors

  • Optimising the matching of user intent and products

The most common algorithms that are used are called nearest neighbor algorithms. You can read more about them .

Process of indexing and searching vectors
🌱
here
Image Vector Search aims to identify the closest vectors based on given images.