• Home
  • Categories
  • Tags
  • Pricing
  • Submit
  1. Home
  2. Curated Resource Lists
  3. Image Retrieval in the Wild

Image Retrieval in the Wild

A CVPR 2020 tutorial on large-scale image retrieval in unconstrained environments, including methods and system considerations for vector-based image search relevant to vector database and ANN applications.

🌐Visit Website

About this tool

Image Retrieval in the Wild

Type: Conference tutorial
Category: Curated resource lists / Tutorials
Event: CVPR 2020 Tutorial
Format: Slides + recorded videos (remote, via Zoom)

Overview

“Image Retrieval in the Wild” is a CVPR 2020 tutorial focused on building large-scale, practical content-based image retrieval systems for real-world applications. It emphasizes:

  • Vector-based image search and approximate nearest neighbor (ANN) algorithms
  • System design and deployment for billion-scale visual search
  • Cross-modality person re-identification (e.g., low-res, infrared, sketch, text-to-image)
  • A live implementation demo of an image search engine using a pre-trained deep model

All presentation slides and videos are accessible from the tutorial schedule.

Features

Core Topics

  • Content-based image retrieval foundations

    • Role of image retrieval in interacting with large visual collections
    • Limitations of standard benchmarks (e.g., Oxford buildings) for real-world scenarios
  • Approximate Nearest Neighbor (ANN) search

    • Review of state-of-the-art ANN algorithms
    • Billion-scale approximate nearest neighbor search
    • Practical guidance on selecting ANN algorithms for specific tasks and constraints
  • Large-scale visual search systems

    • Case study: Online C2C marketplace app (Mercari)
    • Handling over one billion listings and ~15M monthly active users
    • System design for scalability and high availability
    • Deployment on Kubernetes for production visual search
  • Heterogeneous person re-identification

    • Focus on inter-modality discrepancies as the main challenge
    • Cross-modality scenarios covered:
      • Low-resolution (LR)
      • Infrared (IR)
      • Sketch
      • Text
    • Organization and overview of available datasets per scenario
    • Summary and comparison of representative algorithms and approaches
  • Live-coding demo: image search engine from scratch

    • Building a web-based image search system in ~100 lines of Python
    • Using a pre-trained deep model for feature extraction
    • End-to-end walkthrough from embedding to search interface

Access & Materials

  • Tutorial held remotely (Zoom) as part of CVPR 2020
  • Slides and videos linked directly from the schedule:
    • Opening & context
    • Billion-scale ANN search
    • Large-scale visual search in a C2C marketplace
    • Heterogeneous person re-identification survey
    • Live-coding demo of an image search engine

Schedule (High-Level)

  • Opening & Billion-scale ANN Search – Yusuke Matsui
  • Large-scale Visual Search in the Mercari C2C App – Takuma Yamaguchi
  • Break
  • Heterogeneous Person Re-identification Survey – Zheng Wang
  • Live-coding: Image Search Engine from Scratch – Yusuke Matsui

Tags

  • Tutorials
  • Multimodal
  • Vector search

Link

  • Tutorial page: https://matsui528.github.io/cvpr2020_tutorial_retrieval/
Surveys

Loading more......

Information

Websitematsui528.github.io
PublishedDec 25, 2025

Categories

1 Item
Curated Resource Lists

Tags

3 Items
#tutorials
#multimodal
#vector search

Similar Products

6 result(s)
Neural Search in Action

A CVPR 2023 tutorial that demonstrates neural search systems in practice, including vector representations, similarity search, and scalable retrieval architectures closely related to vector databases.

Mastering Multimodal RAG

A course focused on mastering multimodal Retrieval Augmented Generation (RAG) and embeddings, which are fundamental components often stored and managed by vector databases.

Deep Lake

Deep Lake is a vector database designed as a data lake for AI, capable of storing and managing vector embeddings, text, images, and videos. It utilizes a tensor format for efficient querying and integration with AI algorithms, making it suitable for similarity search and machine learning workflows. It is open-source and tailored for handling unstructured and multimodal data, with seamless integration with frameworks like PyTorch and TensorFlow.

Typesense Cloud

Fully managed cloud service for the open-source Typesense search engine, including support for vector search and hybrid search use cases.

Hashing

A set of libraries and methods focused on hashing for similarity search in vector databases, directly impacting the performance of large-scale vector search systems.

KShivendu/awesome-vector-search

A curated list of awesome projects and research related to vector search, including dedicated vector databases, vector search libraries, performance benchmarks, and cost analysis resources.

Built with
Ever Works
Ever Works

Connect with us

Stay Updated

Get the latest updates and exclusive content delivered to your inbox.

Product

  • Categories
  • Tags
  • Pricing
  • Help

Clients

  • Sign In
  • Register
  • Forgot password?

Company

  • About Us
  • Admin
  • Sitemap

Resources

  • Blog
  • Submit
  • API Documentation
All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
Copyright © 2025 Acme. All rights reserved.·Terms of Service·Privacy Policy·Cookies