Contrastive learning models have made it easier to find relevant content in multimedia collections through descriptive text queries, reducing the interactivity needed to solve simple tasks. However, complex tasks that do not pertain only to visual elements, or tasks focused on answering questions about the contents of one or more media items (videos/images), still require a fair …
Supervisor: Omar Shahbaz Khan
Semester: Spring 2026
Tags: Multimedia Retrieval, VLM, RAG, Vector Store, Multimedia Indexing
The extended Cluster Pruning (eCP) index is a hierarchical approximate nearest neighbor index.
eCP-FS is a “white-box” implementation of the index as a file structure using zarr. The index is built in Python, while index loading and search are implemented both in pure Python and as a Rust-Python package.
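For readers unfamiliar with cluster-pruning-style indexes, the following is a minimal in-memory sketch of the general idea: vectors are grouped under randomly sampled leader vectors, the grouping is repeated recursively to form a hierarchy, and a query only descends into the `b` closest branches at each level. It is not the eCP-FS code, file layout, or API; the names `build_node`, `collect`, `search`, `leaf_size`, and `b`, as well as the square-root rule for the number of leaders, are assumptions made for illustration.

```python
import math
import random


def build_node(ids, vectors, leaf_size, rng):
    """Recursively partition ids under randomly sampled leader vectors."""
    if len(ids) <= leaf_size:
        return {"leaf": list(ids)}
    # Assumption for this sketch: roughly sqrt(n) leaders per node.
    leaders = rng.sample(ids, max(2, math.isqrt(len(ids))))
    groups = {leader: [] for leader in leaders}
    for i in ids:
        nearest = min(leaders, key=lambda l: math.dist(vectors[i], vectors[l]))
        groups[nearest].append(i)
    children = []
    for leader, members in groups.items():
        if not members:
            continue  # a leader can end up empty if its vector is duplicated
        if len(members) == len(ids):
            # No split happened (e.g. all vectors identical); stop recursing.
            child = {"leaf": members}
        else:
            child = build_node(members, vectors, leaf_size, rng)
        children.append((leader, child))
    return {"children": children}


def collect(node, vectors, query, b):
    """Gather candidate ids by following the b closest leaders at each level."""
    if "leaf" in node:
        return list(node["leaf"])
    ranked = sorted(node["children"], key=lambda c: math.dist(query, vectors[c[0]]))
    candidates = []
    for _, child in ranked[:b]:
        candidates.extend(collect(child, vectors, query, b))
    return candidates


def search(root, vectors, query, k, b):
    """Approximate k-NN: rank only the candidates reached through pruning."""
    candidates = collect(root, vectors, query, b)
    return sorted(candidates, key=lambda i: math.dist(query, vectors[i]))[:k]


if __name__ == "__main__":
    rng = random.Random(42)
    data = [[rng.random() for _ in range(8)] for _ in range(1000)]
    index = build_node(list(range(len(data))), data, leaf_size=16, rng=rng)
    print(search(index, data, query=data[0], k=5, b=2))
```

Raising `b` visits more branches, trading search time for recall; setting it larger than the fan-out makes the search exhaustive. A disk-backed variant would store each node's vectors in its own array on disk rather than in Python lists, which is the part this sketch leaves out.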
While the index is slower from disk than other disk-based ANN …
Supervisor: Omar Shahbaz Khan
Semester: Spring 2026
Tags: High-dimensional Indexing, Vector Store, Rust, Python, Approximate Nearest Neighbor