Large-scale image search with text for information retrieval
DOI:
https://doi.org/10.3126/jiee.v4i1.35390Keywords:
Computer vision, Attention mechanism, Contrastive loss function, Natural language processing, Information retrieval systemsAbstract
Searching images in a large database is a major requirement in Information Retrieval Systems. Expecting image search results based on a text query is a challenging task. In this paper, we leverage the power of Computer Vision and Natural Language Processing in Distributed Machines to lower the latency of search results. Image pixel features are computed based on contrastive loss function for image search. Text features are computed based on the Attention Mechanism for text search. These features are aligned together preserving the information in each text and image feature. Previously, the approach was tested only in multilingual models. However, we have tested it in image-text dataset and it enabled us to search in any form of text or images with high accuracy.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Upon acceptance of an article, the copyright for the published works remains in the JIEE, Thapathali Campus and the authors.