Active learning for image retrieval via visual similarity metrics and semantic features
We introduce an active learning framework for content-based image retrieval for video surveillance that can be trained ad-hoc for a single camera in a matter of minutes. This technique allows searching for both, known and unknown objects, given a region of interest. The process does not require prior labelled data and treats image retrieval as a binary classification task, in which frames can be similar or different from a query image. The technique is compatible with any pre-trained deep feature extractor. In addition, we propose a novel label propagation algorithm that benefits from (1) visual similarity of image pairs and (2) the semantic representation of the feature vectors from a pre-trained deep feature extractor. This approach allows to reduce the amount of labels needed, while avoiding the propagation of errors. Our experiments with three use-cases from a nuclear facility show the validity of the proposed method, which achieves high precision and recall while requiring minimal amounts of labelled data.
CASADO COSCOLLA Alvaro;
SANCHEZ BELENGUER Carlos;
WOLFART Erik;
ANGORRILLA BUSTAMANTE Carlos;
SEQUEIRA Vitor;
2024-09-25
PERGAMON-ELSEVIER SCIENCE LTD
JRC139273
1873-6769 (online),
https://www.sciencedirect.com/science/article/pii/S0952197624013976,
https://publications.jrc.ec.europa.eu/repository/handle/JRC139273,
10.1016/j.engappai.2024.109239 (online),
Additional supporting files
| File name | Description | File type | |