Domain adaptation approaches can be used to efficiently train object detectors by leveraging labeled synthetic images, inexpensively generated from 3D models, and unlabeled real images, which are cheaper to obtain than labeled ones. Most of the state-of-the-art techniques consider only one source and one target domain for the adaptation task. However, real world scenarios, such as applications in cultural sites, naturally involve many target domains which arise from the use of different cameras at inference time (e.g. different wearable devices and different smartphones on which the algorithm will be deployed). In this work, we investigate whether the availability of multiple unlabeled target domains can improve domain adaptive object detection algorithms. To study the problem, we propose a new dataset comprising images of 16 different objects rendered from a 3D model as well as images collected in the real environment using two different cameras. We experimentally assess that current domain adaptive object detectors can improve their performance by leveraging the multiple targets. As evidence of the usefulness of explicitly considering multiple target domains, we propose a new unsupervised multi-camera domain adaptation approach for object detection which outperforms current methods. Code and dataset are available at https://iplab.dmi.unict.it/OBJ-MDA/.

Unsupervised Multi-camera Domain Adaptation for Object Detection in Cultural Sites

Pasqualino G.;Furnari A.;Farinella G. M.
2022-01-01

Abstract

Domain adaptation approaches can be used to efficiently train object detectors by leveraging labeled synthetic images, inexpensively generated from 3D models, and unlabeled real images, which are cheaper to obtain than labeled ones. Most of the state-of-the-art techniques consider only one source and one target domain for the adaptation task. However, real world scenarios, such as applications in cultural sites, naturally involve many target domains which arise from the use of different cameras at inference time (e.g. different wearable devices and different smartphones on which the algorithm will be deployed). In this work, we investigate whether the availability of multiple unlabeled target domains can improve domain adaptive object detection algorithms. To study the problem, we propose a new dataset comprising images of 16 different objects rendered from a 3D model as well as images collected in the real environment using two different cameras. We experimentally assess that current domain adaptive object detectors can improve their performance by leveraging the multiple targets. As evidence of the usefulness of explicitly considering multiple target domains, we propose a new unsupervised multi-camera domain adaptation approach for object detection which outperforms current methods. Code and dataset are available at https://iplab.dmi.unict.it/OBJ-MDA/.
2022
978-3-031-06426-5
978-3-031-06427-2
Cultural sites
Domain adaptation
Multi-camera
Object detection
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/540585
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 1
social impact