Exploring the Impact of Model Parameters and Components on Video Saliency Prediction with Foundation Models

Moradi, M.; Moradi, M.; Rundo, F.; Spampinato, C.; Borji, A.; Palazzo, S.

doi:10.1007/978-3-031-97822-7_13

As a companion to the ICPR 2024 accepted paper “SalFoM: Dynamic Saliency Prediction with Video Foundation Models”, this work investigates how various model parameters and components impact its performance. Since SalFoM represents the first effort of its kind in this field, the additional experiments presented here are designed to provide insights into the application of video foundation models for dynamic saliency prediction. This is achieved by exploring different aspects of the model’s architecture and the use of large video models. Additionally, this work analyzes the impact of various strategies for defining training objectives on the model’s learning capabilities and overall performance. The code is available at https://github.com/mr17m/SalFoM—Video-Saliency-Prediction.

Exploring the Impact of Model Parameters and Components on Video Saliency Prediction with Foundation Models

Moradi M.;Moradi M.;Rundo F.;Spampinato C.;Borji A.;Palazzo S.

2025-01-01

Abstract

As a companion to the ICPR 2024 accepted paper “SalFoM: Dynamic Saliency Prediction with Video Foundation Models”, this work investigates how various model parameters and components impact its performance. Since SalFoM represents the first effort of its kind in this field, the additional experiments presented here are designed to provide insights into the application of video foundation models for dynamic saliency prediction. This is achieved by exploring different aspects of the model’s architecture and the use of large video models. Additionally, this work analyzes the impact of various strategies for defining training objectives on the model’s learning capabilities and overall performance. The code is available at https://github.com/mr17m/SalFoM—Video-Saliency-Prediction.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Codice ISBN
	
				9783031978210
9783031978227
			
	Parole chiave
	
				Reproducibility
Video Foundation Model
Video Saliency Prediction
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/686709

Citazioni

ND

0

ND

Exploring the Impact of Model Parameters and Components on Video Saliency Prediction with Foundation Models

Moradi M.;Moradi M.;Rundo F.;Spampinato C.;Borji A.;Palazzo S.

2025-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)