GLR: Gradient-Based Learning Rate Scheduler

Napoli Spatafora, M. A.; Ortis, A.; Battiato, S.
2023-01-01

Abstract

Training a neural network is a complex and time-consuming process because of the many combinations of hyperparameters that must be adjusted and tested. One of the most crucial hyperparameters is the learning rate, which controls the speed and direction of the weight updates during training. We propose an adaptive scheduler, the Gradient-based Learning Rate scheduler (GLR), which significantly reduces the tuning effort by requiring only a single user-defined parameter. Across a very wide set of experiments, GLR achieves results competitive with state-of-the-art schedulers and optimizers. The computational cost of our method is negligible, and it can be used to train networks with different topologies.
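This record does not include the paper's full text or GLR's exact update rule, but the general idea of a gradient-driven learning-rate scheduler with a single user-defined parameter can be illustrated with a minimal sketch. The code below is a hypothetical PyTorch-style implementation, not the authors' method: the class name GradientDrivenLR, the parameter alpha, and the specific rule (rescaling the learning rate by the ratio of successive global gradient norms) are all assumptions made for illustration.

```python
# Hypothetical sketch of a gradient-driven learning-rate scheduler.
# NOT the GLR algorithm from the paper (this record gives no update rule);
# the name `GradientDrivenLR`, the parameter `alpha`, and the norm-ratio
# rule are illustrative assumptions.


class GradientDrivenLR:
    """Adapt the learning rate from the trend of the global gradient norm.

    `alpha` is the single user-defined parameter in this sketch: it damps
    how strongly the change in gradient norm moves the learning rate.
    Works with any PyTorch-style optimizer exposing `param_groups`.
    """

    def __init__(self, optimizer, alpha=0.1):
        self.optimizer = optimizer
        self.alpha = alpha
        self.prev_norm = None

    def _grad_norm(self):
        # Global L2 norm over all parameters that received a gradient.
        total = 0.0
        for group in self.optimizer.param_groups:
            for p in group["params"]:
                if p.grad is not None:
                    total += p.grad.detach().pow(2).sum().item()
        return total ** 0.5

    def step(self):
        # Call once per batch, after loss.backward() and before
        # optimizer.step(), so the adjusted LR applies to this update.
        norm = self._grad_norm()
        if self.prev_norm is not None and norm > 0:
            # Gradients shrinking -> ratio > 1 -> gently raise the LR;
            # gradients growing  -> ratio < 1 -> gently lower it.
            ratio = self.prev_norm / norm
            for group in self.optimizer.param_groups:
                group["lr"] *= 1.0 + self.alpha * (ratio - 1.0)
        self.prev_norm = norm


# Example usage (assuming a standard PyTorch training loop):
#   optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
#   scheduler = GradientDrivenLR(optimizer, alpha=0.1)
#   ...
#   loss.backward()
#   scheduler.step()   # adjust LR from the fresh gradients
#   optimizer.step()
```

Whether GLR adapts the learning rate per batch or per epoch, and from which gradient statistic, is not stated in this record; the per-batch norm-ratio rule above is only one plausible instantiation of a scheduler driven by gradients with a single tunable parameter.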
Year: 2023
ISBN: 978-3-031-43147-0; 978-3-031-43148-7
Keywords: Hyperparameters; Neural network; Optimization
Files in this record:
There are no files associated with this record.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.11769/575492
Citations
  • Scopus 0