Embedding Simulated Annealing within Stochastic Gradient Descent

Fischetti, M.; Stringher, M.

doi:10.1007/978-3-030-85672-4_1

We propose a new metaheuristic training scheme for Machine Learning that combines Stochastic Gradient Descent (SGD) and Discrete Optimization in an unconventional way. Our idea is to define a discrete neighborhood of the current SGD point containing a number of “potentially good moves” that exploit gradient information, and to search this neighborhood by using a classical metaheuristic scheme borrowed from Discrete Optimization. In the present paper we investigate the use of a simple Simulated Annealing (SA) metaheuristic that accepts/rejects a candidate new solution in the neighborhood with a probability that depends both on the new solution quality and on a parameter (the temperature) which is modified over time to lower the probability of accepting worsening moves. Computational results on image classification (CIFAR-10) are reported, showing that the proposed approach leads to an improvement of the final validation accuracy for modern Deep Neural Networks such as ResNet34 and VGG16.

Embedding Simulated Annealing within Stochastic Gradient Descent

Fischetti M.;Stringher M.

2021

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
			2021
		
	Titolo del Libro
	
			Communications in Computer and Information Science
		
	Codice DOI
	
			https://dx.doi.org/10.1007/978-3-030-85672-4_1
		
	Codice WOS
	
			WOS:001054800900001
		
	Codice Scopus
	
			2-s2.0-85115143736
		
	Codice ISBN
	
			978-3-030-85671-7
978-3-030-85672-4
		
	Appare nelle tipologie:
	
			04.01 - Contributo in atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3411711

Embedding Simulated Annealing within Stochastic Gradient Descent

Fischetti M.;Stringher M.

2021

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Pubblicazioni consigliate

Citazioni

social impact

Embedding Simulated Annealing within Stochastic Gradient Descent

Fischetti M.;Stringher M.

2021

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)