Disentangling Grasp-Object Representations in the Latent Space: Toward Brain-Like Affordances for Machines

Hejri, S. M.; Pourfannan, H.; Ma, R.; Rodriguez, A. J.; Di Nuovo, A.

doi:10.1007/978-3-032-07448-5_35

Humans perceive the world through their bodies. The theory of object affordances suggests that when encountering an object, our brain encodes it not only based on its physical properties but also according to how we intend to use it. Decades of foundational research in neuroscience indicate that object properties are associated with distinct regions of the sensorimotor cortex, depending on the grasp type they tend to activate. In this study, we trained a Conditional Variational Autoencoder (CVAE) on the HO-3D_v3 dataset to reconstruct hand poses conditioned on object properties. Principal Component Analysis (PCA), clustering, and visualization of the model’s latent space revealed structured patterns for the abstract representation of the hand, which were distinctly organized according to object associations. This bears a notable resemblance to neural strategies observed in the human sensorimotor cortex for representing object-grasp relationships. This finding supports the notion that artificial intelligence systems can develop brain-like latent representations of object affordances. Such representations could significantly enhance robotic control in the future by enabling real-time motor planning for high-degree-of-freedom humanoid hand actions in an abstract latent space, bypassing the need for low-level pixel- and joint-level computations.

Disentangling Grasp-Object Representations in the Latent Space: Toward Brain-Like Affordances for Machines

Hejri S. M.;Pourfannan H.;Ma R.;Rodriguez A. J.;Di Nuovo A.^Ultimo

2026-01-01

Abstract

Humans perceive the world through their bodies. The theory of object affordances suggests that when encountering an object, our brain encodes it not only based on its physical properties but also according to how we intend to use it. Decades of foundational research in neuroscience indicate that object properties are associated with distinct regions of the sensorimotor cortex, depending on the grasp type they tend to activate. In this study, we trained a Conditional Variational Autoencoder (CVAE) on the HO-3D_v3 dataset to reconstruct hand poses conditioned on object properties. Principal Component Analysis (PCA), clustering, and visualization of the model’s latent space revealed structured patterns for the abstract representation of the hand, which were distinctly organized according to object associations. This bears a notable resemblance to neural strategies observed in the human sensorimotor cortex for representing object-grasp relationships. This finding supports the notion that artificial intelligence systems can develop brain-like latent representations of object affordances. Such representations could significantly enhance robotic control in the future by enabling real-time motor planning for high-degree-of-freedom humanoid hand actions in an abstract latent space, bypassing the need for low-level pixel- and joint-level computations.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2026
			
	Codice ISBN
	
				9783032074478
9783032074485
			
	Parole chiave
	
				Conditional Variational Autoencoders (CVAEs)
Grasp Embeddings
Latent Space Analysis
Object Affordances
Principal Component Analysis
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11769/707272

Citazioni

ND

0

ND

Disentangling Grasp-Object Representations in the Latent Space: Toward Brain-Like Affordances for Machines

Hejri S. M.;Pourfannan H.;Ma R.;Rodriguez A. J.;Di Nuovo A.Ultimo

Ultimo

2026-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Hejri S. M.;Pourfannan H.;Ma R.;Rodriguez A. J.;Di Nuovo A.^Ultimo

Scheda breve

Scheda completa

Scheda completa (DC)