Modelling human visual navigation using multi-view scene reconstruction

Bibliographic details
Published in:	Biological cybernetics 2013-08, Vol.107 (4), p.449-464
Main authors:	Pickup, Lyndsey C.; Fitzgibbon, Andrew W.; Glennerster, Andrew
Format:	Article
Language:	English
Subjects:	Applied sciences; Artificial intelligence; Bioinformatics; Biomedical and Life Sciences; Biomedicine; Complex Systems; Computer Appl. in Life Sciences; Computer based modeling; Computer science control theory systems; Computer systems and distributed systems. User interface; Control theory. Systems; Exact sciences and technology; Eyes & eyesight; Human; Humans; Information processing; Landmarks; Likelihood Functions; Mathematical models; Modelling and identification; Models, Theoretical; Navigation; Neurobiology; Neurosciences; Original Paper; Pattern recognition. Digital image processing. Computational geometry; Position (location); Reconstruction; Sensory perception; Software; Three dimensional; Virtual reality; Visual Perception
Online access:	Full text
Description
Abstract:	It is often assumed that humans generate a 3D reconstruction of the environment, either in egocentric or world-based coordinates, but the steps involved are unknown. Here, we propose two reconstruction-based models, evaluated using data from two tasks in immersive virtual reality. We model the observer’s prediction of landmark location based on standard photogrammetric methods and then combine location predictions to compute likelihood maps of navigation behaviour. In one model, each scene point is treated independently in the reconstruction; in the other, the pertinent variable is the spatial relationship between pairs of points. Participants viewed a simple environment from one location, were transported (virtually) to another part of the scene and were asked to navigate back. Error distributions varied substantially with changes in scene layout; we compared these directly with the likelihood maps to quantify the success of the models. We also measured error distributions when participants manipulated the location of a landmark to match the preceding interval, providing a direct test of the landmark-location stage of the navigation models. Models such as this, which start with scenes and end with a probabilistic prediction of behaviour, are likely to be increasingly useful for understanding 3D vision.
ISSN:	0340-1200 1432-0770
DOI:	10.1007/s00422-013-0558-2