Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Sensors (Basel) ; 19(18)2019 Sep 18.
Article in English | MEDLINE | ID: mdl-31540453

ABSTRACT

Scene recognition is still a very important topic in many fields, and that is definitely the case in robotics. Nevertheless, this task is view-dependent, which implies the existence of preferable directions when recognizing a particular scene. Both in human and computer vision-based classification, this actually often turns out to be biased. In our case, instead of trying to improve the generalization capability for different view directions, we have opted for the development of a system capable of filtering out noisy or meaningless images while, on the contrary, retaining those views from which is likely feasible that the correct identification of the scene can be made. Our proposal works with a heuristic metric based on the detection of key points in 3D meshes (Harris 3D). This metric is later used to build a model that combines a Minimum Spanning Tree and a Support Vector Machine (SVM). We have performed an extensive number of experiments through which we have addressed (a) the search for efficient visual descriptors, (b) the analysis of the extent to which our heuristic metric resembles the human criteria for relevance and, finally, (c) the experimental validation of our complete proposal. In the experiments, we have used both a public image database and images collected at our research center.

3.
Vision Res ; 154: 60-79, 2019 01.
Article in English | MEDLINE | ID: mdl-30408434

ABSTRACT

In this study we provide the analysis of eye movement behavior elicited by low-level feature distinctiveness with a dataset of synthetically-generated image patterns. Design of visual stimuli was inspired by the ones used in previous psychophysical experiments, namely in free-viewing and visual searching tasks, to provide a total of 15 types of stimuli, divided according to the task and feature to be analyzed. Our interest is to analyze the influences of low-level feature contrast between a salient region and the rest of distractors, providing fixation localization characteristics and reaction time of landing inside the salient region. Eye-tracking data was collected from 34 participants during the viewing of a 230 images dataset. Results show that saliency is predominantly and distinctively influenced by: 1. feature type, 2. feature contrast, 3. temporality of fixations, 4. task difficulty and 5. center bias. This experimentation proposes a new psychophysical basis for saliency model evaluation using synthetic images.


Subject(s)
Attention/physiology , Eye Movements/physiology , Psychophysics , Visual Perception/physiology , Adult , Female , Fixation, Ocular/physiology , Humans , Male , Middle Aged , Young Adult
4.
IEEE Trans Pattern Anal Mach Intell ; 39(5): 893-907, 2017 05.
Article in English | MEDLINE | ID: mdl-27187946

ABSTRACT

General dynamic scenes involve multiple rigid and flexible objects, with relative and common motion, camera induced or not. The complexity of the motion events together with their strong spatio-temporal correlations make the estimation of dynamic visual saliency a big computational challenge. In this work, we propose a computational model of saliency based on the assumption that perceptual relevant information is carried by high-order statistical structures. Through whitening, we completely remove the second-order information (correlations and variances) of the data, gaining access to the relevant information. The proposed approach is an analytically tractable and computationally simple framework which we call Dynamic Adaptive Whitening Saliency (AWS-D). For model assessment, the provided saliency maps were used to predict the fixations of human observers over six public video datasets, and also to reproduce the human behavior under certain psychophysical experiments (dynamic pop-out). The results demonstrate that AWS-D beats state-of-the-art dynamic saliency models, and suggest that the model might contain the basis to understand the key mechanisms of visual saliency. Experimental evaluation was performed using an extension to video of the well-known methodology for static images, together with a bootstrap permutation test (random label hypothesis) which yields additional information about temporal evolution of the metrics statistical significance.

5.
J Vis ; 12(6): 17, 2012 Jun 12.
Article in English | MEDLINE | ID: mdl-22693335

ABSTRACT

A hierarchical definition of optical variability is proposed that links physical magnitudes to visual saliency and yields a more reductionist interpretation than previous approaches. This definition is shown to be grounded on the classical efficient coding hypothesis. Moreover, we propose that a major goal of contextual adaptation mechanisms is to ensure the invariance of the behavior that the contribution of an image point to optical variability elicits in the visual system. This hypothesis and the necessary assumptions are tested through the comparison with human fixations and state-of-the-art approaches to saliency in three open access eye-tracking datasets, including one devoted to images with faces, as well as in a novel experiment using hyperspectral representations of surface reflectance. The results on faces yield a significant reduction of the potential strength of semantic influences compared to previous works. The results on hyperspectral images support the assumptions to estimate optical variability. As well, the proposed approach explains quantitative results related to a visual illusion observed for images of corners, which does not involve eye movements.


Subject(s)
Adaptation, Physiological/physiology , Fixation, Ocular/physiology , Form Perception/physiology , Models, Neurological , Motion Perception/physiology , Face , Humans , Illusions/physiology , Pattern Recognition, Visual/physiology , Photic Stimulation/methods
6.
IEEE Trans Biomed Eng ; 52(12): 2115-8, 2005 Dec.
Article in English | MEDLINE | ID: mdl-16366236

ABSTRACT

In this paper, we present a method for the decomposition of a volumetric image into its most relevant visual patterns, which we define as features associated to local energy maxima of the image. The method involves the clustering of a set of predefined bandpass energy filters according to their ability to segregate the different features in the image, thus generating a set of composite-feature detectors tuned to the specific visual patterns present in the data. Clustering is based on a measure of statistical dependence between pairs of frequency features. We will illustrate the applicability of the method to the initialization of a three-dimensional geodesic active model.


Subject(s)
Algorithms , Computer Graphics , Image Enhancement/methods , Image Interpretation, Computer-Assisted/methods , Imaging, Three-Dimensional/methods , Pattern Recognition, Automated/methods , User-Computer Interface , Artificial Intelligence , Brain/anatomy & histology , Humans , Information Storage and Retrieval/methods , Magnetic Resonance Imaging/methods , Reproducibility of Results , Sensitivity and Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...