Shape Perception as Bayesian Inference of Modality-independent Part-based 3D Object-centered Shape Representations

Shape Perception as Bayesian Inference of Modality-independent Part-based 3D Object-centered Shape Representations
Author: Goker Erdogan
Publisher:
Total Pages: 211
Release: 2017
Genre: Form perception
ISBN:

Download Shape Perception as Bayesian Inference of Modality-independent Part-based 3D Object-centered Shape Representations Book in PDF, Epub and Kindle

"Shape is a fundamental property of physical objects. It provides crucial information for various critical behaviors from object recognition to motor planning. The fundamental question here for cognitive science is to understand object shape perception, i.e., how our brains extract shape information from sensory stimuli and make use of it. In other words, we want to understand the representations and algorithms our brains use to achieve successful shape perception. This thesis reports a computational theory of shape perception that uses modality-independent, part-based, 3D, object-centered shape representations and frames shape perception as Bayesian inference over such representations. In a series of behavioral, neuroimaging and computational studies reported in the following chapters, we test various aspects of this proposed theory and show that it provides a promising approach to understanding shape perception."--Page xi.

3D Shape

3D Shape
Author: Zygmunt Pizlo
Publisher: MIT Press
Total Pages: 295
Release: 2010
Genre: Medical
ISBN: 026251513X

Download 3D Shape Book in PDF, Epub and Kindle

Zygmunt Pizlo is Professor of Psychological Sciences and Electrical and Computer Engineering (by courtesy) at Purdue University.

Curve-Based Shape Representation in Visual Perception

Curve-Based Shape Representation in Visual Perception
Author: Nicholas Baker
Publisher:
Total Pages: 190
Release: 2020
Genre:
ISBN:

Download Curve-Based Shape Representation in Visual Perception Book in PDF, Epub and Kindle

Shape is the predominant cue for object recognition in visual perception. Though many studies have demonstrated the psychological importance of shape information, much remains unknown about how the visual system forms representations of shape. Shape representations are unlikely to be a literal recording of an object's boundary. Rather, representations of shape are abstract in that they encode relations between parts, are economical, selectively encoding information present in the physical stimulus, and are invariant to 2D transformations and changes to the properties of local elements. In this dissertation, I examine evidence for the theory that representations of shapes are formed by partitioning a contour into regions of similar curvature and representing segments with a single curvature value. I first develop a computational model for how contours could be recoded abstractly as sets of constant curvature segments. I experimentally tested two free parameters in the model and then tested the model's ability to predict the perceptual difference between pairs of shapes. In Chapter 2, I showed how the visual system could encode constant curvature representations of shape from activations of oriented luminance contrast detectors in early vision, bridging a theoretical gap between subsymbolic activations that are responsive to light energy and symbolic representations that are concerned with objects, contours, and surfaces. In Chapters 3 and 4, I applied the constant curvature theory to two interesting domains of shape perception. First, I tested how and why people encode shape representations from arrays of unconnected dots. Consistent with the constant curvature theory of shape, dot arrays that were perceived to have curvilinear contours were more easily represented as shapes than dot arrays perceived to have straight edges joined at corners. In Chapter 4, I studied shapes with both global form and high frequency local contour features. Evidence was found for a hypothesis that local and global contour features are encoded independently and in separate systems. In this theory, global features are extracted from large curvature detectors and described in detail while local contour features are extracted from small curvature detectors and encoded with a few descriptive statistics rather than as individual features.

3D Shape Analysis

3D Shape Analysis
Author: Hamid Laga
Publisher: John Wiley & Sons
Total Pages: 374
Release: 2018-12-14
Genre: Mathematics
ISBN: 111940519X

Download 3D Shape Analysis Book in PDF, Epub and Kindle

An in-depth description of the state-of-the-art of 3D shape analysis techniques and their applications This book discusses the different topics that come under the title of "3D shape analysis". It covers the theoretical foundations and the major solutions that have been presented in the literature. It also establishes links between solutions proposed by different communities that studied 3D shape, such as mathematics and statistics, medical imaging, computer vision, and computer graphics. The first part of 3D Shape Analysis: Fundamentals, Theory, and Applications provides a review of the background concepts such as methods for the acquisition and representation of 3D geometries, and the fundamentals of geometry and topology. It specifically covers stereo matching, structured light, and intrinsic vs. extrinsic properties of shape. Parts 2 and 3 present a range of mathematical and algorithmic tools (which are used for e.g., global descriptors, keypoint detectors, local feature descriptors, and algorithms) that are commonly used for the detection, registration, recognition, classification, and retrieval of 3D objects. Both also place strong emphasis on recent techniques motivated by the spread of commodity devices for 3D acquisition. Part 4 demonstrates the use of these techniques in a selection of 3D shape analysis applications. It covers 3D face recognition, object recognition in 3D scenes, and 3D shape retrieval. It also discusses examples of semantic applications and cross domain 3D retrieval, i.e. how to retrieve 3D models using various types of modalities, e.g. sketches and/or images. The book concludes with a summary of the main ideas and discussions of the future trends. 3D Shape Analysis: Fundamentals, Theory, and Applications is an excellent reference for graduate students, researchers, and professionals in different fields of mathematics, computer science, and engineering. It is also ideal for courses in computer vision and computer graphics, as well as for those seeking 3D industrial/commercial solutions.

Deep Shape Representations for 3D Object Recognition

Deep Shape Representations for 3D Object Recognition
Author: Hamed Ghodrati Asbfroushani
Publisher:
Total Pages: 97
Release: 2018
Genre:
ISBN:

Download Deep Shape Representations for 3D Object Recognition Book in PDF, Epub and Kindle

Deep learning is a rapidly growing discipline that models high-level features in data as multilayered neural networks. The recent trend toward deep neural networks has been driven, in large part, by a combination of affordable computing hardware, open source software, and the availability of pre-trained networks on large-scale datasets. In this thesis, we propose deep learning approaches to 3D shape recognition using a multilevel feature learning paradigm. We start by comprehensively reviewing recent shape descriptors, including hand-crafted descriptors that are mostly developed in the spectral geometry setting and also the ones obtained via learning-based methods. Then, we introduce novel multi-level feature learning approaches using spectral graph wavelets, bag-of-features and deep learning. Low-level features are first extracted from a 3D shape using spectral graph wavelets. Mid-level features are then generated via the bag-of-features model by employing locality-constrained linear coding as a feature coding method, in conjunction with the biharmonic distance and intrinsic spatial pyramid matching in a bid to effectively measure the spatial relationship between each pair of the bag-offeature descriptors. For the task of 3D shape retrieval, high-level shape features are learned via a deep auto-encoder on mid-level features. Then, we compare the deep learned descriptor of a query shape to the descriptors of all shapes in the dataset using a dissimilarity measure for 3D shape retrieval. For the task of 3D shape classification, mid-level features are represented as 2D images in order to be fed into a pre-trained convolutional neural network to learn high-level features from the penultimate fully-connected layer of the network. Finally, a multiclass support vector machine classifier is trained on these deep learned descriptors, and the classification accuracy is subsequently computed. The proposed 3D shape retrieval and classification approaches are evaluated on three standard 3D shape benchmarks through extensive experiments, and the results show compelling superiority of our approaches over state-of-the-art methods.

Elastic Shape Analysis of Three-Dimensional Objects

Elastic Shape Analysis of Three-Dimensional Objects
Author: Ian H. Jermyn
Publisher: Morgan & Claypool
Total Pages: 185
Release: 2017-09-15
Genre: Mathematics
ISBN: 9781681730271

Download Elastic Shape Analysis of Three-Dimensional Objects Book in PDF, Epub and Kindle

Statistical analysis of shapes of 3D objects is an important problem with a wide range of applications. This analysis is difficult for many reasons, including the fact that objects differ in both geometry and topology. In this manuscript, we narrow the problem by focusing on objects with fixed topology, say objects that are diffeomorphic to unit spheres, and develop tools for analyzing their geometries. The main challenges in this problem are to register points across objects and to perform analysis while being invariant to certain shape-preserving transformations. We develop a comprehensive framework for analyzing shapes of spherical objects, i.e., objects that are embeddings of a unit sphere in ℝ, including tools for: quantifying shape differences, optimally deforming shapes into each other, summarizing shape samples, extracting principal modes of shape variability, and modeling shape variability associated with populations. An important strength of this framework is that it is elastic: it performs alignment, registration, and comparison in a single unified framework, while being invariant to shape-preserving transformations. The approach is essentially Riemannian in the following sense. We specify natural mathematical representations of surfaces of interest, and impose Riemannian metrics that are invariant to the actions of the shape-preserving transformations. In particular, they are invariant to reparameterizations of surfaces. While these metrics are too complicated to allow broad usage in practical applications, we introduce a novel representation, termed square-root normal fields (SRNFs), that transform a particular invariant elastic metric into the standard L2 metric. As a result, one can use standard techniques from functional data analysis for registering, comparing, and summarizing shapes. Specifically, this results in: pairwise registration of surfaces; computation of geodesic paths encoding optimal deformations; computation of Karcher means and covariances under the shape metric; tangent Principal Component Analysis (PCA) and extraction of dominant modes of variability; and finally, modeling of shape variability using wrapped normal densities. These ideas are demonstrated using two case studies: the analysis of surfaces denoting human bodies in terms of shape and pose variability; and the clustering and classification of the shapes of subcortical brain structures for use in medical diagnosis. This book develops these ideas without assuming advanced knowledge in differential geometry and statistics. We summarize some basic tools from differential geometry in the appendices, and introduce additional concepts and terminology as needed in the individual chapters.

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation
Author: Derek Hoiem
Publisher: Morgan & Claypool Publishers
Total Pages: 172
Release: 2011
Genre: Computers
ISBN: 1608457281

Download Representations and Techniques for 3D Object Recognition and Scene Interpretation Book in PDF, Epub and Kindle

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Perception as Bayesian Inference

Perception as Bayesian Inference
Author: David C. Knill
Publisher: Cambridge University Press
Total Pages: 532
Release: 2008-06-12
Genre: Computers
ISBN: 9780521064996

Download Perception as Bayesian Inference Book in PDF, Epub and Kindle

In recent years, Bayesian probability theory has emerged not only as a powerful tool for building computational theories of vision, but also as a general paradigm for studying human visual perception. This book provides an introduction to and critical analysis of the Bayesian paradigm. Leading researchers in computer vision and experimental vision science describe general theoretical frameworks for modeling vision, detailed applications to specific problems and implications for experimental studies of human perception. The book provides a dialogue between different perspectives both within chapters, which draw on insights from experimental and computational work, and between chapters, through commentaries written by the contributors on each other's work. Students and researchers in cognitive and visual science will find much to interest them in this thought-provoking collection.

Deep Object-centric 3D Perception

Deep Object-centric 3D Perception
Author: Li Yi
Publisher:
Total Pages:
Release: 2019
Genre:
ISBN:

Download Deep Object-centric 3D Perception Book in PDF, Epub and Kindle

Teaching machines to perceive visual content in a 3D environment as humans do is a central topic in Artificial Intelligence. The goal is to be able to process different types of 3D sensory inputs and generate symbolic or numerical descriptions about the environment to support decision making. In this thesis, we advocate an object-centric way to generate such descriptions, in which we represent an environment as a collection of 3D objects equipped with various attributes important for specific tasks. To generate such a representation, we focus on deep object-centric 3D perception, a class of approach built upon 3D deep learning techniques. This thesis covers three critical components of deep object-centric 3D perception: constructing large-scale 3D model repository, designing 3D deep learning frameworks to consume various formats of 3D data, applying big data and deep learning techniques to real perception tasks. We start by providing an overview of each component. Following this, we show how we could accelerate the labeling acquisition process to scale-up 3D model repositories so that data-hungry deep learning approaches can be applied. 3D data can usually be represented in different formats. Some of the prevalent geometric formats, such as point cloud and polygon mesh, poses a significant challenge to deep learning framework design since traditional deep nets designed for regular data forms, e.g., images, can not be directly applied. We then investigate how to build deep learning frameworks capable of consuming 3D shape meshes, an irregular graph-structured data format. Next, we provide two real perception applications as case studies, to show how big data and 3D deep learning help the field evolve. In particular, we study instance segmentation in 3D point cloud and develop a novel 3D object proposal network named GSPN as well as a 3D instance segmentation framework named R-PointNet, which boosts the state-of-the-art instance segmentation performance by a large margin on existing benchmarks. In the second application, we go one step further and tackle detailed part-level perception. We study the problem of articulation-based object part segmentation. We show how to modularize deep network design by disentangling complex perception problems into subproblems. We conclude by summarizing our efforts and discuss the challenges and open questions in the field.

Inference of 3D Shape from Line Drawings

Inference of 3D Shape from Line Drawings
Author: Seha Kim
Publisher:
Total Pages: 58
Release: 2015
Genre: Shapes
ISBN:

Download Inference of 3D Shape from Line Drawings Book in PDF, Epub and Kindle

Line drawings lack direct 3D depth information, yet human vision easily perceives the 3D shapes from the contours. This dissertation investigates the mechanisms underlying the 3D shape inference from 2D line drawings. Here, four psychophysical experiments and a computational model for the 3D shape inference are discussed. Experiment 1 shows that human responses in depth judgments for line drawings reflect an underlying uncertainty of the perceived 3D shape, which is based on the complex interaction of local and global depth cues propagated from the contours. The computational model estimates the posterior probability of possible 3D surfaces from the contours of a line drawing in a Bayesian framework. The comparison of the model predictions and human depth responses for the line drawings from Experiment 1 demonstrates that the model accounts for the probabilistic 3D shape interpretation of line drawings by human vision. Experiment 2 shows that the reliability of a contour segment in a line drawing as a meaningful depth cue is conditional to the complex global context. Experiments 3 and 4 show that the certainty of depth difference perceptions from partial line drawings increases as more non-local visual cues are available. The experiments and the model offer a new perspective on 3D shape perception from line drawings as an inference based on the probability over possible 3D shapes given the contour cues, providing a broader understanding on the mechanisms of human vision.