Analyzing 3D Objects in 2D Images

Analyzing 3D Objects in 2D Images
Author: Mohsen Hejratin
Publisher:
Total Pages: 89
Release: 2015
Genre:
ISBN: 9781339124209

Download Analyzing 3D Objects in 2D Images Book in PDF, Epub and Kindle

Robots are mechanically capable of doing many tasks, carrying loads, precisely manipulating objects, picking and packing or collaborating with humans. However, they require accurate 3D perception of objects and surrounding environment to do these tasks autonomously. Traditional methods build 3D representation of the scene using structure from motion techniques or depth sensors, while more recent approaches use statistical models to learn geometry and appearance of 3D objects and scenes. This thesis investigates approaches to represent, learn and analyze 3D objects in natural images. We first propose two new methods for 3D object recognition and pose estimation in single 2D images. Second, we study various geometric representations for the novel task of primitive 3D shape categorization. We propose two novel approaches for recognizing 3D objects: (1) Aligning a 3D model to detected 2D landmarks, where we propose a novel method based on deformable-part models to propose candidate detections and 2D estimates of shape, then these estimates are refined by using an explicit 3D model of shape and viewpoint. (2) An analysis by synthesis approach where a forward synthesis model constructs possible geometric interpretations of the world, and then selects the interpretation that best agrees with the measured visual evidence. We show state of the art performance for detection and pose estimation on two challenging 3D object recognition datasets of cars and cuboids. 3D object recognition methods focus on modeling 3D shape of the objects, however, many objects may have similar 3D shape (washing machines, cabinets and microwave are all cuboidal), thus recognizing them require reasoning about appearance and geometry at the same time. The natural approach for recognition might extract pose-normalized appearance features. Though such approaches are extraordinarily common in the literature, in this thesis we demonstrate that they are {\em not optimal}. Instead, we introduce methods based on pose-synthesis, a somewhat simple approach of augmenting training data with geometrically perturbed training samples. We demonstrate that synthesis is a surprisingly simple but effective strategy that allows for state-of-the-art categorization and automatic 3D alignment.

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation
Author: Derek Hoiem
Publisher: Morgan & Claypool Publishers
Total Pages: 172
Release: 2011
Genre: Computers
ISBN: 1608457281

Download Representations and Techniques for 3D Object Recognition and Scene Interpretation Book in PDF, Epub and Kindle

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Recognizing 3D Objects from 2D Images

Recognizing 3D Objects from 2D Images
Author: William Eric Leifur Grimson
Publisher:
Total Pages: 0
Release: 1992
Genre: Computer vision
ISBN:

Download Recognizing 3D Objects from 2D Images Book in PDF, Epub and Kindle

Abstract: "Many recent object recognition systems use a small number of pairings of data and model features to compute the 3D transformation from a model coordinate frame into the sensor coordinate system. In the case of perfect image data, these systems seem to work well. With uncertain image data, however, the performance of such methods is less well understood. In this paper, we examine the effects of two- dimensional sensor uncertainty on the computation of three-dimensional model transformations. We use this analysis to bound the uncertainty in the transformation parameters, as well as the uncertainty associated with applying the transformation to map other model features into the image. We also examine the effects of the transformation uncertainty on the effectiveness of recognition methods."

Representations and Techniques for 3D Object Recognition and Scene Interpretation

Representations and Techniques for 3D Object Recognition and Scene Interpretation
Author: Derek Hoiem
Publisher: Morgan & Claypool Publishers
Total Pages: 171
Release: 2011-09-09
Genre: Technology & Engineering
ISBN: 160845729X

Download Representations and Techniques for 3D Object Recognition and Scene Interpretation Book in PDF, Epub and Kindle

One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Recognizing 3D Objects from 2D Images

Recognizing 3D Objects from 2D Images
Author: William Eric Leifur Grimson
Publisher:
Total Pages: 30
Release: 1991
Genre: Computer vision
ISBN:

Download Recognizing 3D Objects from 2D Images Book in PDF, Epub and Kindle

Abstract: "Many recent object recognition systems use a small number of pairings of data and model features to compute the 3D transformation from a model coordinate frame into the sensor coordinate system. In the case of perfect image data, these systems seem to work well. With uncertain image data, however, the performance of such methods is less well understood. In this paper, we examine the effects of two- dimensional sensor uncertainty on the computation of three-dimensional model transformations. We use this analysis to bound the uncertainty in the transformation parameters, as well as the uncertainty associated with applying the transformation to map other model features into the image. We also examine the effects of the transformation uncertainty on the effectiveness of recognition methods."

Multispectral Image Processing and Pattern Recognition

Multispectral Image Processing and Pattern Recognition
Author: Jun Shen
Publisher: World Scientific
Total Pages: 144
Release: 2001
Genre: Computers
ISBN: 9789812797599

Download Multispectral Image Processing and Pattern Recognition Book in PDF, Epub and Kindle

A study of multispectral image processing and pattern recognition. It covers: geometric and orthogonal moments; minimum description length method for facet matching; an integrated vision system for ALV navigation; fuzzy Bayesian networks; and more.

Boosting for Generic 2D/3D Object Recognition

Boosting for Generic 2D/3D Object Recognition
Author: Doaa Abd al-Kareem Mohammed Hegazy
Publisher:
Total Pages: 0
Release: 2009
Genre:
ISBN:

Download Boosting for Generic 2D/3D Object Recognition Book in PDF, Epub and Kindle

Generic object recognition is an important function of the human visual system. For an artificial vision system to be able to emulate the human perception abilities, it should also be able to perform generic object recognition. In this thesis, we address the generic object recognition problem and present different approaches and models which tackle different aspects of this difficult problem. First, we present a model for generic 2D object recognition from complex 2D images. The model exploits only appearance-based information, in the form of a combination of texture and color cues, for binary classification of 2D object classes. Learning is accomplished in a weakly supervised manner using Boosting. However, we live in a 3D world and the ability to recognize 3D objects is very important for any vision system. Therefore, we present a model for generic recognition of 3D objects from range images. Our model makes use of a combination of simple local shape descriptors extracted from range images for recognizing 3D object categories, as shape is an important information provided by range images. Moreover, we present a novel dataset for generic object recognition that provides 2D and range images about different object classes using a Time-of-Flight (ToF) camera.