Vision-based 3D hand posture estimation using hierarchical-ISOSOM
by Guan, Haiying, Ph.D., UNIVERSITY OF CALIFORNIA, SANTA BARBARA, 2007, 135 pages; 3283638

Abstract:

Hand gesture is a potentially useful modality for visual-based interaction (VBI) in human-computer interaction (HCI), but estimating hand posture and pose under camera viewpoint variations is a challenging problem due to the high intrinsic degrees of freedom (DOF) and the self-occlusion problem. This work focuses on the problem of appearance-based 3D hand posture and pose estimation.

Firstly, we propose the Isometric Self-Organizing Map (ISOSOM) algorithm, which provides a concise representation of randomly sampled data points in high dimensional space by a small number of representatives organized in a structure in a low dimensional manifold. In order to resolve the complexity problem of ISOSOM, we proposed the Hierarchical-ISOSOM (H-ISOSOM) algorithm. The hierarchical structure follows a coarse-to-fine strategy. According to the coarse global structure, it "unfolds" the manifold at the coarse level and decomposes the sample data into small patches, then iteratively learns the nonlinearity of each patch in finer levels. We compare the proposed method with similar methods on five standard data sets. Encouraging quantitative experimental results from synthetic data sets illustrate the validity of ISOSOM and H-ISOSOM.

Then we formulate the appearance-based hand pose estimation to a nonlinear manifold learning and representation problem. We apply the H-ISOSOM algorithm to learn a concise, organized representation of the hand manifold, which is a complex, extremely non-linear, large scale, high-dimensional manifold composed of three components: hand image features, hand postures, and hand poses. We represent hand images using several feature descriptors, such as shape context based on 2D intensity edge and 2.5D depth edge. Given the hand image features, the relevant representatives with the camera viewpoint ground-truth are retrieved by the learned map, which maintains the intrinsic structure revealed by a huge number of synthetic training images.

In order to resolve the ambiguity caused by hand self-occlusion, we proposed a MAP (maximum a posteriori) framework to fuse the information obtained from multiple cameras for better retrieval accuracy. The quantitative experimental results validate the effectiveness and efficiency of H-ISOSOM and MAP framework, which shows that retrieval performance of H-ISOSOM with a multi-view approach is much better than Nearest Neighbor algorithm with single-view approach.

 
AdviserMatthew Turk
SchoolUNIVERSITY OF CALIFORNIA, SANTA BARBARA
SourceDAI/B 68-10, p. , Jan 2008
Source TypeDissertation
SubjectsComputer science
Publication Number3283638
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3283638
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.