Towards general-purpose image registration
by Yang, Gehua, Ph.D., RENSSELAER POLYTECHNIC INSTITUTE, 2007, 199 pages; 3286353

Abstract:

This thesis presents four image-registration-related methods. The first two methods aid the construction of a fully-automatic retinal image registration system, called RIVERS.

The first method, automatic masking, is a preprocessing technique for image registration that separates the retinal surface from the background in each image. It is especially important when aligning retinal images scanned from slides or images acquired from different cameras.

The second method, a covariance-driven refinement technique, is developed to handle pairs with extreme low overlap (e.g. 5%) in the context of aligning a set of sparsely-overlap images.

The third method, named GDB-ICP, is an automated 2d-image-pair registration algorithm using a hypothesis-and-test strategy and an extension of the Dual-Bootstrap method for refinement. This algorithm is capable of aligning images taken of a wide variety of natural and man-made scenes as well as many medical images, tolerating low overlap, substantial orientation and scale differences, large illumination variations, and physical changes in the scene. An important component of this is the ability to automatically reject pairs that have no overlap or have too many differences to be aligned well. Experimental results on a data set of 22 challenging image pairs show that the algorithm effectively aligns 19 of the 22 pairs and rejects 99.8% of the misalignments that occur when all possible pairs are tried.

An extension of the algorithmic principle of GDB-ICP, the fourth method, a 3d-to-2d alignment algorithm, is applied to estimate the location of a hand-held camera with respect to a 3d model augmented with texture information. Little prior knowledge is assumed about the camera location. A key issue is that initially the model-to-image mapping is well-approximated by a simple 2d-to-2d transformation based on a local model surface approximation. However, the algorithm must transition to the 3d-to-2d projection necessary to solve the position estimation problem. The experiments are conducted on a collection of 9 range scans and 60 image covering approximately a 100m x 100m region of RPI campus. The algorithm successfully and correctly determines the camera location of 52 images, while indicates it cannot find an alignment for the remaining 8.

 
AdviserCharles V. Stewart
SchoolRENSSELAER POLYTECHNIC INSTITUTE
SourceDAI/B 68-10, p. , Jan 2008
Source TypeDissertation
SubjectsArtificial intelligence; Computer science
Publication Number3286353
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3286353
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.