UMI  
ProQuest® Dissertations & Theses
The world's most comprehensive collection of dissertations and theses. Learn more...
ProQuest  
 
 
Arabic handwriting recognition using machine learning approaches
by Ball, Gregory Raymond, Ph.D., STATE UNIVERSITY OF NEW YORK AT BUFFALO, 2007, 107 pages; 3277785
 

Abstract:

While handwriting recognition tasks for Latin script based languages have received considerable attention, far less work has been done on the Arabic script. Arabic poses some unique challenges, such as a larger character set, the presence of dots and diacritics, and intra-word whitespace regions. Machine learning approaches have the potential to significantly improve state of the art Arabic handwriting recognition results. This dissertation presents several such machine learning techniques, such as writer adaptation and segmentation free unconstrained text processing. We integrate these techniques into novel algorithms for general recognition, word spotting, and transcript mapping.

Writer adaptation or specialization is the adjustment of handwriting recognition algorithms to a specific writer's style of handwriting. Such adjustment yields significantly improved recognition rates over a generalized recognition counterpart algorithms. Specialization is commonly used in online Latin script handwriting applications, such as for tablet computers or PDAs. Some rudimentary offline Latin script adaptation methods have been proposed recently in the literature as well. Handwriting adaptation for the Arabic script, however, is unexplored. An iterative bootstrapping model is presented which adapts a writer-independent model to a writer-dependent model using a small number of words achieving a large recognition rate increase in the process. Furthermore, a confidence weighting method is described which generates better results by weighting words based on their length. Script features unique to Arabic are discussed, as well as they are incorporated into the adaptation process. Even though Arabic has many more character classes than languages such as English, significant improvement is observed.

One issue common to Arabic recognition tasks is generating candidate word regions on a page. Attempting to definitely segment the document into such regions (automatic segmentation) can meet with some success, but the performance of such an algorithm is often a limiting factor in spotting performance. Another approach is to directly scan the image on the page without attempting to generate such a definite segmentation. Such segmentation-free approaches result in better recognition at a performance cost.

The algorithms discussed are tested using a database of truthed, page-length, handwritten Arabic documents. Where applicable, the literature standard IFN/ENIT database is used for testing as well. We validate our approaches by exploring the implications on such tasks as word spotting (attempting to find a query word or image and placement in a set of documents), transcript mapping (the automatic alignment of a handwritten document with its machine readable transcript), and general unconstrained recognition. Novel algorithms for these tasks are also presented.

Specifically, contributions in this dissertation include novel descriptions of machine learning algorithms applied to Arabic handwriting recognition problems and quantification of the improvement generated by their usage. Examples of such algorithms include writer adaptation, versatile search, and the advantage and trade offs gained by processing such tasks as word spotting in a segmentation-free fashion instead of a segmentation-based manner.

 
Advisor: Srihari, Sargur N.
School: STATE UNIVERSITY OF NEW YORK AT BUFFALO
Source: DAI-B 68/09, p. , Mar 2008
Source Type: Ph.D.
Subjects: Computer science
Publication Number: 3277785
     
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3277785
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

 
 
 

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.il.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.



Copyright © 2007 ProQuest. All rights reserved. Terms and Conditions

ProQuest