Content-aware approaches for digital video adaptation, summarization and compression
by Lu, Taoran, Ph.D., UNIVERSITY OF FLORIDA, 2010, 128 pages; 3467681

Abstract:

In this dissertation, we mainly present our work on three challenging problems of digital video applications: video compression, summarization and adaptation. Unlike conventional techniques, we focus on the video content modeling and investigate how the characteristics of human attention will help solving these problems. We denote these approaches “content-aware” approaches and present our innovations in the main body of this dissertation.

The first problem is content-aware video adaptation. We employ saliency analysis, which generates a saliency map to indicate the relative importance of pixels within a frame for human attention modeling. We also propose a nonlinear saliency map fusing approach that considers human perceptual characteristics. To effectively map the important content from source to the target display, we propose to have both intra-frame visual considerations and inter-frame visual considerations, where intra-considerations focus on measuring the information loss within a frame, and inter-considerations emphasis the visual smoothness between frames. The mapping problem is formulated as a shortest path problem and is solved with dynamic programming.

The second problem is content-aware video summarization. We introduce an automatic video summarization approach that includes unsupervised learning of original video-audio concept primitives and hierarchical (both frame and shot levels) skimming. For video concept mining, we propose a novel model using bag-of-word (BoW) shot features. We further design a hierarchical video summarization framework which jointly considers content completeness, saliency, smoothing and scalability.

Another problem that we investigate is content-aware video compression. We study this problem in two aspects. In one aspect, we propose a content-aware framework and formulate the constraint optimization of rate-distortion as a resource allocation problem, where bit-allocation is adjusted differently at two levels: region-of-interest (ROI) and non-ROI, intra frames and inter frames. The results exhibit better visual quality as well as objective quality improvement. In the other aspect, we aim at improving the coding efficiency of existing H.264 intra prediction. We incorporate a reverse encoding order with geometric analysis for binary transition points on block boundaries to explicitly derive the prediction direction. Besides, we design and implement a video coding parameter analyzer to facilitate the development of new coding tools for state-of-the-art and next-generation video compression standards.

 
AdviserDapeng Oliver Wu
SchoolUNIVERSITY OF FLORIDA
SourceDAI/B 72-10, p. , Aug 2011
Source TypeDissertation
SubjectsComputer engineering; Electrical engineering
Publication Number3467681
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3467681
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.