On numerical properties of data assimilation methods
by Li, Jia, Ph.D., PURDUE UNIVERSITY, 2009, 101 pages; 3403113

Abstract:

In sequential data assimilation problems, the Kalman filter (KF) is optimal for linear Gaussian models. The ensemble Kalman filter (EnKF) has been widely used as a numerical approximation of the KF, primarily because it is easy to implement with Monte Carlo methods. Similarly, the optimal filter (Bayesian filter) plays an important role in nonlinear non-Gaussian models, and the particle filter (PF) is widely used as a Monte Carlo version of the Bayesian filter for practical reasons. This thesis consists of two main components: (1) conducting error analysis on the EnKF and PF, and (2) proposing new data assimilation methods to improve efficiency and accuracy.

A rigorous analysis of the numerical errors of the EnKF is conducted in a general setting. Error bounds are provided and convergence of the EnKF to the exact Kalman filter is established. The analysis reveals that the ensemble errors induced by Monte Carlo sampling can be dominant compared to other errors, such as the numerical integration error of the underlying model equations.
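
The sampling error discussed above can be illustrated with a minimal sketch. It is not the setting analyzed in the thesis: it assumes a scalar state, a Gaussian prior N(m, P), a direct observation y = x + noise with variance R, and the common perturbed-observation form of the EnKF. The function names `kf_update` and `enkf_update_mean` are hypothetical and chosen for illustration only.

```python
import random
import statistics

def kf_update(m, P, y, R):
    """Exact scalar Kalman update for prior N(m, P) and observation y = x + N(0, R)."""
    K = P / (P + R)                      # Kalman gain
    return m + K * (y - m), (1.0 - K) * P

def enkf_update_mean(m, P, y, R, n, seed=0):
    """Analysis mean of a perturbed-observation EnKF with n ensemble members."""
    rng = random.Random(seed)
    # Forecast ensemble drawn from the prior (the Monte Carlo step).
    ens = [rng.gauss(m, P ** 0.5) for _ in range(n)]
    P_e = statistics.variance(ens)       # sample forecast variance
    K_e = P_e / (P_e + R)                # gain built from the sample variance
    # Each member assimilates its own perturbed copy of the observation.
    analysis = [x + K_e * (y + rng.gauss(0.0, R ** 0.5) - x) for x in ens]
    return sum(analysis) / n

if __name__ == "__main__":
    m_exact, _ = kf_update(0.0, 1.0, 1.0, 0.5)
    for n in (10, 100, 1000, 10000):
        err = abs(enkf_update_mean(0.0, 1.0, 1.0, 0.5, n) - m_exact)
        print(f"n={n:6d}  |EnKF mean - KF mean| = {err:.4f}")
```

In this toy problem the forecast model is trivial, so the gap between the two means is purely ensemble sampling error. For any single seed the error need not decrease monotonically with the ensemble size.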

A new error analysis is conducted for the PF from a numerical analysis perspective, which differs from the existing convergence results for the PF in the probability literature. In this thesis, we demonstrate that the difference between the optimal filter and the PF will, in general, increase exponentially. This error consists mainly of two parts: (1) the numerical errors from solving the dynamic equations, and (2) the sampling errors introduced by generating particles. Convergence of the PF to the optimal filter in the weak sense, with the numerical error incorporated, is proved, and bounds for the one-step local error and the cumulative global error are provided.
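
As a rough illustration of the sampling part of this error, the sketch below runs a bootstrap particle filter on a scalar linear Gaussian model, where the optimal filter is the Kalman filter and can be computed exactly for comparison. Everything here is an assumption made for illustration: the model x_{k+1} = a x_k + noise, the parameter values, the multinomial resampling step, and the function names `kalman_filter` and `bootstrap_pf`. The dynamics are propagated exactly, so the numerical error from solving the dynamic equations, item (1) above, is absent from this sketch.

```python
import math
import random

def kalman_filter(ys, a, Q, R, m=0.0, P=1.0):
    """Exact filter mean for x_{k+1} = a*x_k + N(0, Q), y_k = x_k + N(0, R)."""
    for y in ys:
        m, P = a * m, a * a * P + Q              # forecast step
        K = P / (P + R)                          # Kalman gain
        m, P = m + K * (y - m), (1.0 - K) * P    # analysis step
    return m

def bootstrap_pf(ys, a, Q, R, n, seed=0):
    """Filter mean from a bootstrap particle filter with n particles."""
    rng = random.Random(seed)
    parts = [rng.gauss(0.0, 1.0) for _ in range(n)]   # initial state ~ N(0, 1)
    mean = 0.0
    for y in ys:
        # Propagate every particle through the stochastic dynamics.
        parts = [a * x + rng.gauss(0.0, math.sqrt(Q)) for x in parts]
        # Weight by the Gaussian observation likelihood (unnormalized).
        w = [math.exp(-0.5 * (y - x) ** 2 / R) for x in parts]
        mean = sum(wi * xi for wi, xi in zip(w, parts)) / sum(w)
        # Multinomial resampling in proportion to the weights.
        parts = rng.choices(parts, weights=w, k=n)
    return mean

if __name__ == "__main__":
    ys = [1.0, 0.8, 0.5, 0.9, 0.7]               # made-up observations
    exact = kalman_filter(ys, 0.9, 0.1, 0.5)
    for n in (50, 500, 5000):
        err = abs(bootstrap_pf(ys, 0.9, 0.1, 0.5, n) - exact)
        print(f"n={n:5d}  |PF mean - optimal mean| = {err:.4f}")
```

The observation values are made up. The comparison only shows how the particle estimate can be checked against a known optimal filter, not the error bounds derived in the thesis.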

The analyses of both the EnKF and the PF suggest a less obvious fact: more frequent data assimilation may lead to larger numerical errors in the EnKF and PF.

Based on the analysis, two sets of methods to reduce sampling errors for the EnKF are developed. First, we present a deterministic sampling strategy (qEnKF) based on cubature rules, with much improved accuracy. Second, we propose an efficient EnKF implementation via generalized polynomial chaos (gPC) expansion. The key ingredients of the gPC-based approach are (1) solving the system of stochastic state equations via the gPC methodology to gain efficiency, and (2) sampling the gPC approximation of the stochastic solution with an arbitrarily large number of samples, at virtually no additional computational cost, to drastically reduce sampling errors. The resulting algorithms thus achieve high accuracy at reduced computational cost compared to the classical implementations of the EnKF.
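
The idea behind deterministic sampling can be sketched in one dimension. The example below is not the qEnKF of the thesis: it swaps in the standard 3-point Gauss-Hermite rule as a stand-in for a cubature rule and uses a hypothetical cubic forecast map `f`. It compares the forecast mean obtained from three deterministic nodes with a Monte Carlo estimate from random samples. The gPC-based approach is not illustrated here.

```python
import math
import random

def f(x):
    """Hypothetical nonlinear forecast map, chosen only for illustration."""
    return x + 0.1 * x ** 3

def quadrature_mean(m, P):
    """E[f(X)] for X ~ N(m, P) via the 3-point Gauss-Hermite rule.

    The rule is exact for polynomials up to degree 5, so it is exact for f.
    """
    s = math.sqrt(3.0 * P)
    nodes = (m, m + s, m - s)
    weights = (2.0 / 3.0, 1.0 / 6.0, 1.0 / 6.0)
    return sum(w * f(x) for w, x in zip(weights, nodes))

def monte_carlo_mean(m, P, n, seed=0):
    """E[f(X)] estimated from n random samples of X ~ N(m, P)."""
    rng = random.Random(seed)
    return sum(f(rng.gauss(m, math.sqrt(P))) for _ in range(n)) / n

if __name__ == "__main__":
    m, P = 1.0, 0.25
    # Closed form: E[X] + 0.1 * E[X^3], with E[X^3] = m^3 + 3*m*P.
    exact = m + 0.1 * (m ** 3 + 3.0 * m * P)
    print("quadrature error:", abs(quadrature_mean(m, P) - exact))
    for n in (10, 1000, 100000):
        print(f"Monte Carlo error, n={n}:", abs(monte_carlo_mean(m, P, n) - exact))
```

Three deterministic nodes reproduce the forecast mean up to rounding because the map is a cubic polynomial. For a general nonlinear map the rule is only approximate, and in higher dimensions the choice of cubature rule matters.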

Numerical examples are provided to verify the theoretical findings and to demonstrate the improved performance of the qEnKF and gPC-based EnKF.

 
Adviser: Dongbin Xiu
School: PURDUE UNIVERSITY
Source: DAI/B 71-05, p. , Jun 2010
Source Type: Dissertation
Subjects: Mathematics
Publication Number: 3403113
