Fall 2021 Data Science Seminar Series: Matthias Katzfuss: Scalable Gaussian-Process Approximations for Big Data

Speaker: Matthias Katzfuss, Ph.D. Associate Professor, Department of Statistics, Texas A&M University

Faculty Host: : Yu Ding, Texas A&M Institute of Data Science

Abstract: Gaussian processes (GPs) are popular, flexible, and interpretable probabilistic models for functions in geospatial analysis, computer-model emulation, and machine learning. However, direct application of GPs involves dense covariance matrices and is computationally infeasible for large datasets. We consider a framework for fast GP inference based on the so-called Vecchia approximation, which implies a sparse Cholesky factor of the inverse covariance matrix. The approximation can be written in closed form and computed in parallel, and it includes many popular existing approximations as special cases. We discuss applications of the framework to noisy and non-Gaussian data, to emulation of computer experiments, and to nonparametric regression and variable selection.

Biography: Dr. Matthias Katzfuss is an Associate Professor in the Department of Statistics at Texas A&M University. His research interests include spatial and spatio-temporal statistics, Gaussian processes, computational statistics for massive datasets, and data assimilation, with applications to environmental problems and satellite remote-sensing data. His research has been funded by NSF, NASA, NOAA, USDA, and the Jet Propulsion Laboratory. Dr. Katzfuss is the recipient of an NSF Career Award, a Fulbright Scholarship, and an Early Investigator Award by the ASA Section on Statistics and the Environment.

Link to pdf version

You can also click this link to join the seminar

This image has an empty alt attribute; its file name is tamidsFooter-1024x81.jpg


For more information about TAMIDS seminar series, please contact Ms. Jennifer South at jsouth@tamu.edu