Research

My research interests are Approximation Theory, High-dimensioanl Probability, and in particular their applications in Scientific Machine Learning and Data Science.

Optimal Recovery

Optimal Recovery (OR), which is a subfield of Approximation Theory, can be viewed as a nonstatistical learning Theory. The main task is to approximate an unknown function from observations and an explicit model assumption on the function to be recovered. The problem is considered under deterministic problem setting where we assume function inputs are fixed quantities (even unfavorable) and observational errors are bounded not random. To assess the performance, we adopt worst-case error perspective which is key to Optimal Recovery.

We observed that the optimal recovery framework has a closed relation to Learning Theory, Numerical Analysis (Quadrature Rule), Gaussian Process Regression and Estimation Theory. It has been used to explain many practical problems, but the development of optimal recovery slowed down due to the lack of computational advantages.

The goal of my research is to make optimal recovery more computational-embracing. Some recent papers with reproducible files can be found here.

Scientific Machine Learning

Scientific machine learning (SciML), which is a rapidly emerging field, combines data-driven methods with traditional scientific modeling to solve complex physical and engineering problems. It bridges the gap between traditional approaches that rely on explicit physical laws, and modern machine learning techniques that derive patterns from data. My work focuses on the theoretical foundations and computational methods that make these approaches reliable, efficient, and broadly applicable across scientific domains.

Operator Learning

Operator learning focuses on learning mappings between function spaces rather than finite-dimensional inputs and outputs. Instead of predicting a single value, these models learn entire solution operators—such as mapping a differential equation’s inputs (e.g., boundary conditions) to its solutions. This approach is particularly powerful for accelerating simulations of physical systems and enables rapid generalization across varying conditions.