Machine learning (ML) surrogates are fast, data-driven approximations of expensive scientific simulations. They enable near real-time predictions, massive design space exploration, and uncertainty quantification that would be impossible with direct simulation alone. Use surrogates when you need iterative optimization or thousands of evaluations; stick with direct simulation for final validation or when high accuracy is non-negotiable. This guide covers the fundamentals, types, implementation workflow, and critical pitfalls to avoid.
Introduction: The Computational Bottleneck
Scientific simulations—whether solving partial differential equations (PDEs) with FiPy, running finite element analysis, or modeling fluid dynamics—are computationally expensive. A single high-fidelity simulation can take hours, days, or even weeks on powerful hardware. When you need to explore hundreds of design variations, perform sensitivity analysis, or run uncertainty quantification with thousands of samples, the computational cost becomes prohibitive.
This is where machine learning surrogates come in.
What Are Machine Learning Surrogates?
A surrogate model (or metamodel) is a fast, approximate model that mimics the input-output behavior of a complex, expensive simulation [1]. Instead of solving the full physics-based equations each time, you train a machine learning model on a limited set of simulation results, then use it to make instant predictions for new inputs.
Think of it this way:
- Direct simulation: High accuracy, high cost, physics-based
- ML surrogate: Good approximation, near-zero cost, data-driven
The trade-off is clear: surrogates sacrifice some accuracy for massive speed gains, enabling tasks that would otherwise be impossible.
Key Terminology
- Training data: A set of input parameters and corresponding simulation outputs used to fit the surrogate.
- Surrogate model: The ML model that approximates the simulation.
- Active learning: Strategically selecting new simulation points to most improve the surrogate.
- Uncertainty quantification (UQ): Estimating the confidence or error bounds of surrogate predictions.
When to Use ML Surrogates vs Direct Simulation
Not every problem benefits from surrogate modeling. The decision matrix below summarizes when to choose each approach [2]:
| Feature | Machine Learning Surrogate | Direct Simulation |
|---|---|---|
| Speed | Instant / Real-time | Slow (Hours/Days) |
| Accuracy | Good Approximation | High / Ground Truth |
| Cost | Low (after training) | High |
| Data Requirement | High (for training) | None (Physics-based) |
| Best For | Iterative design, Optimization | Validation, New Physics |
Use ML Surrogates When:
- High computation cost (hours/days per run)
- Iterative design/optimization (100s-1000s of evaluations)
- Real-time interaction or user-facing apps
- Uncertainty quantification (Monte Carlo needs 1000s of runs)
Use Direct Simulation When:
- Ultimate accuracy required (final validation, safety certifications)
- One-off analyses (training data cost > benefit)
- New physical regimes with no training data
- Regulatory or safety-critical certification
Bottom line: Build a surrogate if you need to evaluate the simulation many times with slightly different inputs. Use direct simulation for final checks, novel scenarios, or when the simulation is already fast enough.
Types of Machine Learning Surrogate Models
Several ML architectures are commonly used for surrogate modeling, each with strengths and trade-offs [3]:
1. Gaussian Processes (Kriging)
Gaussian processes (GPs) are a natural choice for scientific surrogates because they provide probabilistic predictions—not just a value, but also an uncertainty estimate [4]. This is invaluable for risk-aware design and active learning.
Pros:
- Built-in uncertainty quantification
- Works well with small to medium datasets (< 10,000 points)
- Robust to overfitting with proper covariance kernel
Cons:
- Scales poorly with training-set size (exact training requires an O(n³) covariance-matrix factorization)
- Limited to low-to-moderate input dimensions (< 50)
Best for: Engineering design optimization, small parameter spaces, applications where uncertainty bounds are critical.
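As a concrete illustration, a GP surrogate can be fit with scikit-learn's `GaussianProcessRegressor`. Here `expensive_simulation` is a cheap stand-in for a real solver, and the one-dimensional parameter range is a placeholder:

```python
# Minimal GP surrogate sketch; expensive_simulation and the range [0, 2]
# are illustrative placeholders for a real simulation and its inputs.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, ConstantKernel

def expensive_simulation(x):
    """Stand-in for an expensive solver: a smooth 1-D response."""
    return np.sin(3 * x) + 0.5 * x

rng = np.random.default_rng(0)
X_train = rng.uniform(0.0, 2.0, size=(20, 1))   # 20 "simulation runs"
y_train = expensive_simulation(X_train).ravel()

kernel = ConstantKernel(1.0) * RBF(length_scale=0.5)
gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
gp.fit(X_train, y_train)

# Probabilistic prediction: a mean AND a standard deviation per point
X_new = np.linspace(0.0, 2.0, 5).reshape(-1, 1)
mean, std = gp.predict(X_new, return_std=True)
```

The `return_std=True` call is what makes GPs attractive for surrogates: every prediction carries an uncertainty estimate, which later drives active learning.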
2. Neural Networks and Deep Learning
Neural networks, especially deep architectures, excel with large datasets and high-dimensional inputs [5]. Recent advances like Fourier Neural Operators (FNOs) and Physics-Informed Neural Networks (PINNs) can even learn to solve entire families of PDEs.
Pros:
- Fast inference after training
- Handles high-dimensional inputs and complex relationships
- Can incorporate physics constraints (PINNs)
Cons:
- Requires substantial training data
- Black-box nature; uncertainty quantification needs extra techniques
- Risk of overfitting without careful regularization
Best for: Large-scale problems, image-like data (e.g., field outputs), operator learning (mapping parameters to full solution fields).
3. Polynomial Chaos Expansion (PCE)
PCE represents the simulation output as a spectral expansion in terms of orthogonal polynomials of the random inputs [6]. It’s particularly well-suited for probabilistic input distributions and provides analytic expressions for moments.
Pros:
- Efficient for moderate input dimensions
- Direct access to statistical moments (mean, variance)
- Rigorous mathematical foundation
Cons:
- Assumes smooth dependence on inputs
- Curse of dimensionality beyond ~20 variables
- Requires specialized basis construction
Best for: Uncertainty propagation, sensitivity analysis when inputs are independent and distributions are known.
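For intuition, here is a minimal one-dimensional PCE sketch using probabilists' Hermite polynomials from NumPy. The quadratic `model` is a toy stand-in for a simulation with a standard-normal input; the moment formulas follow from Hermite orthogonality, E[He_j He_k] = k! δ_jk:

```python
# 1-D polynomial chaos sketch: fit a Hermite expansion by least squares,
# then read off the mean and variance analytically from the coefficients.
import numpy as np
from numpy.polynomial import hermite_e as He
from math import factorial

def model(x):
    return x**2 + 2 * x + 1           # toy "simulation" of a N(0,1) input

rng = np.random.default_rng(1)
xi = rng.standard_normal(2000)        # samples of the standard-normal input
y = model(xi)

# Design matrix of He_0 .. He_4 evaluated at the samples
degree = 4
Phi = np.column_stack(
    [He.hermeval(xi, np.eye(degree + 1)[k]) for k in range(degree + 1)]
)
coef, *_ = np.linalg.lstsq(Phi, y, rcond=None)

# Orthogonality gives the moments directly, no Monte Carlo needed
mean = coef[0]
variance = sum(coef[k] ** 2 * factorial(k) for k in range(1, degree + 1))
```

For this toy model the exact mean is 2 and the exact variance is 6, and the fitted coefficients recover both directly.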
4. Other Methods
- Sparse grids: For low-dimensional problems with expensive simulations
- Random forests / gradient boosting: For tabular data with non-smooth responses
- Support vector machines: For smaller datasets with clear margins
Building a Surrogate: The Workflow
Creating a trustworthy surrogate involves several critical steps [7]:
Step 1: Define the Input Space
Specify the range and distribution of input parameters you want the surrogate to cover. Be realistic—surrogates cannot reliably extrapolate far beyond the training data region.
Common mistake: Defining an overly broad input space “just in case.” Surrogates interpolate; they don’t extrapolate well. It is better to train multiple specialized surrogates than one general model that guesses wildly outside its training region.
Step 2: Generate Training Data
Run the high-fidelity simulation at a set of sampled input points. The quality and coverage of this dataset determine the surrogate’s accuracy.
Sampling strategies:
- Latin hypercube sampling (LHS): Good space-filling properties for moderate dimensions
- Sobol sequences: Low-discrepancy sequences for uniform coverage
- Active learning: Iteratively add points where the surrogate is most uncertain (reduces total simulations needed)
Warning: Training data generation is often the bottleneck—it’s expensive! Active learning can reduce the number of required simulations by 50% or more [8].
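Latin hypercube sampling is a few lines with SciPy's QMC module. The three parameter names and ranges below are illustrative placeholders:

```python
# Latin hypercube design scaled to physical parameter ranges.
import numpy as np
from scipy.stats import qmc

sampler = qmc.LatinHypercube(d=3, seed=42)   # 3 input parameters
unit_samples = sampler.random(n=50)          # 50 points in [0, 1]^3

# Scale to physical ranges, e.g. diffusivity, source strength, length
lower = [1e-6, 0.0, 0.1]
upper = [1e-4, 10.0, 1.0]
X_train = qmc.scale(unit_samples, lower, upper)
```

Each of the 50 rows would then be one high-fidelity simulation run; by construction, every parameter is stratified into 50 equal bins with exactly one sample per bin.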
Step 3: Choose and Train the Model
Select a surrogate type based on your problem size, data volume, and need for uncertainty estimates. Train on the dataset, using cross-validation to tune hyperparameters.
Key considerations:
- Normalize inputs and outputs
- Split data into training/validation/test sets
- Monitor for overfitting (perfect training accuracy, poor validation)
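These considerations can be sketched with scikit-learn: putting the scaler inside a pipeline keeps normalization within each cross-validation fold (avoiding leakage), and the held-out test split is scored only once. The synthetic data and kernel-ridge model are illustrative stand-ins for real simulation data and a real surrogate:

```python
# Normalize / split / tune pattern with leakage-safe cross-validation.
import numpy as np
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.kernel_ridge import KernelRidge

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 2))
y = np.sin(3 * X[:, 0]) * np.cos(2 * X[:, 1])   # toy simulation response

# Held-out test set: never touched during tuning
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Pipeline keeps scaling inside each cross-validation fold
model = make_pipeline(StandardScaler(), KernelRidge(kernel="rbf"))
search = GridSearchCV(
    model, {"kernelridge__alpha": [1e-3, 1e-2, 1e-1]}, cv=5
)
search.fit(X_train, y_train)
test_score = search.score(X_test, y_test)       # unbiased R² estimate
```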
Step 4: Validate the Surrogate
Never trust a surrogate without rigorous validation [9]. Use metrics like:
- R² score (coefficient of determination)
- Root mean squared error (RMSE)
- Mean absolute error (MAE)
- Maximum absolute error (to catch catastrophic failures)
Also, plot residuals and prediction vs. actual values. Look for systematic bias or heteroscedasticity.
Essential practice: Keep a held-out test set of simulation runs that were never used during training or hyperparameter tuning. This gives an unbiased estimate of real-world performance.
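Assuming held-out truths and surrogate predictions are available as arrays, all four metrics are one-liners; the numbers below are illustrative:

```python
# The validation metrics above, computed with scikit-learn.
import numpy as np
from sklearn.metrics import r2_score, mean_squared_error, mean_absolute_error

y_true = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # held-out simulation outputs
y_pred = np.array([1.1, 1.9, 3.2, 3.8, 5.1])   # surrogate predictions

r2 = r2_score(y_true, y_pred)
rmse = np.sqrt(mean_squared_error(y_true, y_pred))
mae = mean_absolute_error(y_true, y_pred)
max_err = np.max(np.abs(y_true - y_pred))      # flags catastrophic failures
```

Report the maximum error alongside the averages: a surrogate with excellent RMSE can still hide one prediction that is badly wrong.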
Step 5: Deploy with Caution
Once validated, the surrogate can be used for rapid exploration. However:
- Never use a surrogate outside its validated input domain
- Always verify critical predictions with a direct simulation (especially for final designs)
- Monitor degradation over time as the underlying system may drift
Uncertainty Quantification: Why and How
Surrogate predictions without uncertainty bounds are dangerous—they look precise but may be wildly wrong [10]. Uncertainty quantification (UQ) tells you how much you should trust each prediction.
Sources of Uncertainty
- Aleatory uncertainty: Inherent randomness in inputs (e.g., material properties within tolerance ranges)
- Epistemic uncertainty: Lack of knowledge due to limited training data; reduces as you add more data
A good surrogate model quantifies both. Gaussian processes provide this naturally via predictive variance. For neural networks, you can use:
- Monte Carlo dropout: Multiple forward passes with dropout enabled to get predictive distribution
- Deep ensembles: Train multiple networks and use prediction variance
- Conformal prediction: Distribution-free uncertainty guarantees [11]
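A deep ensemble is the simplest of these to sketch: train several small networks from different random initializations and use their disagreement as an epistemic-uncertainty signal. The toy data and scikit-learn `MLPRegressor` here stand in for a real surrogate network:

```python
# Deep-ensemble sketch: prediction spread across independently
# initialized networks serves as an uncertainty estimate.
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X_train = rng.uniform(-1, 1, size=(200, 1))
y_train = np.sin(4 * X_train).ravel()

ensemble = []
for seed in range(5):
    net = MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=2000,
                       random_state=seed)
    net.fit(X_train, y_train)
    ensemble.append(net)

X_new = np.array([[0.0], [0.9], [3.0]])   # last point is far outside training
preds = np.stack([net.predict(X_new) for net in ensemble])
mean = preds.mean(axis=0)
std = preds.std(axis=0)                   # disagreement = uncertainty
```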
Active Learning with Uncertainty
Active learning uses the surrogate’s own uncertainty to decide where to run the next expensive simulation [12]. The general strategy:
- Train initial surrogate on small dataset
- Identify regions where prediction uncertainty is highest (or where failures are likely)
- Run simulations at those points
- Retrain with augmented dataset
- Repeat until uncertainty acceptable or budget exhausted
This approach can achieve the same accuracy with far fewer simulations than uniform sampling.
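The loop above can be sketched with a GP surrogate choosing points from a candidate pool by maximum predictive standard deviation. Here `expensive_simulation` is a cheap stand-in for a real solver, and the budget of ten added runs is arbitrary:

```python
# Uncertainty-driven active learning: train, find the most uncertain
# candidate, simulate there, retrain, repeat until the budget runs out.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

def expensive_simulation(x):
    return np.sin(5 * x).ravel()

rng = np.random.default_rng(0)
X = rng.uniform(0, 2, size=(5, 1))        # small initial design
y = expensive_simulation(X)
candidates = np.linspace(0, 2, 200).reshape(-1, 1)

gp = GaussianProcessRegressor(kernel=RBF(length_scale=0.3), normalize_y=True)
for _ in range(10):                       # budget of 10 extra runs
    gp.fit(X, y)
    _, std = gp.predict(candidates, return_std=True)
    x_next = candidates[[np.argmax(std)]]  # most uncertain candidate
    X = np.vstack([X, x_next])
    y = np.append(y, expensive_simulation(x_next))
gp.fit(X, y)
```

Because the predictive standard deviation collapses at sampled points, the loop naturally spreads new runs into the regions the surrogate knows least about.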
Common Pitfalls and How to Avoid Them
Based on the literature and practical experience [13], here are the most common mistakes:
1. Insufficient or Poor-Quality Training Data
Problem: Too few simulation runs, or samples clustered in one region, leaving large gaps in input space.
Solution: Use space-filling designs (LHS, Sobol) and assess coverage. Active learning can help target sparse regions. As a rule of thumb, aim for at least 10× more training points than input dimensions, though this varies.
2. Extrapolation Beyond Training Domain
Problem: Using the surrogate for input combinations far from any training point.
Solution: Define clear operational boundaries. Implement “out-of-distribution” detection (e.g., distance to nearest training point, prediction variance thresholds). Return a warning or refuse to predict when inputs are outside the validated range.
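A minimal distance-based guard looks like this; the threshold of 3× the median nearest-neighbor spacing is an illustrative heuristic, not a standard:

```python
# Out-of-distribution guard: refuse queries far from all training points.
import numpy as np
from scipy.spatial import cKDTree

rng = np.random.default_rng(0)
X_train = rng.uniform(0, 1, size=(100, 3))   # stand-in training inputs
tree = cKDTree(X_train)

# Typical nearest-neighbor spacing inside the training cloud
d_train, _ = tree.query(X_train, k=2)        # k=2: first hit is the point itself
threshold = 3 * np.median(d_train[:, 1])

def in_domain(x):
    """Return True if the query is within the validated region (heuristic)."""
    d, _ = tree.query(np.atleast_2d(x), k=1)
    return bool(d[0] <= threshold)
```

In production you would combine this with the surrogate's own predictive variance and log or reject any query that fails the check.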
3. Ignoring Uncertainty Quantification
Problem: Treating surrogate predictions as ground truth.
Solution: Always report prediction intervals. In optimization, use conservative criteria (e.g., lower confidence bound) to avoid false confidence.
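For a maximization problem, a conservative lower-confidence-bound criterion is only a few lines; the numbers and κ = 2 below are illustrative:

```python
# Rank candidates by mean minus kappa * std instead of the mean alone,
# so high-uncertainty predictions are penalized.
import numpy as np

mean = np.array([0.80, 0.95, 0.90])   # surrogate predictions (maximizing)
std = np.array([0.02, 0.30, 0.05])    # prediction uncertainties
kappa = 2.0

lcb = mean - kappa * std              # conservative score
best = int(np.argmax(lcb))            # prefers the safe 0.90 over the risky 0.95
```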
4. Overfitting the Training Data
Problem: Surrogate memorizes training points but fails to generalize.
Solution: Use cross-validation, regularization, and simple models where possible. For neural networks, use dropout, weight decay, and early stopping. Monitor validation error, not just training error.
5. Neglecting Model Maintenance
Problem: Training once and forgetting, even as the underlying simulation code or physics changes.
Solution: Treat surrogates like any research software—track versions, retrain when simulation updates occur, and document data sources. See our guide on tracking long-term technical debt in research software for best practices.
Tools and Frameworks
Several open-source libraries support surrogate modeling and UQ [14]:
- OpenTURNS: A comprehensive library for uncertainty analysis and meta-modeling (C++/Python)
- UQlab: MATLAB-based framework for UQ and surrogate modeling
- scikit-learn: Python library with Gaussian processes, random forests, and more
- GPy / GPyTorch: Gaussian process frameworks for Python
- PyTorch / TensorFlow: For custom neural network surrogates
- UQ4CFD: Specialized platform for CFD uncertainty quantification
For FiPy users, you can integrate surrogates at the workflow level: use the surrogate to screen many parameter combinations, then run FiPy only on the most promising candidates.
Integration with Simulation Workflows
Surrogates don’t replace simulations; they complement them. Typical integration patterns:
- Pre-screening: Quickly eliminate poor designs before committing to expensive simulation
- Optimization loops: Use surrogate as inexpensive objective function; periodically validate with direct simulation
- Uncertainty propagation: Run surrogate thousands of times to assess output distributions
- Real-time control: Deploy surrogate in embedded systems or interactive tools
Example: In materials science, you might train a surrogate to predict phase-field evolution given material parameters. Use it to explore parameter space rapidly, then verify optimal points with full FiPy simulations.
Case Study: Metamaterial Design with Active Learning
A compelling demonstration comes from metamaterial design [8]. Researchers needed to optimize the band structure of a photonic crystal—a task requiring hundreds of expensive electromagnetic simulations. They used a Gaussian process surrogate with active learning:
- Started with 50 simulation runs
- Used the GP’s predictive variance to identify where additional data would most reduce uncertainty
- Ran 150 more targeted simulations
- Achieved accurate surrogate after only 200 total runs (vs. 500+ with uniform sampling)
- Used surrogate to explore 10,000+ design candidates instantly
The result: discovery of novel band-gap configurations that would have been infeasible with direct simulation alone.
Related Guides
- Managing Large-Scale PDE Problems – When simulations become expensive enough to need surrogates
- From Equations to Simulations – The full modeling workflow
- Code Coupling with preCICE – Multi-physics contexts where surrogates can accelerate coupling
- Visualizing Simulation Results Effectively – Presenting surrogate predictions and uncertainty
- Reproducibility and Its Role in Debugging – Maintaining trustworthy surrogate models
Conclusion and Next Steps
Machine learning surrogates are powerful tools for accelerating scientific discovery and engineering design. They transform computationally prohibitive problems into tractable ones by learning the mapping from inputs to outputs from a limited set of high-fidelity simulations.
Key takeaways:
- Use surrogates when you need many fast evaluations; stick with direct simulation for final validation
- Choose model type based on data size, dimensionality, and need for uncertainty estimates
- Never skip rigorous validation and uncertainty quantification
- Treat surrogates as living components that require maintenance as simulations evolve
When to avoid surrogates: If your simulation is already fast (a few seconds or less per run), you have no budget for generating training data, or you’re exploring entirely new physical regimes where extrapolation would be unreliable.
Ready to accelerate your simulations? MatForge offers consultation on building trustworthy surrogate models tailored to your specific PDE-based workflows. Contact us to discuss your project.
References
[1] Forrester, A., Sobester, A., & Keane, A. (2008). Engineering Design via Surrogate Modelling: A Practical Guide. Wiley.
[2] “Machine Learning Surrogates vs Direct Simulation.” AI Overview, Google Search, 2025.
[3] Franco, N.R., et al. (2023). “Deep learning-based surrogate models for parametrized PDEs: Handling geometric variability through graph neural networks.” Chaos: An Interdisciplinary Journal of Nonlinear Science, 33(12). https://pubs.aip.org/aip/cha/article/33/12/123121/2929382/Deep-learning-based-surrogate-models-for
[4] Marrel, A., et al. (2024). “Probabilistic surrogate modeling by Gaussian process.” Journal of Quality Technology. https://www.sciencedirect.com/science/article/abs/pii/S0951832024001686
[5] Chen, X., et al. (2021). “An improved data-free surrogate model for solving partial differential equations.” Scientific Reports, 11, 19837. https://www.nature.com/articles/s41598-021-99037-x
[6] Sudret, B. (2023). “Surrogate models for uncertainty quantification: An overview.” Simtech, University of Stuttgart. https://www.simtech.uni-stuttgart.de/exc/research/pn/pn6/pn6-2/
[7] Gopakumar, V., et al. (2024). “Uncertainty Quantification of Surrogate Models using Conformal Prediction.” arXiv:2408.09881. https://arxiv.org/abs/2408.09881
[8] Pestourie, R., et al. (2020). “Active learning of deep surrogates for PDEs: application to inverse and reliability problems.” npj Computational Materials, 6, 149. https://www.nature.com/articles/s41524-020-00431-2
[9] Zahura, F.T., et al. (2020). “Training Machine Learning Surrogate Models From a High-Fidelity Physics-Based Model.” Water Resources Research, 56(6). https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2019WR027038
[10] Rios, T., et al. (2024). “Large Language Model-assisted Surrogate Modeling.” Honda Research Institute. https://www.honda-ri.de/pubs/pdf/5711.pdf
[11] Lones, M.A. (2024). “Avoiding common machine learning pitfalls.” Journal of Machine Learning Research, 25(89). https://pmc.ncbi.nlm.nih.gov/articles/PMC11573893/
[12] “Uncertainty Quantification and Machine Learning Surrogates.” Ruhr-Universität Bochum, 2024. https://www.subsurf.ruhr-uni-bochum.de/sfe/mam/se-o-17_uncertainty_quantification_in_fe_analyses_with_surrogate_modeling.pdf
[13] Nguyen, B.D., et al. (2024). “Efficient surrogate models for materials science simulations.” Materials & Design, 235. https://www.sciencedirect.com/science/article/pii/S2666827024000203
[14] “Uncertainty Quantification for Scientific Computing.” Geilo Winter School, ETH Zürich, 2026. https://www.sintef.no/projectweb/geilowinterschool/2026-uncertainty-quantification/