← Back
Mixed-ligand ruthenium(II) complexes capable of hydrogen-bonding interactions with DNA: DNA binding, nuclease activity, cytotoxicity, and topoisomerase inhibition
Scientific Reports
https://doi.org/10.1038/s41598-025-32975-y
Article in Press
Monitoring of ultra-high performance concrete
manufacturing for reproducible quality and
waste reduction
Farzad Rezazadeh, Amin Abrishambaf, Gregor Zimmermann & Andreas Kroll
Received: 7 September 2025
Accepted: 15 December 2025
Cite this article as: Rezazadeh F.,
Abrishambaf A., Zimmermann G. et al.
Monitoring of ultra-high performance
concrete manufacturing for
reproducible quality and waste
reduction. Sci Rep (2025). https://doi.
org/10.1038/s41598-025-32975-y
A
We are providing an unedited version of this manuscript to give early access to its
findings. Before final publication, the manuscript will undergo further editing. Please
note there may be errors present which affect the content, and all legal disclaimers
apply.
E
R
P
S
S
If this paper is publishing under a Transparent Peer Review model then Peer
Review reports will publish with the final article.
I
T
R
E
L
C
IN
© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing,
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source,
provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included
in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative
Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission
directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
ARTICLE IN PRESS
Monitoring of ultra-high performance concrete
manufacturing for reproducible quality and waste
reduction
Farzad Rezazadeh1,* , Amin Abrishambaf2 , Gregor Zimmermann3 , and Andreas Kroll1
1 Department of Measurement and Control Engineering, University of Kassel, Kassel, 34125, Germany
2 QuantumFusion GmbH, Waldeck, 34513, Germany
3 MAITERIA UG, Kassel, 34131, Germany
* farzad.rezazadeh@mrt.uni-kassel.de
ABSTRACT
Ultra-high performance concrete (UHPC) combines exceptional strength and durability, yet its industrial production is hampered
by batch-to-batch variability that generates costly off-specification waste. Leveraging a 150-batch design-of-experiments dataset
based on systematic variations of a single reference UHPC mix, this study takes a holistic view of the UHPC manufacturing
chain and quantifies how fluctuations in raw material quality, storage conditions, dosing errors, mixer energy demand, and
curing regimes affect the 28-day compressive strength. Ten diverse machine learning algorithms are benchmarked; the
best-performing model explains ≥ 75 % of the strength variance with a prediction error ≤ 10 % under leave-one-out crossvalidation. SHapley Additive exPlanations reveal that long-term curing temperature and humidity dominate strength development,
followed by ingredient moisture and silica fume impurity. These insights are operationalized in an at-line, operator-in-the-loop
recommendation system that explores the curing envelope and proposes end-of-mix, batch-specific adjustments before curing
starts. In five validation cases, curing adjustments rescued 5/5 underperforming batches, eliminating 75 m3 of off-specification
UHPC and – considering cement only with 600 kg/m3 and 15 m3 per batch of white Portland cement – avoided ≈ 41 t CO2 e
(cement-only; 0.913 kg CO2 e/kg, A1–A3). The framework therefore not only elucidates the main sources of UHPC quality
inconsistency but also provides a practical, data-driven tool to rescue off-specification products, minimize waste, and cut
associated CO2 emissions.
Introduction
I
T
AR
E
L
C
IN
S
S
E
R
P
Ultra-high performance concrete (UHPC) is an advanced cementitious composite with outstanding mechanical performance
and durability. Owing to a densely packed microstructure obtained by optimizing ultrafine particles and steel fibers1, 2 , UHPC
routinely achieves compressive strengths above 120 MPa and tensile strengths exceeding 5 MPa – several times those of
conventional concrete3–5 . The resulting low porosity affords exceptional resistance to water ingress, chemical attack, and
freeze–thaw cycling6–9 ; autogenous self-healing further prolongs service life10 .
These properties have propelled UHPC into a wide spectrum of demanding applications. In bridge construction, it enables
slender, lighter decks with extended design lives11 . Its flowability and early strength permit intricate architectural façades,
and its durability in aggressive environments suits marine and offshore structures12 . In repair and retrofit projects, UHPC
lowers maintenance and life-cycle costs13 . For instance, Russell and Graybeal4 report that, for decks, full-depth UHPC waffle
panels reduced slab dead load by ≈ 56 % compared with conventional concrete. In a short-span case study, a composite
timber–ultra-high performance fiber-reinforced concrete (UHPFRC) superstructure weighed 20,890 kg versus 63,422 kg for a
conventional reinforced concrete (RC) alternative (an ≈ 67 % reduction); the authors also note that UHPFRC decks typically
reduce deck weight by a factor of ∼ 3 relative to RC14 . At the component scale, designs of UHPC bridge piers achieved ≈
3.5–36.6 % reductions in concrete volume15, 16 . Regarding costs, a Federal Highway Administration (FHWA) life-cycle cost
(LCC) analysis for a signature suspension bridge found that a UHPC overlay rehabilitation option (Installation Strategy 2; ≈
3.75 in. partial-depth replacement) reaches break-even with new precast deck replacement at an actual overlay service life
of ≈ 24 years17 . In the same short-span study, life-cycle costs for the timber–UHPFRC solution were ≈ 30 % lower (the RC
alternative’s LCC was ≈ 43 % higher)14 . At the network scale, systematic UHPFRC interventions are projected to save up to
18.5 billion Swiss francs (CHF) over an 80-year horizon relative to demolition–reconstruction strategies18 . Accordingly, the
material is reshaping modern construction practice and supporting the transition to more sustainable infrastructure.
Although UHPC offers superior performance, its manufacture is both resource- and carbon-intensive. The material contains
a high cement content and demands tight process control; consequently, it is costly to produce and associated with elevated
ARTICLE IN PRESS
1) Raw material
storage conditions:
• Storage duration,
• Ingredient temperature,
• Ingredient moisture
4) Curing conditions 0–24 h:
• Plastic wrap or
cabinet at 90 % RH
• Temperature (°C)
Adjustment of early curing
conditions
Feedback to operator
2) Dosing error (DE) and
particle size distributions
(PSD): • Sand and filler
content (DE and PSD),
• Superplasticizer
content (DE)
UHPC process
model:
predicts 28-day
CS and FS
DAS
3) Mixing condition:
• Mixer power
consumption (kW)
Recommendation
system
5) Curing conditions days 2–28:
• Plastic wrap
or water immersion
• Temperature (°C)
Adjustment of long-term
curing conditions
Figure 1. At-line end-of-mix recommendation system for UHPC production. The data acquisition system ingests process data
into a predictive process model and a recommendation engine, which jointly monitor the current batch and propose pre-curing
adjustments to keep the predicted 28-day strength within specification. Feedback is given to the operator (operator-in-the-loop).
(CS = compressive strength; FS = flexural strength)
S
S
E
R
P
CO2 emissions. Moreover, steel fibers and the high dosage of superplasticizer commonly required in UHPC also contribute to
higher cost and embodied carbon19 . A persistent industrial challenge is achieving batch-to-batch reproducibility: even when
the same reference mix is followed, slight deviations in process conditions can yield off-specification (off-spec) batches that
must be scrapped, thereby compounding costs and carbon footprint20 . These realities highlight the need for a decision-support
system that allows operators to adjust processing parameters, predict performance, and uphold quality, ultimately reducing
waste, lowering costs, and ensuring consistent high-performance output.
Accurately forecasting 28-day mechanical properties further complicates UHPC production. Conventional empirical
relationships have advanced concrete science, yet they cannot capture the multifactorial chemical and physical interactions
that govern high-performance concretes21 . Accordingly, more sophisticated, data-driven modeling approaches are required to
navigate the complex variable space and deliver reliable property predictions22, 23 .
Recent studies highlight the promise of machine learning (ML) techniques for predicting the properties of cementitious
materials23–27 . Ling et al.28 reported that support vector regression (SVR) surpassed artificial neural networks (ANN) and
decision trees (DT) in capturing environmental effects on concrete strength, whereas Hoang and Pham29 identified Gaussian
process regression (GPR) as superior to both ANN and SVR. Nguyen et al.30 further showed that extreme gradient boosting
(XGB)31 accurately predicts UHPC compressive strength, expediting mixture development at lower cost. However, most prior
ML-assisted UHPC studies stop at prediction or mix-design exploration; they do not deliver at-line recommendations that can
adjust processing parameters for the current batch before curing begins.
Despite these advances, ML deployment in UHPC modeling faces notable obstacles23, 32 . Chief among them is data sparsity:
UHPC experiments are expensive and time-consuming, limiting dataset size and risking model overfitting. Existing datasets –
such as those compiled by Yeh33, 34 – aggregate high-performance concrete (HPC) data from diverse sources and have been
widely used in studies. However, data collected from disparate sources risk leading to redundant experiments and introduce
inconsistencies in material quality and production conditions, which may undermine the reliability of predictive models, even
when large datasets are employed. A practical remedy is to generate high-quality data systematically. Design of Experiments
(DoE) frameworks maximize information gain per trial and thus support reliable model training while minimizing laboratory
effort35 .
This study tackles the persistent challenge of delivering consistent UHPC quality at industrial scale. A comprehensive
dataset comprising 150 mixtures, created under controlled conditions to establish a high-quality data foundation36 , is employed.
To emulate real-world variability, the experimental design systematically perturbed six uncertainty sources – raw material
storage, particle size distribution, dosing errors, mixer power demand, and early- and late-age curing regimes – while holding
the reference mix proportions constant36 . Thus, the machine learning models developed here are specific to this single UHPC
formulation and are not universal across recipes.
To maximize the available data for modeling while ensuring reliable evaluation on unseen data, a leave-one-out cross-
I
T
AR
E
L
C
IN
ARTICLE IN PRESS
validation (LOOCV)37 approach, with varied model initializations and hyperparameter-optimization runs in each fold, is applied
across ten ML algorithms. The best-performing model (BPM) is embedded in a recommendation engine (Fig. 1) that monitors
incoming batches and proposes curing adjustments to keep strength on target.
The results advance process understanding on two fronts. First, outdoor storage exposes aggregates and binders to
uncontrolled temperature and moisture swings, which translate directly into strength variability. Second, curing temperature and
humidity dominate mechanical performance, especially under the seasonal and geographic variations typical of field production.
Collectively, these insights inform data-driven control strategies that minimize waste, stabilize batch quality, and reduce the
embodied CO2 in UHPC production.
This study (i) assembles a targeted 150-batch dataset that perturbs some source of uncertainty while holding the base UHPC
recipe fixed; (ii) benchmarks ten ML models with nested cross-validation; (iii) uses SHapley Additive exPlanations (SHAP) to
reveal the most influential factors; (iv) demonstrates practical impact by rescuing five underperforming batches to 118–122 MPa
via model-recommended curing adjustments; and, crucially, (v) implements at-line, operator-in-the-loop recommendation
engine that exhaustively searches the curing envelope (3,844 set-points) and returns a plan in ∼30 s.
Methods
Materials and Experiments
The dataset used in this study is created using a DoE framework that maximized input-space coverage and efficiently revealed
the factors influencing UHPC performance36 . First, 50 experiments were conducted using a Taguchi L50 orthogonal array for
factor screening35 . Next, the design space from phase one is augmented with Latin hypercube sampling (LHS) to place another
100 experiments between existing data points and improve uniform coverage of the input space38, 39 . This methodical approach
produced 150 well-balanced UHPC batches that capture realistic industrial variability for subsequent analysis20, 36, 40 .
All experiments employed a fixed reference UHPC mixture formulated for façade panel applications, composed of white
Portland cement (600 kg/m3 ), silica fume (100 kg/m3 ), quartz fillers (450 kg/m3 combined), two silica-based sands (1,100
kg/m3 combined), water (195 kg/m3 ), and a polycarboxylate-based superplasticizer (21 kg/m3 ) to yield 15 m3 of UHPC product.
The fixed reference recipe is designed to yield a UHPC product with a final compressive strength of 120 MPa. The cement
used is 42.5R white Portland type I. Silica fume with over 95 % SiO2 content enhances the mix’s density and reduces porosity,
contributing significantly to its mechanical properties and durability. Quartz powder, with a silicon oxide content of 99.6 % and
a d50 of 27 µm, serves as filler type I. Filler type II is an add-on developed to further improve workability and enhance strength.
Two grades of silica-based sands are included: one with a d50 of 0.32 mm and another with a d50 of 0.94 mm, both with a purity
of 99.2 % SiO2 . This combination of sand grades optimizes particle packing and minimizes voids, ensuring a strong and dense
UHPC structure. Controlled variations were systematically introduced to mimic realistic industrial conditions, explore their
impacts on UHPC properties, and understand causal effects in the reproducibility challenges of UHPC even when a reference
recipe is followed.
The factors investigated are listed in Table 1. Moisture content (IM) and temperature (IT) of the raw materials were varied
to replicate differences caused by storage environments. Typically, such materials in the concrete industry are stored outdoors,
which causes variations in temperature and humidity at the moment of use. Graphite (GRP) content in silica fume represents
impurities in this fine material. The study also considers the impact of the particle size distribution of sands (SAI, SAII) and
fillers (FLI, FLII), along with measurement errors in dosing sand, filler, and superplasticizer (SPP). Average mixer power
(APW) during the mixing process is recorded as an informative variable for predicting final UHPC quality, which may be
affected by these variations in raw materials.
Finally, curing regimes – early age (day 1) and late age (days 2–28) – with temperature profiles of 10–40 °C and conditions
such as plastic wrapping, water immersion, or exposure to air at high relative humidity were examined to understand the effect
of seasonal and daily environmental conditions on the final UHPC product.
E
L
C
IN
S
S
E
R
P
I
T
AR
Data-driven Modeling of the UHPC Production Process
To model the UHPC manufacturing process, a diverse set of ten ML algorithms is employed. The algorithms used are multiple
linear regression (MLR)41 , partial least squares (PLS)41, 42 , kernel ridge regression (KRR)41, 43 , k-nearest neighbors (KNN)41, 44 ,
support vector regression (SVR)41, 45 , decision trees (DT)41, 45 , random forest (RF)41, 46 , gradient boosting (GB)41, 47 , extreme
gradient boosting (XGB)31 , and Gaussian process regression (GPR)41, 48 . Hyperparameters for these models are optimized
using an evolutionary algorithm49 . These algorithms, which are typically appropriate for modeling small datasets, are widely
applied in industrial prediction tasks50 , particularly for forecasting the mechanical properties of concrete51 .
The selected modeling approaches are grouped into three broad categories: linear models, nonlinear single models, and
nonlinear ensemble models. A linear relationship between the inputs and the output is assumed by linear models. MLR
is included as a baseline; PLS is used as a linear latent-factor method effective for handling correlated features or sparse
data in small datasets; and KRR is employed, extending ordinary ridge regression with a nonlinear kernel to better tackle
ARTICLE IN PRESS
Table 1. The dataset (X ∈ R13×150 ) is based on a primary ultra-high performance concrete recipe and includes variations in
material quality, potential measurement errors, mixer power consumption, and curing conditions. CC1 comprises two curing
classes: specimens stored at 90 % relative humidity (Class = 1) or encased in plastic wrap (Class = 2); CC28 likewise
comprises two classes: specimens encased in plastic wrap (Class = 1) or submerged in water (Class = 2). The cement and silica
fume contents remain constant in all experiments. The reference UHPC recipe is designed to yield a 28-day compressive
strength of 120 MPa; the observed ranges (85–136 MPa for CS28 and 8–25 MPa for FS28) demonstrate that reproducibility
cannot be guaranteed even when identical mix proportions are followed.
Group
Factor name
Var.
Unit
Mean
Median
SD
Min.
Max.
Raw materials
Ingredient moisture
Ingredient/water temperature
Graphite
IM
IT
GRP
kg
°C
kg
3.13
24.20
0.08
3.15
25
0.09
0.16
9.05
0.07
2.92
10
0
3.36
40
0.22
Particle size distributions
and measurement errors
Sand I
Sand II
Filler I
Filler II
Superplasticizer
SAI
SAII
FLI
FLII
SPP
kg
kg
kg
kg
kg
5.98
10.53
6.00
0.75
0.32
6.00
10.50
6.00
0.75
0.32
0.59
1.04
0.59
0.07
0.02
5.10
8.92
5.10
0.63
0.29
6.90
12.07
6.90
0.86
0.35
Mixing condition
Average power consumption
APW
kW
1.04
1.06
0.19
0.36
1.40
Curing
conditions
Curing temperature, day 1
Curing class, day 1
Curing temperature, days 2–28
Curing class, days 2–28
CT1
CC1
CT28
CC28
°C
Class
°C
Class
24.60
–
22.15
–
20
–
20
–
9.72
–
9.38
–
10
1
10
1
40
2
40
2
Outputs
Compressive strength at day 28
Flexural strength at day 28
CS28
FS28
MPa
MPa
109.83
16.98
110.57
17.29
11.83
3.60
85.06
8.15
135.71
24.57
E
L
C
IN
S
S
E
R
P
high-dimensional data with limited observations. These linear models are high-bias (low-variance) estimators and thus serve as
baseline benchmarks for predictive performance.
I
T
AR
Nonlinear single models are stand-alone algorithms capable of capturing nonlinear patterns. In this category, SVR and GPR
are included; both are kernel-based methods adept at modeling complex functions – notably, Bayesian priors are leveraged
by GPR, which tends to perform well even on small datasets. Additionally, KNN – a simple non-parametric approach that
makes predictions based on the local neighborhood of a query point in the feature space – and DT modeling, which provides an
interpretable rule-based model that captures nonlinear feature interactions (and also serves as a baseline for tree-based ensemble
methods), are included. These single-model approaches were selected to represent a diverse range of learning paradigms –
from instance-based learning (KNN) to kernel-based learning (SVR and GPR) and rule-based learning (DT) – covering both
parametric and non-parametric techniques.
In nonlinear ensemble models, multiple base learners are combined to improve generalization, especially when dealing
with small datasets52 . In this study, RF is included from the bagging family, which constructs an ensemble of decision trees
on bootstrapped subsets of the training data to reduce variance and better handle high-dimensional feature spaces. From the
boosting family, GB and its advanced variant, XGB, are employed; an ensemble of decision trees is trained sequentially by
these algorithms, with each successive tree correcting errors made by the model thus far52 . These algorithms were selected due
to their demonstrated success in structured regression tasks – particularly in the context of concrete production applications53, 54 .
By evaluating multiple ensemble approaches, the performance gains of more complex, high-capacity models relative to simpler
models can be observed.
To ensure reliable model development and evaluation on unseen data, a nested training-evaluation loop is implemented, as
illustrated in Fig. 2. In the outer loop, a LOOCV strategy is adopted whereby, in each iteration, all data points except one are
used for training, while the excluded data point is reserved as unseen data for the final evaluation of the model’s performance.
In the inner loop, prior to the final model training, the optimization algorithm employs a 10-fold cross-validation strategy
on the training data to determine the optimal hyperparameters. Subsequently, the model is built using the complete training
dataset with these optimized hyperparameters. The procedure is repeated for every data point, each time using independent
model initializations and separate hyperparameter-optimization runs for all ten ML algorithms. Ultimately, BPM for each
mechanical property is identified through comparative analysis. Model quality is assessed using three metrics calculated on
unseen data (collected from each iteration): coefficient of determination (R2 ) (1), mean absolute error (MAE) (2), and mean
ARTICLE IN PRESS
Outer loop: LOOCV
Compute the average
performance of the trained
models across N iterations
Iteration 1 → Fold 1 Fold 2 Fold 3
Iteration 2 → Fold 1 Fold 2 Fold 3
Iteration 3 → Fold 1 Fold 2 Fold 3
N = 139; K = 10
n = i = 0; k = j = 1
i, n ∈ N; j, k ∈ K
i≥N
Save prediction results
…
Fold N
Fold N
Fold N
…
F
T
Test Data: one data
point in fold n
in iteration i
Start
…
…
…
…
Iteration N → Fold 1 Fold 2 Fold 3
n=n+1
i=i+1
End
…
…
Save the best-performing
model (BPM)
…
…
Fold N
Training data:
data points in folds 1 to N,
excluding fold n in iteration i
Model training phase: train the
model with different random
initializations in each iteration i
Model evaluation phase
Trained
model
Inner loop: K-Fold CV
Training data
Iteration 1 → Fold 1 Fold 2 Fold 3
Iteration 2 → Fold 1 Fold 2 Fold 3
Iteration 3 → Fold 1 Fold 2 Fold 3
Model validation phase
…
…
Validation data: data points
in fold k in iteration j
…
Fold K
Fold K
Fold K
…
…
…
…
…
Iteration K → Fold 1 Fold 2 Fold 3
…
…
Fold K
E
L
C
I
T
AR
T
j≥K
F
S
S
E
R
P
Trained model with
optimized hyperparameters
in iteration j
Save prediction performance and
optimized hyperparameters in iteration j
k = k + 1; j = j + 1
Data points in folds 1 to K
for training, excluding
fold k in iteration j
IN
Training & hyperparameter
optimization with different
random initializations in
each iteration j
Best hyperparameters across K iterations
Figure 2. Model training and evaluation workflow. The outer loop uses leave-one-out cross-validation (LOOCV), holding out
one sample per iteration to estimate performance on unseen data. The inner loop performs 10-fold cross-validation on the
training set to optimize hyperparameters. Held-out predictions from all LOOCV folds are concatenated to compute the
coefficient of determination (R2 ), mean absolute error (MAE), and mean absolute percentage error (MAPE). (BPM:
best-performing model)
absolute percentage error (MAPE) (3):
∑Ni=1 (yi − ŷi )2
∑Ni=1 (yi − ȳ)2
(1)
1 N
∑ |yi − ŷi |
N i=1
(2)
100 % N yi − ŷi
∑ yi
N i=1
(3)
R2 = 1 −
MAE =
MAPE =
where yi is the i-th true value, ŷi is the corresponding predicted value, and N is the total number of data points.
ARTICLE IN PRESS
Feedback to operator
𝐶𝐶28 = 𝐶𝐶28 +1
𝐶𝑇28 = 𝐶𝑇28 +1
F
𝐶𝑇28 ≤ 5
T
𝐶𝑇1 = 1
𝐶𝑇1 ≤ 5
F
T
𝐶𝐶28 = 1
𝐶𝐶28 ≤ 2
F
T
𝐶𝐶1 = 1
𝐶𝐶1 ≤ 2
F
𝐶𝐶1 = 𝐶𝐶1+1
𝐶𝑇1 = 𝐶𝑇1+1
Best(𝐶𝑇28∗ , 𝐶𝑇1∗ , 𝐶𝐶28∗ , 𝐶𝐶1∗ )
T
𝐁𝐏𝐌(𝐶𝑇28, 𝐶𝑇1, 𝐶𝐶28, 𝐶𝐶1)
Recommendation system
Start
𝐶𝑇28 = 1
End
Adjustment of 28 days curing conditions
Final
product
E
L
C
IN
S
S
E
R
P
Fresh
UHPC
I
T
AR
Figure 3. Recommendation engine workflow for curing-setpoint selection. Based on the best-performing model (BPM), the
system forecasts the 28-day strength at the end of mixing and proposes batch-specific curing adjustments before curing starts to
keep the product within specification. Abbreviations for variables are defined in Table 1.
Model Interpretability
Model predictions are interpreted by means of SHAP55 , a game-theoretic framework that decomposes a prediction into additive
feature attributions. For a sample x with feature set F and M = |F|, the model output is represented as
M
f (x) = φ0 + ∑ φ j ,
(4)
j=1
where φ0 = EX [ f (X)] denotes the expected model output under a background distribution, and φ j is the Shapley value for
feature j, defined as the weighted average marginal contribution of j across all coalitions S ⊆ F \ { j}55, 56 ,
φj =
|S|! (M − |S| − 1)!
fS∪{ j} (xS∪{ j} ) − fS (xS ) .
M!
S⊆F\{ j}
∑
(5)
This attribution satisfies local accuracy, consistency, and missingness axioms55 . During the outer LOOCV, SHAP values are
computed only for held-out instances, using the fold-specific trained model and a background set drawn from the corresponding
training partition to avoid information leakage. For linear estimators, the linear SHAP formulation is used; for non-linear
estimators, the model-agnostic KernelSHAP estimator is employed.55 Global influence is summarized as the mean absolute
SHAP magnitude, mean(|φ j |), across all held-out predictions, whereas signed SHAP values are used to analyze the direction
and magnitude of batch-level effects in subsequent figures.
ARTICLE IN PRESS
Table 2. Batches excluded from modeling and their exclusion rationale (11 of 150)36 .
Experiment number
5, 17, 75, 99, 101
30
36
41
47
128
148
Rationale for exclusion
Human error attributable to inadvertent use of material(s) in different composition/volume
than the fixed reference recipe; labeled as an outlier.
Abnormal early–age curing with suppressed hydration led to unrealistically low 24 h
strengths relative to process settings; the batch is judged non-representative of the population and excluded.
Abnormally high entrained air together with poor flowability and prolonged efflux behavior indicated non-representative rheology; flagged by univariate checks and by the
multivariate consistency screening.
Inconsistent fresh–state and hardened measurements (aberrant CS28 and FS28) indicated
a procedural/testing anomaly; excluded as non-representative.
Physically inconsistent rheology (very large slump-flow concurrent with long V-funnel
time) indicated measurement instability; flagged by univariate checks and the multivariate
consistency screening.
Missing critical targets (CS28 and FS28); unusable for supervised learning without
imputation of outcomes; excluded by integrity check.
Unusually high 24 h strengths inconsistent with recorded process conditions, suggestive
of logging or testing irregularity; excluded as non-representative.
S
S
E
R
P
Model-Based Recommendation System for Adjusting Curing Conditions
Once BPM for each mechanical property is established, it is embedded in an at-line (end-of-mix), operator-in-the-loop
recommendation engine to safeguard the UHPC production line against off-spec output (illustrated in Fig. 3). In practice,
operators input the actual mix recipe and process parameters into the BPM, which immediately forecasts the 28-day strength at
the end of mixing – before any curing decisions become irreversible. If this initial prediction falls outside the 120 MPa target
specification, the engine automatically triggers an exhaustive nested grid search over the four adjustable curing parameters.
This model-based recommendation system can thus proactively suggest batch-specific curing adjustments to avoid potential
off-spec products or to improve suboptimal quality, thereby reducing UHPC waste.
The search itself is exhaustive in the strict sense: every feasible combination of the four curing variables – initial curing
temperature (CT1), initial curing condition class (CC1), final curing temperature (CT28), and final curing condition class
(CC28) – is systematically evaluated. All other input features (mix composition, etc.) remain fixed. The continuous temperature
parameters (CT1 and CT28) are scanned across 10–40 ◦ C in 1 ◦ C increments, while the categorical parameters CC1 and
CC28 are each tested at their two experimentally validated levels (see Table 1). In total, this yields a discrete search space of
31 × 2 × 31 × 2 = 3,844 possible curing condition combinations. For each candidate combination, the BPM predicts the 28-day
compressive strength; the algorithm then identifies the combination that minimizes the absolute deviation from the 120 MPa
target (i.e. the predicted strength closest to 120 MPa, meeting the design specification and preventing off-spec products).
Because every point in the search space is evaluated exactly once, this brute-force approach is immune to local minima and
requires no complex hyperparameter tuning – advantages that make it straightforward to implement in practice. Even with
thousands of evaluations, the computation remains fast: the full grid search completes in roughly 30 s on a standard plant
workstation, well within the normal production cycle time.
E
L
C
IN
I
T
AR
Results and Discussion
Data Preprocessing
The modeling inputs comprised 12 controllable factors along with the recorded average mixing power during mixing (APW),
while the 28-day mechanical strengths were designated as outputs (Table 2).
Before model development, eleven batches (Exp. 5, 17, 30, 36, 41, 47, 57, 99, 101, 128, 148) were excluded as outliers,
see Table 2. Exclusion followed a two–step procedure: (i) an integrity check for missing critical targets (CS28, FS28); (ii)
univariate plausibility screening of fresh–state and strength measurements against the dataset’s empirical ranges. The identified
outliers and the specific anomalies prompting their exclusion are summarized in Table 2. Eliminating these 11 irregular cases
yielded a refined dataset of 139 experiments for subsequent modeling.
Prior to model development, inter-feature dependencies are assessed via Pearson correlation analysis (Fig. 4) to mitigate
potential multicollinearity57 . The nearly perfect negative correlations observed between sand type I (SAI) and sand type II (SAII),
and between filler type I (FLI) and filler type II (FLII), highlight the delicate balance between these inputs. Such relationships
emphasize that the relative proportions fundamentally influence matrix packing density and, therefore, the microstructure of
,0
6$,
6$,,
)/,
)/,,
633
&7
&&
,7
*53
$3:
&7
&7
&&
,0
6$,
6$,,
)/,
)/,,
633
&7
&&
,7
*53
$3:
&&
ARTICLE IN PRESS
_ _
Figure 4. Correlation structure of the UHPC dataset batches). Left: heatmap of pairwise Pearson correlation coefficients (ρ);
right: bar plot of the row-wise sum of absolute correlations (|∑ ρ|), a scalar summary of how strongly each variable co-varies
with the remaining feature space. Lower |∑ ρ| indicates lower collinearity and greater uniqueness; features with the lowest
values – CT28, CC28, IM, GRP – retain comparatively unique information for subsequent modeling. Abbreviations for
variables are defined in Table 1.
IN
S
S
E
R
P
UHPC, which ultimately affects mechanical behavior. Certain pairs of highly correlated features – (SAI, SAII) and (FLI, FLII) –
are subsequently identified for removal from the feature pool to prevent redundancy prior to further analytical steps.
Distributions of all input and output variables after outlier exclusion and the correlation analysis are shown as violin plots of
z-scored values (each variable normalized to zero mean and unit variance) (Fig. 5). Each violin depicts the probability density
of the 139 recorded values for that feature, with horizontal lines marking the median and interquartile range (embedded within
the violin).
LOOCV is employed to robustly evaluate the predictive models on the final dataset (N = 139). In each LOOCV iteration, a
single batch is held out as the test case, and the remaining 138 batches are used for training the model. Prior to training, feature
values in the training fold are normalized to a common scale (0–1) using min–max scaling; the same scaling parameters are
then applied to normalize the held-out sample to ensure consistency. No information from the test batch is used in fitting the
scaler or the model. This process is repeated 139 times so that each batch is left out once. Thus, every observation is predicted
exactly once (using a model fitted to all other observations), and performance statistics are computed from the aggregate of
these held-out predictions.
E
L
C
I
T
AR
Evaluation of UHPC Models for Predicting Final Mechanical Properties
Table 3 summarizes the performance of ten ML models for predicting the mechanical properties of UHPC. For compressive
strength at day 28 (CS28), the PLS model exhibited the best overall performance, achieving the highest R2 of 74.89 %, the
lowest MAE of 4.88 MPa, and the lowest MAPE of 4.43 %. Notably, MLR, despite its simplicity, performed competitively with
an R2 of 74.54 %. For flexural strength at day 28 (FS28), SVR demonstrated superior predictive performance by attaining the
highest R2 (77.91 %). Furthermore, RF and XGB achieved comparable R2 values of 77.58 % and 77.10 %, respectively; RF
yielded the lowest MAPE (8.10 %), whereas XGB attained the lowest MAE (1.29 MPa).
Overall, the results indicate that PLS and MLR are well-suited for predicting CS28, whereas SVR, RF, GB, and XGB
are recommended for FS28 owing to their predictive capabilities and minimal error rates. In particular, the accuracy of PLS
makes it especially appealing for CS28 prediction, while SVR – because of its predictive performance and its simplicity of
implementation – is regarded as the most effective for FS28. Accordingly, PLS and SVR are selected as the BPMs for CS28
and FS28, respectively.
Confidence Interval Analysis of Best-Performing UHPC Models
Table 4 reports calibration and predictive performance with 95 % confidence intervals (CIs). For CS28 (PLS), calibration
is statistically indistinguishable from the identity line: slope b = 0.98 (95 % CI: [0.88, 1.07]), intercept a = 2.6 MPa (95 %
ARTICLE IN PRESS
=VFRUH
)6
&6
&&
&7
&&
&7
$3:
633
)/,
6$,
*53
,7
,0
Figure 5. Empirical distributions of inputs and outputs used in modeling. For each variable, the white dot marks the median,
the thick bar indicates the interquartile range, and the envelope shows a kernel-density estimate of the sample distribution.
Notably, 28-day compressive strengths range from 85 MPa to 136 MPa (mean 110 MPa), underscoring substantial
batch-to-batch variation even under a fixed recipe. Abbreviations for variables are defined in Table 1.
S
S
E
R
P
Table 3. Performance of ten machine-learning models for predicting 28-day UHPC strength under leave-one-out
cross-validation (LOOCV). Metrics are computed from held-out predictions across all folds: R2 (percent), mean absolute error
(MAE, MPa), and mean absolute percentage error (MAPE, percent) for compressive strength (CS28) and flexural strength
(FS28). Higher R2 and lower MAE / MAPE indicate better performance. Because all training data are drawn from a single
UHPC recipe with controlled perturbations, models are local to that mix. Best values per column are shown in bold.
E
L
C
R2 (%)
I
T
AR
Model
Abbr.
CS28
Multiple Linear Regression
Partial Least Squares
Kernel Ridge Regression
K-Nearest Neighbors
Support Vector Regression
Decision Tree
Random Forest
Gradient Boosting
Extreme Gradient Boosting
Gaussian Process Regression
MLR
PLS
KRR
KNN
SVR
DT
RF
GB
XGB
GPR
74.54
74.89
72.99
56.09
72.29
57.26
68.35
67.88
68.58
71.19
IN
MAE (MPa)
MAPE (%)
FS28
CS28
FS28
CS28
FS28
73.27
73.22
76.57
60.69
77.91
72.90
77.58
77.10
77.28
76.88
4.91
4.88
5.08
6.47
5.16
6.43
5.36
5.41
5.35
5.26
1.51
1.52
1.36
1.76
1.33
1.46
1.30
1.31
1.29
1.34
4.45
4.43
4.62
6.10
4.66
5.87
4.87
4.91
4.84
4.78
9.41
9.43
8.53
10.33
8.31
9.03
8.10
8.24
8.18
8.46
CI: [-7.9, –13.1] MPa), and bias ≈ 0 MPa (95 % CI: [-1.0, 1.0] MPa). Accuracy remained high and stable under resampling
(R2LOOCV = 0.749 [0.669, 0.807], MAE = 4.88 MPa [4.35, 5.44], MAPE = 4.43% [3.96, 4.94]), and the parametric 95 %
prediction interval for an individual batch at the mean level is ±11.7 MPa (about 10 % of 110 MPa). For FS28 (SVR), strong
LOOCV performance is observed (R2 = 0.779, MAE = 1.33 MPa, MAPE = 8.31%); the regression slope 0.85 (95 % CI: [0.78,
0.92]) and intercept 2.5 MPa (95 % CI: [1.3, 3.7] MPa) indicate mild compression at the extremes, while the mean bias remains
≈ 0 MPa (95 % CI: [-0.25, 0.31] MPa). Metric CIs were obtained by nonparametric bootstrap (10,000 resamples) applied to the
LOOCV held-out pairs.
Decoding UHPC Strength Variability through SHAP: Implications for Low-Waste, High-Quality Manufacturing
The global SHAP analysis of the studied dataset reveals that curing parameters dominate 28-day strength variability, followed
by ingredient moisture (IM) and impurity in silica fume (GRP); all other mix constituents are secondary (Fig. 6).
For both mechanical properties in the secondary curing phase, for example, for compressive strength, batches cured
ARTICLE IN PRESS
Table 4. Comparison of PLS (CS28) and SVR (FS28) with 95% confidence intervals (CIs).
PLS (CS28)
SVR (FS28)
Metric
Units
Value
95% CI
Value
95% CI
Regression slope (pred. vs. obs.)
Regression intercept
R2
MAE
MAPE
Prediction bias (pred−obs)
95% prediction intervala
—
MPa
%
MPa
%
MPa
MPa
0.98
2.6
74.9
4.88
4.43
≈ 0.0
±11.7
[0.88, 1.07]
[−7.9, 13.1]
[66.9, 80.7]
[4.35, 5.44]
[3.96, 4.94]
[−1.0, 1.0]
—
0.85
2.5
77.9
1.33
8.31
≈ 0.0
±3.3
[0.78, 0.92]
[1.3, 3.7]
[69.9, 83.7]
[1.17, 1.49]
[7.13, 9.30]
[−0.25, 0.31]
—
a Half-width about the point prediction for a single new observation at the mean level (parametric PI); relative half-widths
are ∼ ±10% for CS28 (mean ∼ 110 MPa) and ∼ ±19% for FS28 (mean ∼ 17 MPa).
at high temperature (e.g. 40 ◦ C instead of the reference 20 ◦ C) attained higher strengths – for CS28 the mean SHAP
at 40 ◦ C is +17.90 MPa vs. −1.99 MPa at 20 ◦ C (∆CT28 = +19.89 MPa); for FS28 it is +3.44 MPa vs. −0.21 MPa
(∆CT28 = +3.65 MPa), whereas curing at a cold 10 ◦ C resulted in lower strengths – for CS28 the mean SHAP at 10 ◦ C is
−11.93 MPa vs. −1.99 MPa at 20 ◦ C (∆CT28 = −9.94 MPa); for FS28 it is −3.03 MPa vs. −0.21 MPa (∆CT28 = −2.82 MPa)
(Fig. 7a). Mechanistically, insufficient curing heat leaves the microstructure under-developed (lower degree of hydration, higher
porosity), whereas elevated-temperature curing fosters more complete cement hydration and a denser matrix – greatly enhancing
strength. Across both 28-day strengths, CC28 is among the most influential variables, yet its practical effect points in opposite
directions (see CC28 in Fig. 7). Globally, its mean |SHAP| contribution is about 1.72 MPa for compressive strength (CS28,
see Fig. 6a) and 1.29 MPa for flexural strength (FS28, see Fig. 6b). At the level of the two curing classes, switching from
the reference plastic-wrap condition (class 2) to continuous water immersion (class 1) shows absolute mean SHAP values of
+5.61 MPa (class 1) vs. −0.99 MPa (class 2) for CS28 (∆CC28 = +6.60 MPa), and −4.24 MPa (class 1) vs. +0.74 MPa (class
2) for FS28 (∆CC28 = −4.98 MPa). This inversion arises because immersion (class 1) maintains full saturation that densifies
the matrix and boosts compressive capacity, while simultaneously softening the interfacial transition zone and fiber–matrix
bond, which depresses tensile-type (flexural) response. These findings confirm that the reference UHPC mix is highly sensitive
to curing regime – without strict control of curing temperature and moisture, the intended performance (∼ 120 MPa) cannot
be reliably attained. Indeed, the largest strength shortfalls are traced to suboptimal curing conditions, and adjusting those
conditions is shown to rescue the product’s strength in underperforming batches.
Notably, early-age curing conditions show a somewhat different effect (Fig. 7). Raising the curing temperature on day 1
(CT1) actually decreased the eventual 28-day strength when quantified against the reference level: for CS28, the mean SHAP
at 40 ◦ C is −5.30 MPa vs. +1.82 MPa at 20 ◦ C (∆CT1 = −7.12 MPa) (Fig. 7a); for FS28, it is −0.71 MPa vs. +0.28 MPa
(∆CT1 = −0.99 MPa), whereas a cooler initial cure (e.g. 10◦ C) yielded a slight strength improvement – for CS28, +5.38 MPa
vs. +1.82 MPa (∆CT1 = +3.56 MPa); for FS28, +0.31 MPa vs. +0.28 MPa (∆CT1 = +0.03 MPa) (Fig. 7b). This inverse
relationship suggests that overly rapid hydration in the first hours – due to high initial heat – may induce thermal stresses or an
unfavorable early microstructure that ultimately reduce strength, whereas a milder initial cure leads to a more refined matrix by
28 days. By contrast, the curing environment on day 1 (CC1, i.e. 90 % RH versus sealed curing) had only a negligible influence
on final strength (for CS28, −0.42 MPa at class 1 vs. +0.06 MPa at class 2; ∆CC1 = −0.48 MPa; for FS28, −0.43 MPa vs.
+0.07 MPa; ∆CC1 = −0.51 MPa), well inside normal scatter. As long as the fresh UHPC is prevented from drying out in the
first 24 h, additional humidity during that period offers little further benefit.
Notably, the two CC28 curing classes contribute SHAP values of virtually identical magnitude to both mechanical responses.
Given that the mean compressive strength at 28 days (CS28) is roughly (∼ 110 MPa), whereas the mean flexural strength at 28
days (FS28) is only about (∼ 17 MPa) (Table 1), an equal absolute contribution translates into a far larger relative change for
FS28.
The next major contributor to strength variability is the state of the raw materials – foremost, the initial moisture content
(IM) of the ingredients (Fig. 6). SHAP shows that if the cement, sand, and powders carry even slightly higher moisture (e.g.
3.364 kg vs. the 2.925 kg reference), the 28-day strength drops – for CS28, the absolute level values are −2.83 MPa (3.364 kg)
vs. +2.94 MPa (2.925 kg) with ∆IM = −5.77 MPa (Fig. 7a); for FS28, −0.41 MPa vs. +0.34 MPa with ∆IM = −0.75 MPa
(Fig. 7b). The extra free water raises the effective water-to-cement ratio, producing a more porous matrix and thus lower
strength. Conversely, very dry ingredients yielded a positive deviation (e.g. for CS28, +2.94 MPa at 2.925 kg), indicating
the reference mix is calibrated for relatively low moisture. This sensitivity confirms that the mix design is not forgiving
to uncontrolled raw material storage conditions – strict control of raw material dryness is needed to hit the strength target
I
T
AR
E
L
C
IN
S
S
E
R
P
ARTICLE IN PRESS
0HDQ_6+$39DOXH_
28
CT
2
CC 8
1
CT
28
CT
(a) Global SHAP importance (mean |SHAP|, MPa) for predicting
CS28 with the PLS model.
IM
CC
28
GR
P
IT
AP
W
SA
I
SP
P
CC
1
FL
1
CT
1
GR
P
IM
AP
W
SP
P
SA
I
FL
I
CC
IT
I
0HDQ_6+$39DOXH_
(b) Global SHAP importance (mean |SHAP|, MPa) for predicting
FS28 with the SVR model.
Figure 6. Global feature influence measured by mean absolute SHAP (|SHAP|, MPa) across all batches; larger bars indicate
stronger influence on the predicted property. Global influence is reported as mean(|φ j |)58 .
Abbreviations are defined in Table 1.
S
S
E
R
P
consistently.
Another critical material factor is ingredient purity, represented by the graphite content in silica fume (GRP). The reference
formulation assumes high-purity silica fume (0 % GRP); at this level a strength advantage is observed. As impurity levels rise
(simulating lower-grade silica fume), both mechanical properties systematically decline – for CS28, the absolute level value at
the highest GRP tested (0.225) is −2.56 MPa vs. +1.72 MPa at 0.000 (∆GRP = −4.29 MPa) (Fig. 7a); for FS28, −0.99 MPa
vs. +0.31 MPa (∆GRP = −1.30 MPa) (Fig. 7b). These impurities likely act as inert inclusions or interfere with pozzolanic
reactions, undermining the microstructure. Hence, adhering to nominal mix proportions is insufficient if raw material quality
varies; maintaining consistent ingredient quality is vital for consistent outcomes.
Variations in the particle size distribution and dosing of aggregates and admixtures from the reference also influence the
28-day properties. Given that SAII and FLII are removed from the feature pool because of their high correlation with SAI and
FLI, respectively; the sand and filler composition indices (SAI and FLI) represent the proportion of type I vs. type II sand and
filler in the mix, which affects packing density and thus strength. Deviation from the reference blend can be detrimental. When
SAI is pushed higher (favoring more of sand type I), the model output drops (SHAP turning negative at high SAI, see Fig. 7),
while a lower-than-reference SAI (more of sand type II) improves strength. This behavior is attributed to the fact that sand type
II, with its coarser grading and potentially more angular shape, improves the interlocking and particle packing in the granular
skeleton, thereby enhancing the compressive strength. Conversely, a higher fraction of finer sand (type I) can increase the paste
demand and reduce the packing efficiency, leading to higher porosity and lower strength. This suggests that the type II sand
(used more when SAI is low) provided better grading or particle shape, so reducing type I sand beyond the baseline benefited
the mix. Conversely for fillers, a greater fraction of filler type I (higher FLI than the reference) yielded higher strength, whereas
substituting more of filler type II (lower FLI) hurt performance. This trend reflects the superior space-filling capacity of filler
type I, which is finer and better able to occupy voids between cement grains and sand particles. Filler type II, while enhancing
workability, appears less effective in optimizing the particle packing density, resulting in a more porous matrix and diminished
mechanical performance. These trends highlight that maintaining a consistent particle size distribution and blend of sources
as in the reference mix is critical – otherwise the matrix packing efficiency changes, altering the microstructure and strength.
Such deviations in sand and filler grading commonly arise from batch-to-batch variation in raw material supply, measurement
inaccuracies during dosing, or insufficient precision in scale calibration.
In addition, accurate dosing of the chemical admixture is vital. The SHAP analysis indicates that overshooting the reference
SPP (e.g. due to a dosing error adding a few hundredths of a percent more) has a negative effect on both compressive and
flexural strength (Fig. 7), likely because excessive superplasticizer can cause segregation or retardation. Slightly under-dosing
below the reference might marginally improve strength (since the reference dosage may have been a bit high), but generally,
variability in SPP leads to inconsistent workability and air entrainment.
The only critical finding when comparing CS28 and FS28 is the importance of IT (Fig. 6). Relative to the 20◦ C reference,
cooling the constituents to 10◦ C modestly improves CS28 by about +0.95 MPa (+1.40 MPa vs. + 0.45 MPa), whereas heating
them to 40◦ C reduces it by approximately −1.91 MPa (−1.46 MPa vs. + 0.45 MPa); the corresponding shift in FS28 is
I
T
AR
E
L
C
IN
(a) Batch-wise signed SHAP contributions (MPa) to the predicted
CS28; positive values increase the prediction, negative values
decrease it.
CC1YDU&ODVVOHYHOV
0HDQ6+$3YDOXH
SPPYDU WRNJ
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
ITYDU &
SAIYDU WRNJ
0HDQ6+$3YDOXH
CC28YDU&ODVVOHYHOV
0HDQ6+$3YDOXH
FLIYDU WRNJ
0HDQ6+$3YDOXH
IN
S
S
E
R
P
GRPYDU NJ
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
IMYDUWRNJ
SPPYDU WRNJ
CT1YDU &
I
T
AR
E
L
C
0HDQ6+$3YDOXH
CC1YDU&ODVVOHYHOV
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
SAIYDU WRNJ
0HDQ6+$3YDOXH
ITYDU &
GRPYDU NJ
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
CC28YDU&ODVVOHYHOV
IMYDUWRNJ
CT28YDU &
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
CT1YDU &
0HDQ6+$3YDOXH
CT28YDU &
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
0HDQ6+$3YDOXH
ARTICLE IN PRESS
FLIYDU WRNJ
(b) Batch-wise signed SHAP contributions (MPa) to the predicted
FS28; positive values increase the prediction, negative values
decrease it.
Figure 7. SHAP-based interpretation of the best-performing models at the batch level. Panels show signed SHAP values (φ j ,
MPa) per parameter and batch58 . Orange bars (dashed lines) indicate reference levels from the baseline UHPC recipe.
Abbreviations are defined in Table 1.
small (at 10 ◦ C: −0.10 vs. +0.08 MPa, ∆IT = −0.18 MPa; at 40 ◦ C: −0.19 vs. +0.08 MPa, ∆IT = −0.27 MPa), well within
normal scatter. Because similar effects on flexural strength are anticipated, further investigation is warranted before definitive
ARTICLE IN PRESS
1.0
CT28
CC28
1.0
CT28
CC28
0.8
IM
SAI
0.8
IM
SAI
0.6
FLI
SPP
SPP
CT1
0.4
CT1
FLI
CC1
0.6
0.4
CC1
IT
0.2
0.2
GRP
(a) Pairwise interaction heatmap for CS28 (PLS). The ALE-based
Friedman H-statistic (0 ≤ H ≤ 1) is shown; strong non-additivity is
centered on CT28, with the largest pair CT28×CT1 (H = 0.677).
Numeric annotations denote H.
APW
APW
GRP
IT
CC1
CT1
SPP
FLI
SAI
0.0
IM
APW
GRP
IT
CC1
CT1
SPP
FLI
SAI
IM
CC28
CT28
0.0
CC28
APW
CT28
GRP
IT
(b) Pairwise interaction heatmap for FS28 (SVR). No strong
interactions are present; the largest effects are moderate and
CT28-centered (e.g., CT28×GRP, H = 0.289; CT28×IM,
H = 0.288). Numeric annotations denote H.
S
S
E
R
P
Figure 8. Pairwise interaction structure of the models. Heatmaps report the ALE-based Friedman H-statistic for each feature
pair (H ≈ 0 additive; H > 0.30 strong two-way effect). Panel (a) shows pronounced curing-centered non-additivity for CS28,
while panel (b) shows predominantly additive behavior for FS28 with only mild two-factor synergies. Abbreviations are
defined in Table 1.
IN
conclusions are drawn – perhaps the temperature span examined is not wide enough to reveal the full influence of IT.
In summary, the interpretative analysis confirms that strictly adhering to a fixed UHPC recipe is not sufficient. Curing
temperature and humidity in particular govern the extent to which the UHPC mix reaches its potential strength, eclipsing other
factors. Fluctuations in raw material moisture and purity further explain why ostensibly identical batches can diverge in quality.
Meanwhile, the formulation shows encouraging robustness to small errors in dosing and gradation, which is advantageous
for practical production. These insights highlight the need for a smart manufacturing approach: by monitoring the critical
parameters and providing feedback (e.g. adjusting curing protocols for a given batch), one can at-line, i.e., before curing for the
current batch. This holistic understanding of sensitivity and robustness will be vital for minimizing waste and reliably achieving
the ultra-high performance that UHPC formulations are meant to deliver.
E
L
C
I
T
AR
Pairwise Interaction Effects on CS28 and FS28
Pairwise interaction effects are quantified using the ALE-based59 Friedman H-statistic60 (H ∈ [0, 1]), where H ≈ 0 indicates
additivity61 and H > 0.30 indicates a strong two-way effect62 (Fig. 8). The H-statistic analysis reveals that 28-day curing
temperature (CT28) has strong two-way interaction effects with virtually every other factor (H ≈ 0.64–0.68). In particular,
the CT28 ×CT1(early curing temperature) pair achieves H = 0.677, indicating a highly non-additive influence (H > 0.3 is
considered strong). This single interaction contributes nearly half of the model’s explained variance (H 2 ≈ 46%). All other
CT28 combinations (e.g., with internal moisture IM, mixing energy APW, ingredient temperature IT, etc.) yield H ≈ 0.64–0.65
(roughly 40–43% variance each), underscoring that the effect of CT28 on CS28 is strongly contingent on those conditions
rather than additive. The early curing temperature (CT1) itself shows consistent moderate interactions with several parameters
(H ≈ 0.26–0.30) – for instance, CT1 paired with IM, APW, silica/filler indices (SAI, FLI), or superplasticizer dosage (SPP) all
have H between 0.27 and 0.30. Apart from these curing-related synergies, most other factor pairs exhibit much lower H values
(typically < 0.2), suggesting that their influences on CS28 are approximately additive with no significant two-factor coupling.
For example, the largest interaction outside the curing variables is between IM and APW at H ≈ 0.185 (moderate), whereas
many other combinations (e.g., CC28 with FLI or SPP) are near zero and essentially additive.
In contrast to CS28, the 28-day flexural strength model is governed more by additive effects, with no factor pair exceeding
the strong interaction threshold. The highest two-factor interactions in FS28 are only moderate in magnitude (H < 0.30).
Notably, CT28 again emerges as an interactive factor, but to a lesser extent: its joint effects with mixture composition (graphite
content, GRP) and internal moisture (IM) are the largest (H = 0.289 and 0.288, respectively). CT28’s interactions with packing
density (SAI, H ≈ 0.281) and ingredient temperature (IT, H ≈ 0.275) are similar in scale, as are those with mixing power,
ARTICLE IN PRESS
Table 5. Original vs. optimized curing set points for five underperforming batches. The table lists the original curing
conditions alongside the model-recommended optimized values for early-age curing (CT1, CC1) and long-term curing (CT28,
CC28), as well as the corresponding 28-day compressive strengths achieved. Abbreviations are defined in Table 1. (Exp. Nr.:
Experiment number)
Original
Exp. Nr.
52
59
107
132
148
IM
SAI FLI
3.364 6
3.276 6.9
3.042 6
2.925 6
3.364 5.4
6.9
5.7
5.1
6
5.7
Optimized
SPP
IT GRP APW CT1 CC1 CT28 CC28
CS28
CT1 CC1 CT28 CC28
CS28
0.339
0.323
0.323
0.323
0.29
40 0.045
20 0.225
25 0.225
25
0
25
0
85.70
105.11
107.87
95.94
107.86
33
34
26
15
10
117.56
121.74
119.58
118.26
120.68
1.07
1.04
1.21
1.18
1.18
40
40
10
20
20
2
2
2
1
1
10
40
20
10
20
2
2
1
2
1
1
1
2
2
1
31
36
33
25
27
1
2
2
1
2
superplasticizer, and filler (all H ≈ 0.26–0.27). These values imply only moderate departures from additivity, meaning the
effect of CT28 on FS28, while context-dependent, is far less pronounced than it was for CS28. Aside from CT28-related pairs,
very few interactions in the FS28 system are notable – for example, the combination of initial curing condition (CC1 and
mixing energy (APW) shows a moderate H ≈ 0.21. Most other factor pairs (including all involving CC28, the 28-day curing
condition type) register H ≲ 0.15 and can be considered nearly additive. In summary, the FS28 outcome appears predominantly
additive with only mild two-factor synergies, whereas CS28 is distinctly non-additive with strong coupled effects centered on
the curing temperature variables.
S
S
E
R
P
Batch-Specific Curing Optimization Restores UHPC Strength
The previous section established that the 28-day compressive strength of the studied UHPC mix is highly sensitive to curing,
moisture content, and the purity and gradation of constituent materials. A large variance in final strength (85–136 MPa, Table
1) across nominally identical batches underscores that relying on a fixed recipe is insufficient.
To translate these insights into actionable guidance, the BPM (PLS model; Table 3) for the 28-day compressive strength is
embedded in the designed recommendation system (Fig. 3). Five underperforming batches are selected from the investigated
UHPC dataset36 to test the system. For each batch, the original process record (including IM, SAI, FLI, SPP, IT, GRP, and
the original curing schedule) is passed to the recommendation engine. The model’s recommended curing schedule is used
to manufacture a new batch. Table 5 summarizes the input and output variables, and Fig. 9 records the differences between
original and recommended curing conditions. The remanufactured batches achieved significant strength gains of 11.7–31.9
MPa, with an average increase of roughly 19 MPa. In other words, adjusting the curing conditions allowed previously rejected
batches to reach or exceed the 120 MPa design target, thereby salvaging material that would otherwise have been discarded.
The corrective power of the recommendation system is illustrated by experiment 47. The original batch, cured at 40 ◦ C
under plastic wrap during the first 24 h and at 10 ◦ C thereafter, reached only 85.7 MPa. The model prescribed a moderate
reduction of the initial temperature (40 → 33 ◦ C), a switch from plastic wrap to 90 % RH air (CC1: 2 → 1), and a substantial
increase of the long-term temperature (10 → 31 ◦ C) while replacing plastic wrap with water immersion (CC28: 2 → 1). These
adjustments raised the measured strength to 117.6 MPa. Comparable gains are obtained for the four other underperforming
batches (experiments 53, 99, 123, 139; Table 5), all of which converge to 118–122 MPa after optimization.
The improvements are realized through moderate adjustments rather than extreme values. Across the five cases, the
long-term curing temperature CT28 is increased by 7–21 ◦ C relative to the original conditions, but none of the optimized
schedules used the maximum 40 ◦ C identified in the global sensitivity analysis (Fig. 7). This moderation reflects trade-offs
with the other variables: for example, if the mix is already extremely dry or the superplasticizer dosage is slightly high, then an
excessive increase in curing temperature can accelerate hydration beyond the optimum, leading to internal cracking or increased
porosity (Table 5). A moderate elevation ensures sufficient hydration while limiting thermal stress.
Instead, values in the 25–36 ◦ C range are recommended. The early-age curing temperature CT1 is lowered by 5–10 ◦ C
in three experiments but raised by 16 ◦ C in one case (Fig. 9). This variability highlights that the optimum depends on the
entire vector of material properties: while the overall trend from the previous section suggested that reducing CT1 improves
28-day strength and raising CT28 increases strength, the recommendation engine identified that moderate early heat (e.g.,
25–30 ◦ C) can be beneficial when raw materials are extremely dry or have lower impurity levels (Table 5). For example, in
experiment 99 the original batch is cured at 10 ◦ C and still underperformed; the model recommended increasing CT1 to 26 ◦ C.
In this case, the initial moisture of the dry ingredients is low and the sand grading/filler grading deviated from the reference.
A moderate increase in early heat accelerates hydration just enough to offset the slower reaction kinetics associated with dry
powders and sub-optimal packing. Excessive heat would induce thermal stresses, but a gentle rise to 25–30 ◦ C can improve
early microstructure without causing microcracking. Thus, the optimal curing temperature is context-dependent rather than a
I
T
AR
E
L
C
IN
ARTICLE IN PRESS
&7 &7
&& &&
([S1U
&6
&7 &7
&& &&
([S1U
&7 &7
&& &&
([S1U
&7 &7
&& &&
([S1U
&6
&6
&6
&7 &7
&& &&
([S1U
2ULJLQDO
2SWLPL]HG
03DWDUJHW
&6
GD\&RPSUHVVLYH6WUHQJWKLQ03D
Figure 9. Batch-specific curing optimization restores UHPC strength. Arrows indicate the shift from original (triangles) to
optimized (circles) 28-day compressive strength; labels show ∆CS28 and recommended changes in CT1, CT28, CC1, CC28.
The red dashed line marks the 120 MPa target. Modest adjustments recover 11.7–31.9 MPa, bringing all batches to
118–122 MPa. Abbreviations are defined in Table 1. (Exp. Nr.: Experiment number)
S
S
E
R
P
fixed 10 ◦ C.
The global analysis indicated that switching from plastic wrapping to continuous water immersion (for CC28) increases
compressive strength by about 5.6 MPa (Fig. 7a). In practice, the optimal choice is not uniform (Table 5): some remanufactured
batches benefited from immersion (class 1), while others performed better under plastic wrap (class 2). This discrepancy arises
because the PLS model balances compressive strength against the other measured properties; if the raw material moisture
is already high, additional water immersion can push the effective water-to-cement ratio above the optimum, whereas using
plastic wrap helps maintain a lower pore volume.
In summary, the variability observed in UHPC production arises from the cumulative effect of multiple uncertainties. Even
small deviations in raw material moisture (IM), sand grading (SAI vs. SAII), filler grading (FLI, FLII), superplasticizer dosage
(SPP), ingredient temperature (IT), and impurity in silica fume (GRP) can interact in complex ways. For example, an extra
0.4 kg of moisture in the powders increases the effective water-to-cement ratio and lowers the density of the hardened matrix;
simultaneously, a higher proportion of the less well-graded sand type (high SAI) reduces packing density. When these shifts
coincide with an overdosage of superplasticizer, the result is higher porosity and more air entrainment, which drastically lower
compressive strength. Such combined deviations explain why some batches originally attained only ∼85 MPa (Table 5).
The case studies demonstrate that at-line, operator-in-the-loop, data-driven adjustment of curing conditions is a powerful
lever to ensure that UHPC mixes consistently achieve their design strength. Although the global SHAP analysis provides
valuable insights into the relative importance of each variable, the optimal curing profile for a specific batch depends on the
entire constellation of process parameters. Employing a predictive model to navigate this multidimensional space yields tailored
recommendations that restore performance, reduce waste, and contribute to more sustainable concrete production.
E
L
C
IN
I
T
AR
Industrial Deployment Considerations
The framework has been implemented as an at-line, operator-in-the-loop decision-support system. End-of-mix predictions
are generated so that any required adjustments can precede curing, and the exhaustive grid evaluation of candidate curing set
points (∼3,844 combinations) is completed in ∼30 s on a standard workstation (Intel® Core™ i9-10900X CPU, 64 GB RAM),
which is compatible with typical batch cycle times. Sensor integration has been established as follows. Ambient temperature
and relative humidity in the production area are continuously measured and controlled, and curing temperature (for early-age
and long-term phases) is monitored by calibrated probes; relative humidity in environmental chambers is likewise monitored
to enforce prescribed profiles. Material dosing is executed with high accuracy using calibrated scales to eliminate weighing
and admixture-dosage errors. The granular fractions (SAI / SAII and FLI / FLII) are prepared to the specified proportions
and their particle size distributions are periodically characterized by scanning electron microscopy to ensure reproducible
gradation. During mixing, mixer speed and duration are precisely adjustable, the mixer controller is physically connected to
the acquisition computer, and the instantaneous power signal P(t) is streamed at 1 Hz by a Python interface. The stream is
ARTICLE IN PRESS
persisted per second, and the average energy consumption are computed and forwarded to the batch record. A graphical user
interface (GUI) has been developed for operator interaction: batch-level inputs (ingredient temperatures and moisture, verified
dosing amounts, intended curing schedule, and ambient conditions) are entered, the trained model executes in the background,
and the predicted 28-day compressive strength together with any recommended curing adjustments are displayed for operator
approval and implementation at-line.
However, the present system has been developed and validated in a controlled precast production context, where formwork,
cycle times, and curing environments are repeatable. Direct transfer to cast-in-place construction is not anticipated, as field
operations typically lack tight environmental control and exhibit non-standard cycle times with constrained sensing and actuation.
Furthermore, industrial-scale deployment further presupposes a sufficiently dense sensing and data-integration infrastructure
(e.g., continuous curing/maturity monitoring, at-line dosing verification, and ambient-condition logging). Retrofitting legacy
plants may entail prohibitive costs and downtime due to capital expenditure, integration and calibration effort, and limited
digital connectivity; consequently, near-term benefits are most readily realized in digitally mature precast facilities, whereas
broader adoption will depend on lower-cost sensing solutions and lightweight integration.
Conclusions and Future Work
Ultra-high performance concrete (UHPC) production is prone to batch-to-batch variability in the mechanical properties, even
when identical mix proportions are used. This inconsistency often yields off-spec batches that must be discarded, leading to
material waste and added CO2 emissions. Addressing this challenge, the present study developed an at-line end-of-mix ML
recommendation system to identify process-induced strength deviations and enable timely corrections to uphold quality.
A comprehensive 150-batch design-of-experiments captured key sources of process uncertainty – raw material quality,
storage conditions, dosing errors, and curing regimes – while the base mix recipe remained fixed. Ten diverse ML algorithms
were trained on this dataset; the best-performing model (BPM) explained ≥ 75 % of strength variance with a prediction
error ≤ 10 % under leave-one-out cross-validation. SHAP analysis showed that long-term curing temperature and humidity
dominate strength gain, followed by ingredient moisture and impurity in the silica fume. Guided by these insights, the BPM
was embedded in an at-line decision-support tool that proposes batch-specific curing adjustments whenever a strength shortfall
is predicted. Curing temperature and humidity thus become control knobs: if upstream variables stray from their ideal ranges,
the system selects modified curing settings that bring the predicted 28-day strength (CS28) within the target specification of
CS28 = 120 MPa.
Validation with five underperforming batches confirmed the system’s efficacy. Model-recommended adjustments – typically
±20◦ C temperature shifts or humidity changes – raised compressive strength from an initial 86–108 MPa to 118–122 MPa.
The maximum recovery reached 31.9 MPa, and every batch met or exceeded the 120 MPa target. Hence, timely, data-driven
intervention can rescue batches that would otherwise be rejected, eliminating waste, reducing reprocessing, and lowering
the embodied CO2 of production. Because the reference UHPC uses white Portland cement (600 kg/m3 ) and the plant batch
volume is 15 m3 , the five salvaged batches (75 m3 ) contain 45 t of cement; on a cement-only, cradle-to-gate basis (A1–A3)
using 0.913 kg CO2 e kg−1 for white Portland cement, this corresponds to ≈ 41 t CO2 e avoided.
The next step is to scale the framework to full industrial trials, integrating live data streams and larger batch volumes.
Additionally, applying this experimental–ML methodology to other cementitious systems, additional UHPC types, and recycled
concrete could demonstrate its broader relevance to sustainable, reproducible materials manufacturing.
E
L
C
IN
S
S
E
R
P
I
T
AR
Data availability
The datasets generated during and/or analysed during the current study are available in the DaKS – University of Kassel’s
research data repository, https://doi.org/10.48662/daks-56.
Code availability
The underlying code generated during the current study is available from the corresponding author on reasonable request.
Funding
This project is funded by the University of Kassel through the BiTWerk project.
Author contributions
F.R. conceived and designed the study; developed the methodology and software; curated the data; performed formal analysis
and validation; prepared the visualizations; and wrote the original draft. A.A. contributed to conceptualization, data curation,
ARTICLE IN PRESS
investigation, and validation, and contributed to writing the original draft. G.Z. contributed to conceptualization, acquired
funding, provided resources, and administered the project. A.K. contributed to conceptualization, acquired funding, provided
resources, supervised the research, and revised the manuscript. All authors approved the final version of the manuscript.
Competing interests
The authors declare no competing interests.
References
1. Abbas, S., Soliman, A. M. & Nehdi, M. L. Exploring mechanical and durability properties of ultra-high performance
concrete incorporating various steel fiber lengths and dosages. Constr. Build. Mater. 75, 429–441, DOI: 10.1016/j.
conbuildmat.2014.11.017 (2015).
2. Li, J., Wu, Z., Shi, C., Yuan, Q. & Zhang, Z. Durability of ultra-high performance concrete – a review. Constr. Build.
Mater. 255, 119296, DOI: 10.1016/j.conbuildmat.2020.119296 (2020).
3. Richard, P. & Cheyrezy, M. Composition of reactive powder concretes. Cem. Concr. Res. 25, 1501–1511, DOI:
10.1016/0008-8846(95)00144-1 (1995).
4. Russell, H. G. & Graybeal, B. A. Ultra-high performance concrete: A state-of-the-art report for the bridge community.
Tech. Rep. FHWA-HRT-13-060, Federal Highway Administration, McLean, VA (2013).
5. Graybeal, B., Crane, C. K., Perry, V., Corvez, D. & Ahlborn, T. M. Advancing ultra-high-performance concrete. Concr.
Int. 41, 41–45 (2019).
S
S
E
R
P
6. Abrishambaf, A., Pimentel, M. & Nunes, S. Influence of fibre orientation on the tensile behaviour of ultra-high performance
fibre reinforced cementitious composites. Cem. Concr. Res. 97, 28–40, DOI: 10.1016/j.cemconres.2017.03.007 (2017).
7. Charron, J.-P., Denarié, E. & Brühwiler, E. Permeability of ultra high performance fiber reinforced concretes (UHPFRC)
under high stresses. Mater. Struct. 40, 269–277, DOI: 10.1617/s11527-006-9105-0 (2007).
IN
8. Tran, N. T., Tran, T. K., Jeon, J. K., Park, J. K. & Kim, D. J. Fracture energy of ultra-high-performance fiber-reinforced
concrete at high strain rates. Cem. Concr. Res. 79, 169–184, DOI: 10.1016/j.cemconres.2015.09.011 (2016).
E
L
C
9. Yoo, D.-Y., Lee, J.-H. & Yoon, Y.-S. Effect of fiber content on mechanical and fracture properties of ultra high performance
fiber reinforced cementitious composites. Compos. Struct. 106, 742–753, DOI: 10.1016/j.compstruct.2013.07.033 (2013).
I
T
AR
10. Xi, B., Huang, Z., Al-Obaidi, S. & Ferrara, L. Healing capacity of ultra high performance concrete under sustained through
crack tensile stresses and aggressive environments. Cem. Concr. Compos. 145, 105355, DOI: 10.1016/j.cemconcomp.2023.
105355 (2024).
11. Brühwiler, E. & Denarié, E. Rehabilitation and strengthening of concrete structures using ultra-high performance fibre
reinforced concrete. Struct. Eng. Int. 23, 450–457, DOI: 10.2749/101686613X13627347100437 (2013).
12. Charron, J.-P., Denarié, E. & Brühwiler, E. Transport properties of water and glycol in an ultra high performance fiber
reinforced concrete (UHPFRC) under high tensile deformation. Cem. Concr. Res. 38, 689–698, DOI: 10.1016/j.cemconres.
2007.12.006 (2008).
13. Meda, A. & Rosati, G. Design and construction of a bridge in very high performance fiber-reinforced concrete. J. Bridg.
Eng. 8, 281–287, DOI: 10.1061/(ASCE)1084-0702(2003)8:5(281) (2003).
14. Bertola, N., Kälin, E. & Brühwiler, E. Environmental and life-cycle cost evaluation of a composite timber-UHPFRC bridge.
In Proceedings of the Third International Interactive Symposium on Ultra-High Performance Concrete (IIS-UHPC 2023),
1–8, DOI: 10.21838/uhpc.16641 (Iowa State University Digital Press, Des Moines, IA, USA, 2023).
15. Joe, C. D. & Moustafa, M. A. Cost and ecological feasibility of using UHPC in bridge piers. In Proceedings of the First
International Interactive Symposium on Ultra-High Performance Concrete (IIS-UHPC 2016), 1–8, DOI: 10.21838/uhpc.
2016.103 (Iowa State University Digital Press, Des Moines, IA, USA, 2016).
16. Fan, J., Shao, Y., Bandelt, M. J., Adams, M. P. & Ostertag, C. P. Sustainable reinforced concrete design: The role of
ultra-high performance concrete (UHPC) in life-cycle structural performance and environmental impacts. Eng. Struct. 316,
118585, DOI: 10.1016/j.engstruct.2024.118585 (2024).
17. Haber, Z. B., McDonagh, M., Foden, A. & Sadasivam, S. Ultra-high performance concrete (UHPC) overlays: An example
of lifecycle cost analysis. Tech Note FHWA-HRT-23-012, United States. Federal Highway Administration. Office of
Research, Development, and Technology, McLean, VA (2022).
ARTICLE IN PRESS
18. Bertola, N., Küpfer, C. & Brühwiler, E. UHPFRC intervention reduces environmental and economic costs in bridge
management. Res. Sq. DOI: 10.21203/rs.3.rs-6549136/v1 (2025). Preprint.
19. Alsalman, A. et al. Comparative sustainability assessment of proprietary and non-proprietary ultra-high performance
concrete mixtures. Infrastructures 10, 245, DOI: 10.3390/infrastructures10090245 (2025).
20. Rezazadeh P., F., Dürrbaum, A., Zimmermann, G. & Kroll, A. Leveraging ensemble structures to elucidate the impact of
factors that influence the quality of ultra-high performance concrete. In 2023 IEEE Symposium Series on Computational
Intelligence (SSCI), 180–187, DOI: 10.1109/SSCI52147.2023.10371800 (IEEE, Mexico City, Mexico, 2023).
21. Wakjira, T. G., Kutty, A. A. & Alam, M. S. A novel framework for developing environmentally sustainable and costeffective ultra-high-performance concrete (uhpc) using advanced machine learning and multi-objective optimization
techniques. Constr. Build. Mater. 416, 135114, DOI: 10.1016/j.conbuildmat.2024.135114 (2024).
22. Rezazadeh P., F., Dürrbaum, A., Zimmermann, G. & Kroll, A. Holistic modeling of ultra-high performance concrete
production process: Synergizing mix design, fresh concrete properties, and curing conditions. In Proceedings - 33.
Workshop Computational Intelligence: Berlin, 23.-24. November 2023, 215–237, DOI: 10.58895/ksp/1000162754-15 (KIT
Scientific Publishing, Berlin, Germany, 2023).
23. Aylas-Paredes, B. K. et al. Data driven design of ultra high performance concrete prospects and application. Sci. Reports
15, 9248, DOI: 10.1038/s41598-025-94484-2 (2025).
24. Golafshani, E. M., Behnood, A., Kim, T., Ngo, T. & Kashani, A. Metaheuristic optimization based-ensemble learners for the
carbonation assessment of recycled aggregate concrete. Appl. Soft Comput. 159, 111661, DOI: 10.1016/j.asoc.2024.111661
(2024).
S
S
E
R
P
25. Kumar, R., Rai, B. & Samui, P. A comparative study of prediction of compressive strength of ultra-high performance
concrete using soft computing technique. Struct. Concr. 24, 5538–5555, DOI: 10.1002/suco.202200850 (2023).
26. Kandiri, A., Sartipi, F. & Kioumarsi, M. Predicting compressive strength of concrete containing recycled aggregate using
modified ANN with different optimization algorithms. Appl. Sci. 11, 485, DOI: 10.3390/app11020485 (2021).
IN
27. Marani, A., Jamali, A. & Nehdi, M. L. Predicting ultra-high-performance concrete compressive strength using tabular
generative adversarial networks. Materials 13, 4757, DOI: 10.3390/ma13214757 (2020).
E
L
C
28. Ling, H., Qian, C., Kang, W., Liang, C. & Chen, H. Combination of support vector machine and k-fold cross validation to
predict compressive strength of concrete in marine environment. Constr. Build. Mater. 206, 355–363, DOI: 10.1016/j.
conbuildmat.2019.02.071 (2019).
I
T
AR
29. Hoang, N.-D., Pham, A.-D., Nguyen, Q.-L. & Pham, Q.-N. Estimating compressive strength of high performance concrete
with gaussian process regression model. Adv. Civ. Eng. 2016, 1–8, DOI: 10.1155/2016/2861380 (2016).
30. Nguyen, N.-H., Abellán-García, J., Lee, S., García-Castaño, E. & Vo, T. P. Efficient estimating compressive strength of
ultra-high performance concrete using XGBoost model. J. Build. Eng. 52, 104302, DOI: 10.1016/j.jobe.2022.104302
(2022).
31. Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, KDD ’16, 785–794, DOI: 10.1145/2939672.2939785 (Association
for Computing Machinery, San Francisco, CA, USA, 2016).
32. Li, Z. et al. Machine learning in concrete science: applications, challenges, and best practices. npj Comput. Mater. 8, 127,
DOI: 10.1038/s41524-022-00810-x (2022).
33. Yeh, I.-C. Modeling of strength of high-performance concrete using artificial neural networks. Cem. Concr. Res. 28,
1797–1808, DOI: 10.1016/S0008-8846(98)00165-3 (1998).
34. Yeh, I.-C. Modeling slump flow of concrete using second-order regressions and artificial neural networks. Cem. Concr.
Compos. 29, 474–480, DOI: 10.1016/j.cemconcomp.2007.02.001 (2007).
35. Taguchi, G. System of Experimental Design: Engineering Methods to Optimize Quality and Minimize Costs
(UNIPUB/Kraus International Publications; American Supplier Institute, White Plains, NY; Dearborn, MI, 1987).
36. Rezazadeh, F., Dürrbaum, A., Abrishambaf, A., Zimmermann, G. & Kroll, A. Mechanical properties of ultra-high
performance concrete (UHPC). Dataset, DOI: 10.48662/daks-56 (2025). Available since 2025-02-24; License: CC BY 4.0.
37. Refaeilzadeh, P., Tang, L. & Liu, H. Cross-validation. In Encyclopedia of Database Systems, 532–538, DOI: 10.1007/
978-0-387-39940-9_565 (Springer US, Boston, MA, 2009).
ARTICLE IN PRESS
38. Stein, M. Large sample properties of simulations using Latin hypercube sampling. Technometrics 29, 143–151, DOI:
10.1080/00401706.1987.10488205 (1987).
39. Carnell, R. optaugmentlhs.r [R script]. GitHub repository: bertcarnell/lhs (2019). Part of the lhs package
(GPL-3). Accessed 12 February 2025.
40. Rezazadeh, F., Abrishambaf, A., Dürrbaum, A., Zimmermann, G. & Kroll, A. Investigating reproducibility of ultra-high
performance concrete with consistent mechanical properties: A modeling pipeline for complex manufacturing. In Schulte,
H., Hoffmann, F. & Mikut, R. (eds.) Proceedings - 34. Workshop Computational Intelligence: Berlin, 21.–22. November
2024, 143–148, DOI: 10.58895/ksp/1000174544-9 (KIT Scientific Publishing, Berlin, Germany, 2024).
41. Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
42. Wegelin, J. A. A survey of partial least squares (pls) methods, with emphasis on the two-block case. Technical Report 371,
University of Washington, Department of Statistics, Seattle, WA, USA (2000).
43. Murphy, K. P. Machine Learning: A Probabilistic Perspective. Adaptive Computation and Machine Learning (MIT Press,
Cambridge, MA, 2012).
44. Fix, E. & Hodges Jr., J. L. Discriminatory analysis. nonparametric discrimination: Consistency properties. Int. Stat. Rev.
57, 238–247, DOI: 10.2307/1403797 (1989).
45. Hamel, L. H. Knowledge Discovery with Support Vector Machines. Wiley Series on Methods and Applications in Data
Mining (John Wiley & Sons, Hoboken, NJ, 2009).
46. Breiman, L. Random forests. Mach. Learn. 45, 5–32, DOI: 10.1023/A:1010933404324 (2001).
S
S
E
R
P
47. Friedman, J. H. Greedy function approximation: a gradient boosting machine. The Annals Stat. 29, 1189–1232, DOI:
10.1214/aos/1013203451 (2001).
48. Rasmussen, C. E. Gaussian processes in machine learning. In Advanced Lectures on Machine Learning: ML Summer
Schools 2003, Canberra, Australia, February 2–14, 2003; Tübingen, Germany, August 4–16, 2003, Revised Lectures, vol.
3176 of Lecture Notes in Computer Science, 63–71, DOI: 10.1007/978-3-540-28650-9_4 (Springer, Berlin, Heidelberg,
2004).
E
L
C
IN
49. Deb, K., Pratap, A., Agarwal, S. & Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE
Transactions on Evol. Comput. 6, 182–197, DOI: 10.1109/4235.996017 (2002).
I
T
AR
50. Ge, Z., Song, Z., Ding, S. X. & Huang, B. Data mining and analytics in the process industry: The role of machine learning.
IEEE Access 5, 20590–20616, DOI: 10.1109/ACCESS.2017.2756872 (2017).
51. Rezazadeh, F. & Kroll, A. Predicting the compressive strength of concrete up to 28 days-ahead: Comparison of 16 machine
learning algorithms on benchmark datasets. In Proceedings - 32. Workshop Computational Intelligence: Berlin, 1. – 2.
Dezember 2022, 53–75, DOI: 10.58895/ksp/1000151141-4 (KIT Scientific Publishing, Berlin, Germany, 2022).
52. Sagi, O. & Rokach, L. Ensemble learning: A survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 8, DOI:
10.1002/widm.1249 (2018).
53. Taffese, W. Z., Zhu, Y. & Chen, G. Ensemble-learning model based ultimate moment prediction of reinforced concrete
members strengthened by UHPC. Eng. Struct. 305, 117705, DOI: 10.1016/j.engstruct.2024.117705 (2024).
54. Nguyen, M. H., Nguyen, T.-A. & Ly, H.-B. Ensemble xgboost schemes for improved compressive strength prediction of
uhpc. Structures 57, 105062, DOI: 10.1016/j.istruc.2023.105062 (2023).
55. Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information
Processing Systems 30 (NeurIPS 2017), 4765–4774 (Curran Associates, Inc., Long Beach, CA, USA, 2017). 1705.07874.
56. Shapley, L. S. A value for n-person games. In Kuhn, H. W. & Tucker, A. W. (eds.) Contributions to the Theory of Games,
Volume II, 307–317, DOI: 10.1515/9781400881970-018 (Princeton University Press, Princeton, NJ, 1953).
57. Benesty, J., Chen, J., Huang, Y. & Cohen, I. Pearson correlation coefficient. In Noise Reduction in Speech Processing,
vol. 2 of Springer Topics in Signal Processing, 1–4, DOI: 10.1007/978-3-642-00296-0_5 (Springer, Berlin, Heidelberg,
2009).
58. Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2,
56–67, DOI: 10.1038/s42256-019-0138-9 (2020).
59. Apley, D. W. & Zhu, J. Visualizing the effects of predictor variables in black box supervised learning models. J. Royal
Stat. Soc. Ser. B 82, 1059–1086, DOI: 10.1111/rssb.12377 (2020).
ARTICLE IN PRESS
60. Friedman, J. H. & Popescu, B. E. Predictive learning via rule ensembles. The Annals Appl. Stat. 2, 916–954, DOI:
10.1214/07-AOAS148 (2008).
61. Molnar, C. Interpretable Machine Learning (2024). Feature Interaction & ALE chapters.
62. Herbinger, J. et al. Decomposing global feature effects based on generalized functional anova. J. Mach. Learn. Res. 25,
1–49 (2024).
I
T
AR
E
L
C
IN
S
S
E
R
P