TY - JOUR
T1 - A Data-Driven Influential Factor Analysis Method for Fly Ash-Based Geopolymer Using Optimized Machine-Learning Algorithms
AU - Ma, Guowei
AU - Cui, Aidi
AU - Huang, Yimiao
AU - Dong, Wei
PY - 2022
Y1 - 2022
N2 - As a promising environmentally friendly construction material that can be used to replace concrete, fly ash-based geopolymer (FABG) should meet the working strength requirement. However, the optimal mixture design of FABG could be difficult to obtain through experimental methods due to a variety of influential factors and their complex interrelationships. To address this problem and explore the influence patterns of those factors, this study developed an ensemble machine learning modeling method that integrated three algorithms: support vector regressor (SVR), random forest regressor (RFR) and extreme gradient boosting (XGBoost). A database containing 896 experimental instances was constructed by reviewing open resources. During the modeling, established estimators were tuned with a metaheuristic algorithm called differential evolution (DE). After analysis, the XGBoost model was determined as the strength prediction model of FABG, because it showed the best performance with the largest R2 scores (0.97 and 0.91) without overfitting by the minimum mean absolute error (MAE) gap between the training and testing subsets. Additionally, a further understanding of how the factors affect the predicted values of the model was given by the SHapley Additive exPlanations (SHAP) theory. The results show that curing conditions had the biggest impact on the model output, followed by alkali-activator solution variables and the mole of sodium hydroxide. Therefore, the proposed method can accurately predict the strength of produced FABG and assist in understanding the influence patterns of various factors.
AB - As a promising environmentally friendly construction material that can be used to replace concrete, fly ash-based geopolymer (FABG) should meet the working strength requirement. However, the optimal mixture design of FABG could be difficult to obtain through experimental methods due to a variety of influential factors and their complex interrelationships. To address this problem and explore the influence patterns of those factors, this study developed an ensemble machine learning modeling method that integrated three algorithms: support vector regressor (SVR), random forest regressor (RFR) and extreme gradient boosting (XGBoost). A database containing 896 experimental instances was constructed by reviewing open resources. During the modeling, established estimators were tuned with a metaheuristic algorithm called differential evolution (DE). After analysis, the XGBoost model was determined as the strength prediction model of FABG, because it showed the best performance with the largest R2 scores (0.97 and 0.91) without overfitting by the minimum mean absolute error (MAE) gap between the training and testing subsets. Additionally, a further understanding of how the factors affect the predicted values of the model was given by the SHapley Additive exPlanations (SHAP) theory. The results show that curing conditions had the biggest impact on the model output, followed by alkali-activator solution variables and the mole of sodium hydroxide. Therefore, the proposed method can accurately predict the strength of produced FABG and assist in understanding the influence patterns of various factors.
KW - Compressive strength
KW - Feature importance
KW - Fly ash-based geopolymer (FABG)
KW - Machine learning
KW - Mixture design
KW - Prediction model
UR - http://www.scopus.com/inward/record.url?scp=85129605281&partnerID=8YFLogxK
U2 - 10.1061/(ASCE)MT.1943-5533.0004266
DO - 10.1061/(ASCE)MT.1943-5533.0004266
M3 - Article
AN - SCOPUS:85129605281
SN - 0899-1561
VL - 34
JO - Journal of Materials in Civil Engineering
JF - Journal of Materials in Civil Engineering
IS - 7
M1 - 04022132
ER -