TY - GEN
T1 - Comparison of Machine Learning Techniques and Variables for Groundwater Dissolved Organic Nitrogen Prediction in an Urban Area
AU - Wang, Benya
AU - Oldham, Carolyn
AU - Hipsey, Matthew R.
PY - 2016
Y1 - 2016
N2 - © 2016 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license.Dissolved inorganic nitrogen (DIN) are typically the main focus of nutrient management strategies; however, some studies have found that dissolved organic nitrogen (DON) can be the dominant form of total nitrogen (TN) in several Australian estuaries and catchments. To better understand nitrogen cycling and explore the relationships between measured groundwater DON and environmental factors, thirteen machine learning (ML) techniques were compared in this study. DON was simulated under two scenarios using a range of input variables: 1) detailed nutrient data with landscape and sampling factors, and 2) limited nutrient data with landscape and sampling factors. Most of the tested ML algorithms more accurately predicted DON than when it was estimated from the difference between TN and DIN. Some models show greater adaptability to different modelling conditions, with only a few approaches able to predict with high accuracy using limited input variables (scenario 2). From the models tested, bagged mars, cubist and random forest were selected as optimal. Sample depth, sampling date and specific surface water area were the important non-nutrient input variables for DON prediction, which reveals the significant effect of surface environmental factors and seasonality on groundwater DON.
AB - © 2016 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license.Dissolved inorganic nitrogen (DIN) are typically the main focus of nutrient management strategies; however, some studies have found that dissolved organic nitrogen (DON) can be the dominant form of total nitrogen (TN) in several Australian estuaries and catchments. To better understand nitrogen cycling and explore the relationships between measured groundwater DON and environmental factors, thirteen machine learning (ML) techniques were compared in this study. DON was simulated under two scenarios using a range of input variables: 1) detailed nutrient data with landscape and sampling factors, and 2) limited nutrient data with landscape and sampling factors. Most of the tested ML algorithms more accurately predicted DON than when it was estimated from the difference between TN and DIN. Some models show greater adaptability to different modelling conditions, with only a few approaches able to predict with high accuracy using limited input variables (scenario 2). From the models tested, bagged mars, cubist and random forest were selected as optimal. Sample depth, sampling date and specific surface water area were the important non-nutrient input variables for DON prediction, which reveals the significant effect of surface environmental factors and seasonality on groundwater DON.
UR - http://www.hic2016.org/html/index.php
U2 - 10.1016/j.proeng.2016.07.527
DO - 10.1016/j.proeng.2016.07.527
M3 - Conference paper
VL - 154
T3 - Procedia Engineering
SP - 1176
EP - 1184
BT - Procedia Engineering
A2 - Yoo, D
A2 - Kim, H
A2 - Jung, D
A2 - Song, C
PB - Elsevier
T2 - 12th International Conference on Hydroinformatics
Y2 - 21 August 2016 through 26 August 2016
ER -