Machine learning the gap between real and simulated nebulae: A domain-adaptation approach to classify ionised nebulae in nearby galaxies

Francesco Belfiore, Michele Ginolfi, Guillermo Blanc, Mederic Boquien, Melanie Chevance, Enrico Congiu, Simon C.O. Glover, Brent Groves, Ralf S. Klessen, J. Eduardo Méndez-Delgado, Thomas G. Williams

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Classifying ionised nebulae in nearby galaxies is crucial to studying stellar feedback mechanisms and understanding the physical conditions of the interstellar medium. This classification task is generally performed by comparing observed line ratios with photoionisation simulations of different types of nebulae (HII regions, planetary nebulae, and supernova remnants). However, due to simplifying assumptions, such simulations are generally unable to fully reproduce the line ratios in observed nebulae. This discrepancy limits the performance of the classical machine-learning approach, where a model is trained on the simulated data and then used to classify real nebulae. For this study, we used a domain-adversarial neural network (DANN) to bridge the gap between photoionisation models (source domain) and observed ionised nebulae from the PHANGS-MUSE survey (target domain). The DANN is an example of a domain-adaptation algorithm, whose goal is to maximise the performance of a model trained on labelled data in the source domain on an unlabelled target domain by extracting domain-invariant features. Our results indicate a significant improvement in classification performance in the target domain when employing the DANN framework compared to a classical neural network (NN) classifier. Additionally, we investigated the impact of adding noise to the source dataset, finding that noise injection acts as a form of regularisation, further enhancing the performances of both the NN and DANN models on the observational data. The combined use of domain adaptation and noise injection improved the classification accuracy in the target domain by 23%. This study highlights the potential of domain adaptation methods in tackling the domain-shift challenge when using theoretical models to train machine-learning pipelines in astronomy.

Original languageEnglish
Article numberA212
Pages (from-to)1-11
Number of pages11
JournalAstronomy and Astrophysics
Volume694
DOIs
Publication statusE-pub ahead of print - 14 Feb 2025

Fingerprint

Dive into the research topics of 'Machine learning the gap between real and simulated nebulae: A domain-adaptation approach to classify ionised nebulae in nearby galaxies'. Together they form a unique fingerprint.

Cite this