CARA : convolutional autoencoders for the detection of radio anomalies

dc.contributor.authorBrand, Kevin
dc.contributor.authorGrobler, Trienko L.
dc.contributor.authorKleynhans, Waldo
dc.date.accessioned2025-03-28T08:24:43Z
dc.date.available2025-03-28T08:24:43Z
dc.date.issued2025-02
dc.descriptionDATA AVAILABILITY : The FRGADB data set and the corresponding FIRST fits cutouts that were used for our work are publicly available at https://doi.org/10.5281/zenodo.13773680. A Github repository containing the code for the experiments is also publicly available at https://github.com/KBrand26/CARA.en_US
dc.description.abstractWith the advent of modern radio interferometers, a significant influx in data is expected. This influx will render the manual inspection of samples infeasible and thus necessitates the development of automated approaches to find radio sources with anomalous morphologies. In this paper, we investigate the use of autoencoders for anomalous source detection, based on the assumption that autoencoders will reconstruct anomalies poorly. Specifically, we compare an autoencoder architecture from the literature to two other autoencoder architectures, as well as to four conventional machine learning models. Our results showed that the reconstruction errors of these autoencoders were generally more informative with respect to identifying anomalies than machine learning models were when trained on PCA components. Furthermore, we found that the use of a memory unit in our autoencoders resulted in the best performance, as it further restricted the ability of autoencoders to generalize to anomalous sources. Whilst investigating the use of different reconstruction error metrics as anomaly scores, we determined that they were more informative when combined than they were in isolation. Thus, applying the machine learning models to the combined anomaly scores from the autoencoders resulted in the best overall performance. Particularly, random forests and XGBoost models were the most effective, with isolation forests also being competitive when using a small number of labelled anomalies to tune their hyperparameters. Such isolation forests are also more likely to generalize to unseen classes of anomalies than supervised models such as random forests and XGBoost.en_US
dc.description.departmentElectrical, Electronic and Computer Engineeringen_US
dc.description.librarianhj2024en_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.description.urihttps://academic.oup.com/rastien_US
dc.identifier.citationBrand, K., Grobler, T.L. & Kleynhans, W. 2025, 'CARA : convolutional autoencoders for the detection of radio anomalies', RAS Techniques and Instruments, vol. 4, art. rzaf005, doi : 10.1093/rasti/rzaf005.en_US
dc.identifier.issn2752-8200 (online)
dc.identifier.other10.1093/rasti/rzaf005
dc.identifier.urihttp://hdl.handle.net/2263/101783
dc.language.isoenen_US
dc.publisherOxford University Pressen_US
dc.rights© 2025 The Author(s). Published by Oxford University Press on behalf of Royal Astronomical Society. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/).en_US
dc.subjectMachine learningen_US
dc.subjectData methodsen_US
dc.subjectAnomaly detectionen_US
dc.subjectRadio continuum: galaxiesen_US
dc.subjectAutoencodersen_US
dc.subjectDecision tree ensemblesen_US
dc.subjectSDG-09: Industry, innovation and infrastructureen_US
dc.titleCARA : convolutional autoencoders for the detection of radio anomaliesen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Brand_CARA_2025.pdf
Size:
5.77 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: