Analysis of Catastrophic Interference with Application to Spline Neural Architectures

dc.contributor.advisorBosman, Anna Sergeevna
dc.contributor.emailheinrich.vandeventer@outlook.comen_US
dc.contributor.postgraduateVan Deventer, Heinrich Pieter
dc.date.accessioned2024-03-01T10:56:26Z
dc.date.available2024-03-01T10:56:26Z
dc.date.created2024-05-13
dc.date.issued2024-02-14
dc.descriptionDissertation (MSc(Computer Science))--University of Pretoria,2024en_US
dc.description.abstractContinual learning is the sequential learning of different tasks by a machine learning model. Continual learning is known to be hindered by catastrophic interference or forgetting, i.e. rapid unlearning of earlier learned tasks when new tasks are learned. Despite their practical success, artificial neural networks (ANNs) are prone to catastrophic interference. This study analyses how gradient descent and overlapping representations between distant input points lead to distal interference and catastrophic interference. Distal interference refers to the phenomenon where training a model on a subset of the domain leads to non-local changes on other subsets of the domain. This study shows that uniformly trainable models without distal interference must be exponentially large. A novel antisymmetric bounded exponential layer B-spline ANN architecture named ABEL-Spline is proposed that can approximate any continuous function, is uniformly trainable, has polynomial computational complexity, and provides some guarantees for distal interference. Experiments are presented to demonstrate the theoretical properties of ABEL-Splines. ABEL-Splines are also evaluated on benchmark regression problems. It is concluded that the weaker distal interference guarantees in ABEL-Splines are insufficient for model-only continual learning. It is conjectured that continual learning with polynomial complexity models requires augmentation of the training data or algorithm.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMSc (Computer Science)en_US
dc.description.departmentComputer Scienceen_US
dc.description.facultyFaculty of Engineering, Built Environment and Information Technologyen_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.description.sponsorshipComputing resources provided by the South African Centre for High-Performance Computing (CHPC).en_US
dc.description.sponsorshipSupported by the National Research Foundation (NRF) of South Africa Thuthuka Grant Number 13819413/TTK210316590115.en_US
dc.identifier.citation*en_US
dc.identifier.doihttps://doi.org/10.25403/UPresearchdata.25260349en_US
dc.identifier.otherA2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/95024
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectmachine learningen_US
dc.subjectcontinual learningen_US
dc.subjectcatastrophic forgettingen_US
dc.subjectcatastrophic interferenceen_US
dc.subjectoverlapping representationen_US
dc.subjectsparse distributed representationen_US
dc.subjectregressionen_US
dc.subjectsplineen_US
dc.subjectartificial neural networken_US
dc.subjectuniversal function approximationen_US
dc.titleAnalysis of Catastrophic Interference with Application to Spline Neural Architecturesen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Heinrich_van_Analysis_2024.pdf
Size:
14.95 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: