Clustering time-course data using P-splines and mixed effects mixture models

Bredenkamp, Deidre

URI: http://hdl.handle.net/2263/83444

Date: 2022

Abstract:

In biology, gene expression is measured over time to study complex biological processes and genetic regulatory networks. Because the underlying process is continuous, time-course gene expression data can be represented by a continuous function. This mini dissertation addresses cluster analysis of time-course data in a mixture model framework. To account for the time dependency of such data, as well as the degree of error present in many datasets, a mixed effects model with penalized B-splines (P-splines) is considered. The performance of this mixed effects model is studied with regard to clustering time-course gene expression data in a mixture model setting. The EM algorithm is implemented to fit the mixture model within the mixed effects model structure. For each subject, the best linear unbiased smooth estimate of its time-course trajectory is calculated, and subjects with similar mean curves are assigned to the same cluster. Model validation statistics such as the model accuracy and the coefficient of determination (R²) indicate that the model can effectively cluster simulated data into clusters that differ in either the form of the curves or the timing of the curves' peaks. The technique is further demonstrated by clustering time-course gene expression data consisting of microarray samples from lung tissue of mice exposed to different influenza strains, measured at 14 time points.
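
The general idea in the abstract can be illustrated with a minimal sketch. It is not the dissertation's implementation: it omits the subject-level random effects and fits a plain mixture of P-spline mean curves with a shared noise variance by EM. The function names (`bspline_basis`, `fit_mixture`), the smoothing parameter `lam`, the number of basis segments, and the simulated curves are all assumptions made for illustration; only the use of 14 time points echoes the abstract.

```python
import numpy as np

def bspline_basis(x, lo, hi, n_segments=8, degree=3):
    """Equally spaced B-spline basis on [lo, hi] (Cox-de Boor recursion)."""
    dx = (hi - lo) / n_segments
    knots = lo + dx * np.arange(-degree, n_segments + degree + 1)
    x = np.asarray(x, dtype=float)[:, None]
    # Degree-0 basis: indicator of each knot interval.
    B = ((x >= knots[:-1]) & (x < knots[1:])).astype(float)
    for d in range(1, degree + 1):
        left = (x - knots[:-d - 1]) / (knots[d:-1] - knots[:-d - 1])
        right = (knots[d + 1:] - x) / (knots[d + 1:] - knots[1:-d])
        B = left * B[:, :-1] + right * B[:, 1:]
    return B  # shape: (len(x), n_segments + degree)

def fit_mixture(Y, B, n_clusters=2, lam=1.0, n_iter=50):
    """EM for a mixture of P-spline mean curves (no random effects)."""
    n, T = Y.shape
    D = np.diff(np.eye(B.shape[1]), n=2, axis=0)  # second-order differences
    P = lam * D.T @ D                             # P-spline roughness penalty
    BtB = B.T @ B

    # Deterministic farthest-point choice of one starting curve per cluster.
    seeds = [0]
    for _ in range(1, n_clusters):
        d2 = np.min([((Y - Y[s]) ** 2).sum(axis=1) for s in seeds], axis=0)
        seeds.append(int(np.argmax(d2)))
    coef = np.array([np.linalg.solve(BtB + P, B.T @ Y[s]) for s in seeds])
    pi = np.full(n_clusters, 1.0 / n_clusters)
    sigma2 = Y.var()

    def sq_resid(coef):
        # Squared distance of every curve to every cluster mean: (n, K).
        return ((Y[:, None, :] - (coef @ B.T)[None, :, :]) ** 2).sum(axis=2)

    for _ in range(n_iter):
        # E-step: posterior cluster probabilities, computed on the log scale.
        log_r = np.log(pi) - 0.5 * sq_resid(coef) / sigma2
        log_r -= log_r.max(axis=1, keepdims=True)
        resp = np.exp(log_r)
        resp /= resp.sum(axis=1, keepdims=True)

        # M-step: weighted penalized least squares for each cluster curve.
        # The penalty is taken as lam / (2 * sigma2) * ||D a||^2, so sigma2
        # cancels in the normal equations.
        nk = resp.sum(axis=0) + 1e-10  # guard against empty clusters
        pi = nk / nk.sum()
        for k in range(n_clusters):
            coef[k] = np.linalg.solve(nk[k] * BtB + P, B.T @ (resp[:, k] @ Y))
        sigma2 = (resp * sq_resid(coef)).sum() / (n * T)
    return resp, coef

# Simulated example (made-up data): two groups of 20 curves observed at
# 14 time points, differing in shape and in the timing of the peak.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 14)
mean_a = np.sin(2 * np.pi * t)
mean_b = np.exp(-((t - 0.7) / 0.15) ** 2)
truth = np.repeat([0, 1], 20)
Y = np.where(truth[:, None] == 0, mean_a, mean_b)
Y = Y + 0.2 * rng.standard_normal(Y.shape)

B = bspline_basis(t, 0.0, 1.0)
resp, coef = fit_mixture(Y, B)
labels = resp.argmax(axis=1)  # hard cluster assignment per curve
```

Here `coef @ B.T` gives the fitted mean curve of each cluster on the time grid, and `labels` can be compared with `truth` up to a relabelling of the clusters. A mixed effects version along the lines described in the abstract would add a subject-specific smooth deviation to each cluster mean and estimate its variance within the same EM loop.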

Description:

Mini Dissertation (MCom (Advanced Data Analytics))--University of Pretoria 2022.

Files in this item

Name: Bredenkamp_Cluste ...

Size: 8.895Mb

Format: PDF

Description: Mini Dissertation
