Application of mixed-effects modelling and supervised classification techniques to public health data

dc.availability.bitstreamrestricted
dc.check.chapterOfThesisThe whole thesis should be under restricted access due to the confidential data.en
dc.contributor.advisorFitzgerald, Tonyen
dc.contributor.advisorO'Sullivan, Kathleen (Catherine)en
dc.contributor.authorYang, Shuai
dc.date.accessioned2020-06-02T12:26:29Z
dc.date.available2020-06-02T12:26:29Z
dc.date.issued2019-09-28
dc.date.submitted2019-09-28
dc.description.abstractThis thesis consists of two parts. In PART A, we describe the application of mixed-effects modelling to 24 hour blood pressure. The blood pressure follows a 24-h circadian rhythm and the exaggerated morning surge in BP is an independent risk factor for cardiovascular diseases. In this project, the data analysed is from the Mitchelstown study. Morning SBP pattern between 4:00 am and 12:00 am was modelled using a piecewise linear mixed-effects model. Based on the likelihood function, the optimal breakpoint is at 7:30 am. Morning surge was characterised by the slope after the breakpoint. Model results revealed that the average slope between 7:30 am and 12:00 am is 2.47 mmHg/30 min (95\% CI: 2.35-2.59 mmHg/30 min). The Empirical Bayes estimates of subject-specific slopes were compared by age, gender, smoking, BMI, hypertension and diabetics. There were no significant differences in subject-specific morning surge between groups. Additionally, the relationship between chronic kidney disease (CKD) and the morning surge was explored using the multivariable logistic regression allowing for age, gender, smoking, BMI, hypertension and diabetics. Model results revealed that the association between the morning surge and CKD was not statistically significant. In PART B, supervised classification techniques are applied to SEYLE data. This project explores factors associated with drop-out in the SEYLE study. SEYLE study measured the mental health and wellbeing of adolescents with a baseline assessment and follow-up assessments at 3 and 12 months. Participant adherence is important when drawing inferences based on longitudinal data. However, drop-out in longitudinal studies are inevitable especially in adolescents. The primary objective of this project is to identify students with a high probability of drop-out in the SEYLE study using the Irish cohort. Multivariable logistic regression and decision trees (classification tree (CT), conditional inference tree, and evolutionary tree) were developed on a training data set. Factors considered included measures of sociodemographic, risk behaviours, lifestyle, general health, relationship and support, negative life events and psychiatric symptoms. Model performance was assessed on a test data set. Logistic regression analysis revealed that students aged 15/16, with chronic disease, normal anxiety level, high levels of hyperactivity, or lack of regular physical activity were significantly more likely to drop out of the SEYLE study. CT was regraded as the best tree and identified four subgroups based on age, anxiety and depression. Adolescents aged 15/16 without anxiety but with depression were classified as `drop-out' in this CT model. The choice between logistic regression and CT depends on the objective of the user. Logistic regression was the best at discriminating drop-out. However, CT is a simpler model and was marginally better at predicting drop-out.en
dc.description.statusNot peer revieweden
dc.description.versionAccepted Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationYang, Y. 2019. Application of mixed-effects modelling and supervised classification techniques to public health data. MRes Thesis, University College Cork.en
dc.identifier.endpage109en
dc.identifier.urihttps://hdl.handle.net/10468/10104
dc.language.isoenen
dc.publisherUniversity College Corken
dc.rights© 2019, Shuai Yang.en
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/en
dc.subjectLogistic regressionen
dc.subjectClassification tree (CT)en
dc.subjectConditional inference treeen
dc.subjectEvolutionary treeen
dc.subjectMixed-effects modellingen
dc.subjectSupervised classification techniquesen
dc.subjectPublic health dataen
dc.titleApplication of mixed-effects modelling and supervised classification techniques to public health dataen
dc.typeMasters thesis (Research)en
dc.type.qualificationlevelMastersen
dc.type.qualificationnameMRes - Master of Researchen
dc.type.qualificationnameMSc - Master of Scienceen
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
YangS-Master2019.zip
Size:
1.33 MB
Format:
http://www.iana.org/assignments/media-types/application/zip
Description:
Supplementary Data
Loading...
Thumbnail Image
Name:
YangS-Master2019.pdf
Size:
1.66 MB
Format:
Adobe Portable Document Format
Description:
Full Text E-thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
5.2 KB
Format:
Item-specific license agreed upon to submission
Description: