Application of mixed-effects modelling and supervised classification techniques to public health data

Show simple item record

dc.contributor.advisor Fitzgerald, Tony en
dc.contributor.advisor O'Sullivan, Kathleen (Catherine) en
dc.contributor.author Yang, Shuai
dc.date.accessioned 2020-06-02T12:26:29Z
dc.date.available 2020-06-02T12:26:29Z
dc.date.issued 2019-09-28
dc.date.submitted 2019-09-28
dc.identifier.citation Yang, Y. 2019. Application of mixed-effects modelling and supervised classification techniques to public health data. MRes Thesis, University College Cork. en
dc.identifier.endpage 109 en
dc.identifier.uri http://hdl.handle.net/10468/10104
dc.description.abstract This thesis consists of two parts. In PART A, we describe the application of mixed-effects modelling to 24 hour blood pressure. The blood pressure follows a 24-h circadian rhythm and the exaggerated morning surge in BP is an independent risk factor for cardiovascular diseases. In this project, the data analysed is from the Mitchelstown study. Morning SBP pattern between 4:00 am and 12:00 am was modelled using a piecewise linear mixed-effects model. Based on the likelihood function, the optimal breakpoint is at 7:30 am. Morning surge was characterised by the slope after the breakpoint. Model results revealed that the average slope between 7:30 am and 12:00 am is 2.47 mmHg/30 min (95\% CI: 2.35-2.59 mmHg/30 min). The Empirical Bayes estimates of subject-specific slopes were compared by age, gender, smoking, BMI, hypertension and diabetics. There were no significant differences in subject-specific morning surge between groups. Additionally, the relationship between chronic kidney disease (CKD) and the morning surge was explored using the multivariable logistic regression allowing for age, gender, smoking, BMI, hypertension and diabetics. Model results revealed that the association between the morning surge and CKD was not statistically significant. In PART B, supervised classification techniques are applied to SEYLE data. This project explores factors associated with drop-out in the SEYLE study. SEYLE study measured the mental health and wellbeing of adolescents with a baseline assessment and follow-up assessments at 3 and 12 months. Participant adherence is important when drawing inferences based on longitudinal data. However, drop-out in longitudinal studies are inevitable especially in adolescents. The primary objective of this project is to identify students with a high probability of drop-out in the SEYLE study using the Irish cohort. Multivariable logistic regression and decision trees (classification tree (CT), conditional inference tree, and evolutionary tree) were developed on a training data set. Factors considered included measures of sociodemographic, risk behaviours, lifestyle, general health, relationship and support, negative life events and psychiatric symptoms. Model performance was assessed on a test data set. Logistic regression analysis revealed that students aged 15/16, with chronic disease, normal anxiety level, high levels of hyperactivity, or lack of regular physical activity were significantly more likely to drop out of the SEYLE study. CT was regraded as the best tree and identified four subgroups based on age, anxiety and depression. Adolescents aged 15/16 without anxiety but with depression were classified as `drop-out' in this CT model. The choice between logistic regression and CT depends on the objective of the user. Logistic regression was the best at discriminating drop-out. However, CT is a simpler model and was marginally better at predicting drop-out. en
dc.format.mimetype application/pdf en
dc.language.iso en en
dc.publisher University College Cork en
dc.rights © 2019, Shuai Yang. en
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/4.0/ en
dc.subject Logistic regression en
dc.subject Classification tree (CT) en
dc.subject Conditional inference tree en
dc.subject Evolutionary tree en
dc.subject Mixed-effects modelling en
dc.subject Supervised classification techniques en
dc.subject Public health data en
dc.title Application of mixed-effects modelling and supervised classification techniques to public health data en
dc.type Masters thesis (Research) en
dc.type.qualificationlevel Masters en
dc.type.qualificationname MRes - Master of Research en
dc.type.qualificationname MSc - Master of Science en
dc.internal.availability Full text not available en
dc.description.version Accepted Version en
dc.description.status Not peer reviewed en
dc.internal.school Mathematical Sciences en
dc.check.chapterOfThesis The whole thesis should be under restricted access due to the confidential data. en
dc.internal.conferring Summer 2020 en
dc.availability.bitstream restricted


Files in this item

This item appears in the following Collection(s)

Show simple item record

© 2019, Shuai Yang. Except where otherwise noted, this item's license is described as © 2019, Shuai Yang.
This website uses cookies. By using this website, you consent to the use of cookies in accordance with the UCC Privacy and Cookies Statement. For more information about cookies and how you can disable them, visit our Privacy and Cookies statement