Mixture Model Analysis with Misclassified Covariates: Methods and Applications

dc.contributor.advisorShen, Hua
dc.contributor.advisorKopciuk, Karen
dc.contributor.authorZhang, Ruixuan
dc.contributor.committeememberLiu, Juxin
dc.contributor.committeememberLu, Xuewen
dc.date2024-11
dc.date.accessioned2024-10-04T16:00:46Z
dc.date.available2024-10-04T16:00:46Z
dc.date.issued2024-09-20
dc.description.abstractMixture models are crucial for analyzing data with underlying sub-populations. Misclassification introduces discrepancies between observations and true values, which can severely bias parameter estimation, especially for mixture models when subgroups are not easily identifiable. We propose a method to enhance parameter estimation within the framework of mixture models, and mitigate the impact of misclassified covariates by utilizing them as surrogates in the Expectation-Maximization algorithm. Simulations consider both non-differential and differential misclassification with varying sample sizes, sensitivities, specificities, subgroup proportions and misclassified covariate proportions. Results demonstrate robust performance compared to naive or ad hoc approaches ignoring the misclassification issue, even under challenging conditions, such as low sensitivity and specificity for the misclassified covariate, or small sample sizes. For illustration, we apply our method to the 2015 Behavioral Risk Factor Surveillance System data. We conclude with a discussion of the implications of our findings and directions for future research.
dc.identifier.citationZhang, R. (2024). Mixture model analysis with misclassified covariates: methods and applications (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/119928
dc.language.isoen
dc.publisher.facultyScience
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subjectMixture model
dc.subjectMisclassification
dc.subject.classificationBiostatistics
dc.subject.classificationStatistics
dc.titleMixture Model Analysis with Misclassified Covariates: Methods and Applications
dc.typemaster thesis
thesis.degree.disciplineMathematics & Statistics
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2024_zhang_ruixuan.pdf.pdf
Size:
31.6 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: