Covariate Balancing Using Statistical Learning Methods in the Presence of Missingness in Confounders

dc.contributor.advisorShen, Hua
dc.contributor.authorMason, Levi James
dc.contributor.committeememberChekouo, Thierry T.
dc.contributor.committeememberDeardon, Rob
dc.date2019-11
dc.date.accessioned2019-09-24T21:22:56Z
dc.date.available2019-09-24T21:22:56Z
dc.date.issued2019-09-20
dc.description.abstractIn observational studies researchers do not have control over treatment assignment. A consequence of such studies is that an imbalance in observed covariates between the treatment and control groups possibly exists. This imbalance can arise due to the fact that treatment assignment is frequently influenced by observed covariates (Austin, 2011a). As a result, directly comparing the outcomes between these two groups could lead to a biased estimation of the treatment effect (d’Agostino, 1998). The propensity score, defined as the probability of treatment assignment conditional on observed covariates, can be used in matching, stratification, and weighting to balance the observed covariates between the treatment and control groups in order to more accurately estimate the treatment effect (Rosenbaum and Rubin, 1983). This study looked at using statistical learning techniques to estimate the propensity score. The techniques included in this study were: logistic regression, classification and regression trees, pruned classification and regression trees, bagging classification and regression trees, boosted classification and regression trees, and random forests. These estimated propensity scores were then used in linearized propensity score matching, stratification, and inverse probability of treatment weighting using stabilized weights to estimate the treatment effect. Comparisons among these methods were made in a simulation study setting. Both a binary and continuous outcome were analyzed. In addition, a simulation was performed to assess the use of multiple imputation using predictive mean matching when a confounder had data missing at random. Based on the results from the simulation studies it was demonstrated that the most accurate treatment effect estimates came from inverse probability of treatment weighting using stabilized weights where the propensity scores were estimated by logistic regression, random forests, or bagging classification and regression trees. These results were then applied in a retrospective cohort data set with a missing confounder to determine the treatment effect of adjuvant radiation on breast cancer individuals.en_US
dc.identifier.citationMason, L. J. (2019). Covariate Balancing Using Statistical Learning Methods in the Presence of Missingness in Confounders (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/37119
dc.identifier.urihttp://hdl.handle.net/1880/111057
dc.language.isoengen_US
dc.publisher.facultyScienceen_US
dc.publisher.institutionUniversity of Calgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.en_US
dc.subjectStatistical learningen_US
dc.subjectCausal inferenceen_US
dc.subject.classificationEducation--Mathematicsen_US
dc.subject.classificationStatisticsen_US
dc.titleCovariate Balancing Using Statistical Learning Methods in the Presence of Missingness in Confoundersen_US
dc.typemaster thesisen_US
thesis.degree.disciplineMathematics & Statisticsen_US
thesis.degree.grantorUniversity of Calgaryen_US
thesis.degree.nameMaster of Science (MSc)en_US
ucalgary.item.requestcopytrueen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2019_mason_levi.pdf
Size:
590.14 KB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: