A Bayesian Variable Selection Model for Semi-Continuous Response Using Gaussian Process

dc.contributor.advisorChekouo, Thierry
dc.contributor.advisorDeardon, Rob
dc.contributor.authorLipman, Danika
dc.contributor.committeememberWu, Jingjing
dc.contributor.committeememberLu, Xuewen
dc.contributor.committeememberSafo, Sandra
dc.contributor.committeememberChekouo, Thierry
dc.contributor.committeememberDeardon, Rob
dc.date2023-11
dc.date.accessioned2023-09-12T21:38:07Z
dc.date.available2023-09-12T21:38:07Z
dc.date.issued2023-09-06
dc.description.abstractTo my knowledge, there is not a statistical method that can perform Bayesian variable selection in a setting where there is a semi-continuous response with a non-linear relationship to predictor variables. I have developed a two-part model to accommodate a semi-continuous response, that uses Gaussian processes to capture the non-linear relationship between input variables and outcomes. Bayesian variable selection is induced in both parts of the model through the construction of the kernel matrices. I have employed the Nystr\"{o}m approximation for kernel matrices to reduce the computational complexity that occurs when working with kernel matrices and large sample sizes. I perform simulation studies and determine my method is competitive in prediction and variable selection with methods such as elastic net, and other methods that capture non-linearity such as random forests, and gradient boosted trees. In addition, I apply my method to a coronary artery disease (CAD) dataset from the Duke Database for Cardiovascular Disease (DDCD) to determine key gene expression features associated with the CAD index, a measure of CAD severity.
dc.identifier.citationLipman, D. (2023). A Bayesian variable selection model for semi-continuous response using Gaussian process (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/117001
dc.identifier.urihttps://doi.org/10.11575/PRISM/41845
dc.language.isoen
dc.publisher.facultyGraduate Studies
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subjectBayesian
dc.subjectGaussian Process
dc.subjectVariable Selection
dc.subject.classificationStatistics
dc.titleA Bayesian Variable Selection Model for Semi-Continuous Response Using Gaussian Process
dc.typemaster thesis
thesis.degree.disciplineMathematics & Statistics
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2023_lipman_danika.pdf
Size:
2.7 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: