Comparative Study of Variable Selection Methods for Genetic Data

dc.contributor.advisorDavid DS Spade
dc.contributor.committeememberGabriella Pinter
dc.contributor.committeememberJeb Willenbring
dc.creatorKubillus, Anna-Lena
dc.date.accessioned2025-01-16T19:00:26Z
dc.date.available2025-01-16T19:00:26Z
dc.date.issued2023-05-01
dc.description.abstractAssociation studies for genetic data are essential to understand the genetic basis of complex traits. However, analyzing such high-dimensional data needs suitable feature selection methods. For this reason, we compare three methods, Lasso Regression, Bayesian Lasso Regression, and Ridge Regression combined with significance tests, to identify the most effective method for modeling quantitative trait expression in genetic data. All methods are applied to both simulated and real genetic data and evaluated in terms of various measures of model performance, such as the mean absolute error, the mean squared error, the Akaike information criterion, and the Bayesian information criterion. The results show that all methods perform better than the ordinary least squares model on the prediction of future data. Moreover, the Lasso Regression outperforms all methods in terms of execution time and simplicity of the model, which therefore leads to better interpretability and makes it the best choice for association studies. Overall this thesis provides valuable insights into the strength and limitations of existing feature selection methods for modeling quantitative trait expression and highlights its importance in association studies for genetic data.
dc.identifier.urihttp://digital.library.wisc.edu/1793/87686
dc.relation.replaceshttps://dc.uwm.edu/etd/3177
dc.subjectAssociation Study
dc.subjectBayesian Lasso Regression
dc.subjectFeature Selection
dc.subjectGenetic data
dc.subjectLasso Regression
dc.titleComparative Study of Variable Selection Methods for Genetic Data
dc.typethesis
thesis.degree.disciplineMathematics
thesis.degree.grantorUniversity of Wisconsin-Milwaukee
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kubillus_uwm_0263M_13505.pdf
Size:
598.11 KB
Format:
Adobe Portable Document Format
Description:
Main File