CLASSIFICATION AND EXPLANATION OF IRON DEFICIENCY ANEMIA FROM COMPLETE BLOOD COUNT DATA USING MACHINE LEARNING

dc.contributor.advisorSusan McRoy
dc.creatorpullakhandam, siddartha
dc.date.accessioned2025-01-16T19:19:30Z
dc.date.available2025-01-16T19:19:30Z
dc.date.issued2024-05-01
dc.description.abstractAnemia is a global health problem, and over 2 billion people are affected. Although, the major cause of anemia is iron deficiency (IDA), global estimates suggest that only about half of anemia could be attributed to ID. The typical test of anemia involves measurement of hemoglobin using Complete Blood Count (CBC) test, which also gives additional information on blood cell numbers and morphology. The diagnosis of iron deficiency anemia (IDA, both anemic and ID co-exist in a subject) requires additional expensive serum ferritin test. However, blood cell count, and morphology can also be utilized for diagnosis of IDA. The goal of this study therefore is to evaluate and compare methods for training, testing, and explaining machine learning (ML) models using data from routine CBC tests to identify IDA. Here we evaluate data-driven, machine learning methods to classify IDA from more available CBC data using a US-NHANES dataset of over 19,500 instances and explain the results as ranked feature importance. The results show that, using CBC variables, IDA can be classified with a precision-recall area under the curve (PR AUC) of 0.88 and recall/sensitivity of 0.98 and 0.84 for the original dataset and an unseen one, collected in Kenya respectively. The explanations indicate which aspects of the CBC results most contribute to a diagnosis, revealing that optimization made only minor changes to the model and that the features used remained consistent with professional practice, suggesting that the approach would be acceptable to health professionals.
dc.identifier.urihttp://digital.library.wisc.edu/1793/88053
dc.relation.replaceshttps://dc.uwm.edu/etd/3508
dc.titleCLASSIFICATION AND EXPLANATION OF IRON DEFICIENCY ANEMIA FROM COMPLETE BLOOD COUNT DATA USING MACHINE LEARNING
dc.typethesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Wisconsin-Milwaukee
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
pullakhandam_uwm_0263M_13791.pdf
Size:
1.11 MB
Format:
Adobe Portable Document Format
Description:
Main File