Leveraging Biomedical Ontological Knowledge to Improve Clinical Term Embeddings

dc.contributor.advisorRohit Kate
dc.contributor.committeememberRohit Kate
dc.contributor.committeememberJake Luo
dc.contributor.committeememberTian Zhao
dc.contributor.committeememberJun Zhang
dc.contributor.committeememberZeyun Yu
dc.creatorAbuzahra, Fuad Hatem
dc.date.accessioned2025-01-16T18:57:18Z
dc.date.issued2023-05-01
dc.description.abstractABSTRACT Leveraging Biomedical Ontological Knowledge to Improve Clinical Term Embeddings by Fuad Abu Zahra The University of Wisconsin-Milwaukee, 2023 Under the Supervision of Dr. Rohit J. Kate This research is on obtaining and using word embeddings for natural language processing tasks in the biomedical domain. Word embeddings are vector representations of words commonly obtained from large text corpora. This research leverages the biomedical ontology of SNOMED CT as an alternate source for obtaining embeddings for clinical terms. The existing graph-based methods can only give embeddings for concepts (i.e., nodes of the graph) of an ontology, hence we developed a novel method to obtain embeddings for clinical words and terms from their concept embeddings. These embeddings were evaluated on benchmark datasets of clinical term similarity and on the clinical term normalization task and were found to work better than corpus-based embeddings. However, unlike corpus-based embeddings, the embeddings obtained from SNOMED CT do not incorporate linguistic knowledge as the method was not trained on text data. Therefore, we also developed two new methods to combine the two resources of embeddings – by generating a synthetic corpus out of SNOMED CT ontology and using it for additional training using corpus-based methods, and by fine-tuning a corpus-based system on SNOMED CT concept embeddings. The evaluation showed that the combined embeddings obtained using these methods perform better than either type of embeddings.
dc.description.embargo2024-06-05
dc.embargo.liftdate2024-06-05
dc.identifier.urihttp://digital.library.wisc.edu/1793/87619
dc.relation.replaceshttps://dc.uwm.edu/etd/3116
dc.subjectBidirectional Encoder Representations from Transformers (BERT)
dc.subjectClinical Ontology
dc.subjectMedical Ontology
dc.subjectOntology Embeddings
dc.subjectSNOMED CT
dc.subjectWord Embeddings
dc.titleLeveraging Biomedical Ontological Knowledge to Improve Clinical Term Embeddings
dc.typedissertation
thesis.degree.disciplineEngineering
thesis.degree.grantorUniversity of Wisconsin-Milwaukee
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Abuzahra_uwm_0263D_13468.pdf
Size:
1.35 MB
Format:
Adobe Portable Document Format
Description:
Main File