Disease Name Extraction from Clinical Text Using Conditional Random Fields

dc.contributor.advisorRohit J. Kate
dc.contributor.committeememberSusan McRoy
dc.contributor.committeememberRashmi Prasad
dc.creatorGhiasvand, Omid
dc.date.accessioned2025-01-16T19:36:50Z
dc.date.available2025-01-16T19:36:50Z
dc.date.issued2014-05-01
dc.description.abstractThe aim of the research done in this thesis was to extract disease and disorder names from clinical texts. We utilized Conditional Random Fields (CRF) as the main method to label diseases and disorders in clinical sentences. We used some other tools such as MetaMap and Stanford Core NLP tool to extract some crucial features. MetaMap tool was used to identify names of diseases/disorders that are already in UMLS Metathesaurus. Some other important features such as lemmatized versions of words, and POS tags were extracted using the Stanford Core NLP tool. Some more features were extracted directly from UMLS Metathesaurus, including semantic types of words. We participated in the SemEval 2014 competition's Task 7 and used its provided data to train and evaluate our system. Training data contained 199 clinical texts, development data contained 99 clinical texts, and the test data contained 133 clinical texts, these included discharge summaries, echocardiogram, radiology, and ECG reports. We obtained competitive results on the disease/disorder name extraction task. We found through ablation study that while all features contributed, MetaMap matches, POS tags, and previous and next words were the most effective features.
dc.identifier.urihttp://digital.library.wisc.edu/1793/88350
dc.relation.replaceshttps://dc.uwm.edu/etd/495
dc.subjectClinical Text
dc.subjectConditional Random Fields
dc.subjectMetamap
dc.subjectNamed Entity Recognition
dc.subjectNatural Language Processing
dc.subjectUMLS
dc.titleDisease Name Extraction from Clinical Text Using Conditional Random Fields
dc.typethesis
thesis.degree.disciplineEngineering
thesis.degree.grantorUniversity of Wisconsin-Milwaukee
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ghiasvand_uwm_0263m_10703.pdf
Size:
685.33 KB
Format:
Adobe Portable Document Format
Description:
Main File