Determining the syntactic structure of medical terms in clinical notes

Bridget T. McInnes; Ted Pedersen; Serguei V Pakhomov

doi:10.3115/1572392.1572395

Pharmaceutical Care and Health Systems

Research output: Contribution to conference › Paper › peer-review

6 Scopus citations

Abstract

This paper demonstrates a method for determining the syntactic structure of medical terms. We use a model-fitting method based on the Log Likelihood Ratio to classify three-word medical terms as right or left-branching. We validate this method by computing the agreement between the classification produced by the method and manually annotated classifications. The results show an agreement of 75% - 83%. This method may be used effectively to enable a wide range of applications that depend on the semantic interpretation of medical terms including automatic mapping of terms to standardized vocabularies and induction of terminologies from unstructured medical text.
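
As a rough illustration of the idea described in the abstract, the Python sketch below compares two independence models for a three-word term using the log-likelihood ratio statistic G². It is not the authors' implementation. The function names, the toy corpus, the tie-breaking rule, and the decision to read the better-fitting model (lower G²) as the bracketing are all assumptions made for this example.

```python
import math
from collections import Counter


def g2(observed, expected):
    """Log-likelihood ratio statistic: 2 * sum(obs * ln(obs / exp))."""
    return 2.0 * sum(o * math.log(o / expected[cell])
                     for cell, o in observed.items() if o > 0)


def classify_branching(trigrams, term):
    """Return (label, g2_right, g2_left) for a three-word term.

    Assumed models (hypothetical reading of the abstract):
      right-branching [w1 [w2 w3]]: w1 is independent of the pair (w2, w3)
      left-branching  [[w1 w2] w3]: the pair (w1, w2) is independent of w3
    The model with the lower G² (better fit) gives the label.
    """
    w1, w2, w3 = term
    # 2x2x2 table: does each position match the target word or not?
    obs = Counter((a == w1, b == w2, c == w3) for a, b, c in trigrams)
    n = sum(obs.values())

    # Marginal counts needed by the two models.
    m1, m23, m12, m3 = Counter(), Counter(), Counter(), Counter()
    for (i, j, k), count in obs.items():
        m1[i] += count
        m23[(j, k)] += count
        m12[(i, j)] += count
        m3[k] += count

    # Expected counts under each model, for the observed cells only
    # (cells with zero observations contribute nothing to G²).
    exp_right = {(i, j, k): m1[i] * m23[(j, k)] / n for (i, j, k) in obs}
    exp_left = {(i, j, k): m12[(i, j)] * m3[k] / n for (i, j, k) in obs}

    g2_right, g2_left = g2(obs, exp_right), g2(obs, exp_left)
    # Assumption: ties fall to "left"; this choice is arbitrary.
    label = "right" if g2_right < g2_left else "left"
    return label, g2_right, g2_left


# Toy corpus (invented): "small bowel" always occurs as a unit,
# while the third word varies independently of it.
toy_trigrams = (
    [("small", "bowel", "obstruction")] * 4
    + [("small", "bowel", "resection")] * 4
    + [("large", "mass", "obstruction")] * 4
    + [("large", "mass", "resection")] * 4
)
label, g2_right, g2_left = classify_branching(
    toy_trigrams, ("small", "bowel", "obstruction"))
print(label)  # left
```

In the toy corpus the left-branching model reproduces the observed counts exactly, so its G² is zero and the term is labelled left-branching. Real clinical text would need smoothing or a policy for sparse counts, which this sketch does not address.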

Original language	English (US)
Pages	9-16
Number of pages	8
DOIs	https://doi.org/10.3115/1572392.1572395
State	Published - 2007
Event	ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing, BioNLP 2007 - Prague, Czech Republic Duration: Jun 29 2007 → …

Other

Other	ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing, BioNLP 2007
Country/Territory	Czech Republic
City	Prague
Period	6/29/07 → …

Bibliographical note

Funding Information:
This research was supported in part by the NLM Training Grant in Medical Informatics (T15 LM07041-19). Ted Pedersen’s participation in this project was supported by the NSF Faculty Early Career Development Award (#0092784).

Publisher Copyright:
© 2007 Association for Computational Linguistics.

Cite this

@conference{31cf6c204f57410ab947753abe61e5a7,
title = "Determining the syntactic structure of medical terms in clinical notes",
abstract = "This paper demonstrates a method for determining the syntactic structure of medical terms. We use a model-fitting method based on the Log Likelihood Ratio to classify three-word medical terms as right or left-branching. We validate this method by computing the agreement between the classification produced by the method and manually annotated classifications. The results show an agreement of 75\% - 83\%. This method may be used effectively to enable a wide range of applications that depend on the semantic interpretation of medical terms including automatic mapping of terms to standardized vocabularies and induction of terminologies from unstructured medical text.",
author = "McInnes, {Bridget T.} and Pedersen, Ted and Pakhomov, {Serguei V}",
note = "Funding Information: This research was supported in part by the NLM Training Grant in Medical Informatics (T15 LM07041-19). Ted Pedersen{\textquoteright}s participation in this project was supported by the NSF Faculty Early Career Development Award (\#0092784). Publisher Copyright: {\textcopyright} 2007 Association for Computational Linguistics.; ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing, BioNLP 2007 ; Conference date: 29-06-2007",
year = "2007",
doi = "10.3115/1572392.1572395",
language = "English (US)",
pages = "9--16",
}

TY  - CONF
T1  - Determining the syntactic structure of medical terms in clinical notes
AU  - McInnes, Bridget T.
AU  - Pedersen, Ted
AU  - Pakhomov, Serguei V
N1  - Funding Information: This research was supported in part by the NLM Training Grant in Medical Informatics (T15 LM07041-19). Ted Pedersen’s participation in this project was supported by the NSF Faculty Early Career Development Award (#0092784). Publisher Copyright: © 2007 Association for Computational Linguistics.
PY  - 2007
Y1  - 2007
N2  - This paper demonstrates a method for determining the syntactic structure of medical terms. We use a model-fitting method based on the Log Likelihood Ratio to classify three-word medical terms as right or left-branching. We validate this method by computing the agreement between the classification produced by the method and manually annotated classifications. The results show an agreement of 75% - 83%. This method may be used effectively to enable a wide range of applications that depend on the semantic interpretation of medical terms including automatic mapping of terms to standardized vocabularies and induction of terminologies from unstructured medical text.
AB  - This paper demonstrates a method for determining the syntactic structure of medical terms. We use a model-fitting method based on the Log Likelihood Ratio to classify three-word medical terms as right or left-branching. We validate this method by computing the agreement between the classification produced by the method and manually annotated classifications. The results show an agreement of 75% - 83%. This method may be used effectively to enable a wide range of applications that depend on the semantic interpretation of medical terms including automatic mapping of terms to standardized vocabularies and induction of terminologies from unstructured medical text.
UR  - http://www.scopus.com/inward/record.url?scp=84886757299&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84886757299&partnerID=8YFLogxK
U2  - 10.3115/1572392.1572395
DO  - 10.3115/1572392.1572395
M3  - Paper
AN  - SCOPUS:84886757299
SP  - 9
EP  - 16
T2  - ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing, BioNLP 2007
Y2  - 29 June 2007
ER  -
