Linguistics Research - Human Language, Phonetics, Syntax, Phonology

Linguistics Research Today is a free monthly online journal that collates and summarizes the latest research in linguistics, including human language, phonetics, syntax, and phonology.





Measures of semantic similarity and relatedness in the biomedical domain.

Pedersen T, Pakhomov SV, Patwardhan S, Chute CG

Department of Computer Science, 1114 Kirby Drive, University of Minnesota, Duluth, MN 55812, USA. tpederse@d.umn.edu

Measures of semantic similarity between concepts are widely used in Natural Language Processing. In this article, we show how six existing domain-independent measures can be adapted to the biomedical domain. These measures were originally based on WordNet, an English lexical database of concepts and relations. In this research, we adapt these measures to the SNOMED-CT ontology of medical concepts. The measures include two path-based measures and three measures that augment path-based measures with information content statistics from corpora. We also derive a context vector measure based on medical corpora that can be used as a measure of semantic relatedness. These six measures are evaluated against a newly created test bed of 30 medical concept pairs scored by three physicians and nine medical coders. We find that the medical coders and physicians differ in their ratings, and that the context vector measure correlates most closely with the physicians, while the path-based measures and one of the information content measures correlate most closely with the medical coders. We conclude that there is a role both for more flexible measures of relatedness based on information derived from corpora and for measures that rely on existing ontological structures.
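To make the two ontology-based families in the abstract concrete, here is a minimal Python sketch, not the authors' implementation. It assumes a tiny made-up is-a hierarchy and made-up corpus counts (the concept names, `PARENT`, and `COUNTS` are all hypothetical and are not taken from SNOMED-CT). The path-based score is the inverse of the number of nodes on the shortest is-a path. The information content score uses Lin's formulation as one familiar example of that family, since the abstract does not name the specific measures.

```python
import math

# Hypothetical toy is-a hierarchy (child -> parent); not real SNOMED-CT content.
PARENT = {
    "disorder": None,
    "heart_disease": "disorder",
    "lung_disease": "disorder",
    "myocardial_infarction": "heart_disease",
    "heart_failure": "heart_disease",
    "pneumonia": "lung_disease",
}

# Made-up corpus counts of direct mentions of each concept.
COUNTS = {
    "disorder": 10,
    "heart_disease": 20,
    "lung_disease": 10,
    "myocardial_infarction": 30,
    "heart_failure": 20,
    "pneumonia": 10,
}


def ancestors(concept):
    """Return the chain from the concept up to the root, concept first."""
    chain = []
    while concept is not None:
        chain.append(concept)
        concept = PARENT[concept]
    return chain


def lowest_common_subsumer(a, b):
    """Return the most specific concept that subsumes both a and b."""
    seen = set(ancestors(a))
    for node in ancestors(b):
        if node in seen:
            return node
    raise ValueError("concepts share no ancestor")


def path_similarity(a, b):
    """Inverse of the number of nodes on the shortest is-a path."""
    lcs = lowest_common_subsumer(a, b)
    edges = ancestors(a).index(lcs) + ancestors(b).index(lcs)
    return 1.0 / (edges + 1)


def information_content(concept):
    """-log of the probability of the concept or anything it subsumes."""
    total = sum(COUNTS.values())
    subsumed = sum(n for c, n in COUNTS.items() if concept in ancestors(c))
    return -math.log(subsumed / total)


def lin_similarity(a, b):
    """Information content of the shared subsumer, scaled by both concepts."""
    lcs = lowest_common_subsumer(a, b)
    denom = information_content(a) + information_content(b)
    # Both concepts are the root (zero information content): report 0.0.
    return 2 * information_content(lcs) / denom if denom else 0.0


if __name__ == "__main__":
    for pair in [("myocardial_infarction", "heart_failure"),
                 ("myocardial_infarction", "pneumonia")]:
        print(pair, path_similarity(*pair), lin_similarity(*pair))
```

In this toy data the two heart conditions share a specific parent, so both scores rank them above the heart/lung pair, whose only shared subsumer is the root and therefore carries no information content.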

Published 16 May 2007 in J Biomed Inform, 40(3): 288-99.
Full-text of this article is available online (may require subscription).


© 2005-2008 Linguistics Research Today. All Rights Reserved.


