Semantic Similarity and Relatedness

The goal of this research is to explore semantic similarity and relatedness measures to automatically determine the similarity or relatedness between biomedical and clinical concepts.

Software

UMLS-Interface This package provides a Perl interface to the Unified Medical Language System (UMLS). The UMLS is a knowledge representation framework encoded designed to support broad scope biomedical research queries. There exists three major sources in the UMLS. The Metathesaurus which is a taxonomy of medical concepts, the Semantic Network which categorizes concepts in the Metathesaurus, and the SPECIALIST Lexicon which contains a list of biomedical and general English terms used in the biomedical domain. The UMLS-Interface package is set up to access the Metathesaurus and the Semantic Network present in a mysql database.

UMLS-Similarity This package is a suite of Perl modules that implement a number of semantic similarity measures. The measures use the UMLS-Interface module to access the UMLS to generate similarity scores between concepts. Currently, this package includes programs that implement the similarity measures described by Leacock & Chodorow (1998), Wu & Palmer (1994), Nguyen & Al-Mubaid (2006), Rada, et. al. (1989), Jiang & Conrath (1997), Resnik (1995) and Lin (1998), and the relatedness measures proposed by Banerjee & Pedersen (2002), Patwardhan (2003).

Publications

Tutorials and Demos


Datasets


Papers that have used UMLS::Similarity