Patent Number: 6,253,170

Title: Bootstrapping sense characterizations of occurrences of polysemous words in dictionary representations of a lexical knowledge base in computer memory

Abstract: The present invention is directed to characterizing the sense of an occurrence of a polysemous word in a representation of a dictionary. In a preferred embodiment, the representation of the dictionary is made up of a plurality of text segments containing word occurrences having a word sense characterization and word occurrences not having a word sense characterization. The embodiment first selects a plurality of the dictionary text segments that each contain a first word. The embodiment then identifies from among the selected text segments a first and a second occurrence of a second word. The identified second occurrence of the second word has a word sense characterization. The embodiment then attributes to the first occurrence of the second word sense characterization of the second occurrence of the second word.

Inventors: Dolan; William B. (Redmond, WA)

Assignee: Microsoft Corporation

International Classification: G06F 17/27 (20060101); G06F 017/28 (); G06F 017/21 (); G06F 017/30 ()

Expiration Date: 06/26/2018