Patent Number: 6,253,169

Title: Method for improvement accuracy of decision tree based text categorization

Abstract: A text categorization method automatically classifies electronic documents by developing a single pooled dictionary of words for a sample set of documents, and then generating a decision tree model, based on the pooled dictionary, for classifying new documents. Adaptive resampling techniques are applied to improve the accuracy of the decision tree model.
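Illustrative sketch: the three steps named in the abstract (a pooled dictionary, a decision tree built on it, and adaptive resampling) might be arranged roughly as below. This is a minimal sketch and not the patented procedure. All function names, the toy documents, the binary word-presence features, the Gini split criterion, the depth limit, the resampling weight 1 + e^3 (where e counts an example's past misclassifications), and the majority vote are assumptions made for illustration; none of them is stated in this record.

```python
import random
from collections import Counter

def build_pooled_dictionary(docs, size):
    """Pool word counts over ALL sample documents; keep the `size` most frequent."""
    counts = Counter(w for d in docs for w in d.lower().split())
    return [w for w, _ in counts.most_common(size)]

def featurize(doc, dictionary):
    """Binary vector: 1 if the dictionary word occurs in the document."""
    words = set(doc.lower().split())
    return [1 if w in words else 0 for w in dictionary]

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values()) if n else 0.0

def build_tree(X, y, depth=0, max_depth=3):
    """Tiny decision tree: a leaf is a label, a node is (feature, absent, present)."""
    majority = Counter(y).most_common(1)[0][0]
    if depth == max_depth or len(set(y)) == 1:
        return majority
    best = None
    for f in range(len(X[0])):
        left = [lab for row, lab in zip(X, y) if row[f] == 0]
        right = [lab for row, lab in zip(X, y) if row[f] == 1]
        if not left or not right:
            continue  # this word does not split the sample
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
        if best is None or score < best[0]:
            best = (score, f)
    if best is None:
        return majority
    f = best[1]
    absent = [(r, l) for r, l in zip(X, y) if r[f] == 0]
    present = [(r, l) for r, l in zip(X, y) if r[f] == 1]
    return (f,
            build_tree([r for r, _ in absent], [l for _, l in absent], depth + 1, max_depth),
            build_tree([r for r, _ in present], [l for _, l in present], depth + 1, max_depth))

def predict_tree(tree, row):
    while isinstance(tree, tuple):
        f, absent, present = tree
        tree = present if row[f] else absent
    return tree

def train_adaptive(X, y, rounds=5, seed=0):
    """Adaptive resampling (assumed scheme): examples that earlier trees got
    wrong are drawn more often when sampling the next training set."""
    rng = random.Random(seed)
    n = len(X)
    errors = [0] * n
    trees = []
    for _ in range(rounds):
        weights = [1 + e ** 3 for e in errors]  # assumed weighting, not from the record
        idx = rng.choices(range(n), weights=weights, k=n)
        tree = build_tree([X[i] for i in idx], [y[i] for i in idx])
        trees.append(tree)
        for i in range(n):
            if predict_tree(tree, X[i]) != y[i]:
                errors[i] += 1
    return trees

def classify(doc, dictionary, trees):
    """Majority vote of all trees on a new document."""
    row = featurize(doc, dictionary)
    return Counter(predict_tree(t, row) for t in trees).most_common(1)[0][0]

# Toy data, invented for this example only.
docs = ["stock market shares rise", "market shares fall sharply",
        "team wins football match", "football team loses match"]
labels = ["finance", "finance", "sports", "sports"]

dictionary = build_pooled_dictionary(docs, size=5)
X = [featurize(d, dictionary) for d in docs]
trees = train_adaptive(X, labels, rounds=5, seed=0)
print(classify("football match tonight", dictionary, trees))
```

The dictionary here is built once from the whole sample rather than once per category, which is how the sketch reads the abstract's phrase "single pooled dictionary".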

Inventors: Apte; Chidanand (Chappaqua, NY), Damerau; Frederick J. (North Salem, NY), Weiss; Sholom M. (Highland Park, NJ)

Assignee: International Business Machines Corporation

International Classification: G06F 17/30 (20060101); G06E 017/27 (); G06E 021/00 ()

Expiration Date: 06/26/2018