Patent Number: 6,298,357

Title: Structure extraction on electronic documents

Abstract: A method and apparatus for extracting structure information from an unstructured electronic document are described. The method includes the step of identifying a structural type for each instance in the electronic document by examining the presentation attributes associated with that instance. Examples of the presentation attributes examined include numbering formats, indentations, and font sizes and weights.
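
Illustrative sketch (not the patented method): the abstract describes assigning a structural type to each instance from its presentation attributes. The Python below is a minimal, hypothetical illustration of that general idea only. The Line record, the classify function, the regular expressions, the 10-point body size, and the type names ("heading1", "list_item", and so on) are all assumptions made for this example and do not come from the patent.

```python
import re
from dataclasses import dataclass


@dataclass
class Line:
    """One text instance plus its presentation attributes (hypothetical model)."""
    text: str
    indent: float     # left indentation, assumed to be in points
    font_size: float  # font size in points
    bold: bool        # True when the font weight is bold


# Assumed numbering formats: "1.", "1)", "1.2", "1.2.3" followed by whitespace.
NUMBER_PREFIX = re.compile(r"^(\d+(?:\.\d+)*)[.)]?\s+")
# Assumed list markers: "-", "*", a bullet character, or "a)" / "(a)".
BULLET_PREFIX = re.compile(r"^(?:[-*\u2022]|\(?[a-z]\))\s+")


def classify(line: Line, body_size: float = 10.0) -> str:
    """Guess a structural type from numbering, indentation, size, and weight."""
    numbered = NUMBER_PREFIX.match(line.text)

    # Larger or bolder text than the body is treated as a heading.
    if line.font_size > body_size or line.bold:
        if numbered:
            # "1" -> level 1, "1.2" -> level 2, "1.2.3" -> level 3.
            depth = numbered.group(1).count(".") + 1
            return f"heading{depth}"
        return "heading1"

    # Body-sized text with a number or bullet prefix is treated as a list item;
    # any indentation marks it as nested.
    if numbered or BULLET_PREFIX.match(line.text):
        return "list_item" if line.indent == 0 else "nested_list_item"

    return "paragraph"


if __name__ == "__main__":
    sample = [
        Line("1. Introduction", 0, 14.0, True),
        Line("1.2 Scope", 0, 12.0, True),
        Line("a) first item", 18, 10.0, False),
        Line("Plain body text.", 0, 10.0, False),
    ]
    for entry in sample:
        print(classify(entry), "|", entry.text)
```

A rule-based classifier like this is only one way to act on presentation attributes; the thresholds and patterns would need tuning for any real document set.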

Inventors: Wexler; Michael C. (Santa Clara, CA), Young; Jeffrey E. (San Jose, CA)

Assignee: Adobe Systems Incorporated

International Classification: G06F 17/27 (20060101); G06F 015/00

Expiration Date: 10/02/2018