Patent Number: 8,818,918

Title: Determining the importance of data items and their characteristics using centrality measures

Abstract: Computer-implemented methods, systems, and articles of manufacture for determining the importance of a data item. A method includes: (a) receiving a node graph; (b) approximating a number of neighbor nodes of a node; and (c) calculating an average shortest path length of the node to the remaining nodes using the approximation step, where this calculation demonstrates the importance of a data item represented by the node. Another method includes: (a) receiving a node graph; (b) building a decomposed line graph of the node graph; (c) calculating stationary probabilities of incident edges of a node graph node in the decomposed line graph; and (d) calculating a summation of the stationary probabilities of the incident edges associated with the node, where the summation demonstrates the importance of a data item represented by the node. Both methods have at least one step carried out using a computer device.

Inventors: Lin; Ching-Yung (Hawthorne, NY), Tong; Hanghang (Hawthorne, NY), Sun; Jimeng (Hawthorne, NY), Papadimitriou; Spyridon (White Plains, NY), Kang; U (Pittsburgh, PA)

Assignee: International Business Machines Corporation

International Classification: G06F 17/30 (20060101); G06F 15/18 (20060101)

Expiration Date: 8/26/12018
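
Illustrative sketch: the two methods summarized in the abstract can be pictured with a small Python example. This is a minimal sketch under stated assumptions, not the patented implementation. It assumes a connected, undirected graph with at least one edge, stored as an adjacency dictionary. The function names, the restart parameter, and the iteration count are hypothetical. The first function counts neighbors within each hop exactly with a breadth-first search, where the abstract describes an approximation step. The second never materializes the line graph: it routes probability mass through the nodes in two steps, which is one plausible reading of "decomposed".

```python
from collections import deque


def average_shortest_path_length(adj, source):
    """Average hop distance from `source` to every other reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    max_h = max(dist.values())
    # within[h] = number of nodes within h hops of `source`.
    # Assumption: counted exactly here; the abstract approximates this count.
    within = [sum(1 for d in dist.values() if d <= h) for h in range(max_h + 1)]
    # Nodes first reached at hop h contribute h each to the total distance.
    total = sum(h * (within[h] - within[h - 1]) for h in range(1, max_h + 1))
    reached = len(dist) - 1
    return total / reached if reached else 0.0


def line_graph_scores(adj, restart=0.15, iterations=100):
    """Sum, per node, the stationary probabilities of its incident edges."""
    # Each undirected edge appears as two directed edges (u, v) and (v, u).
    edges = [(u, v) for u in adj for v in adj[u]]
    m = len(edges)
    prob = {e: 1.0 / m for e in edges}
    for _ in range(iterations):
        # Step 1: collect the probability mass arriving at each node.
        arriving = {u: 0.0 for u in adj}
        for (u, v), p in prob.items():
            arriving[v] += p
        # Step 2: split that mass evenly over the edges leaving the node,
        # mixed with a uniform restart (hypothetical damping term).
        prob = {
            (v, w): restart / m + (1 - restart) * arriving[v] / len(adj[v])
            for (v, w) in edges
        }
    scores = {u: 0.0 for u in adj}
    for (u, v), p in prob.items():
        scores[u] += p
        scores[v] += p
    return scores


# Path graph 0 - 1 - 2: node 1 sits in the middle.
graph = {0: [1], 1: [0, 2], 2: [1]}
closeness = {n: average_shortest_path_length(graph, n) for n in graph}
scores = line_graph_scores(graph)
```

On this three-node path, the middle node has the smallest average shortest path length and the largest edge-probability sum, so both measures rank it as the most important node.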