Patent Number: 6,311,183

Title: Method for finding large numbers of keywords in continuous text streams

Abstract: A method of full-text scanning for matches in a large dictionary of keywords is described, suitable for SDI (selective dissemination of information). The method is applicable to large dictionaries (hundreds of thousands of entries) and to arbitrary byte sequences for both patterns and sample streams. The approach employs Boyer-Moore-Horspool skipping, extended to pattern collections and digrams, followed by an n-gram hash test, which also identifies a subset of feasible keywords for conventional pattern matching at each location of a putative match.
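
The pipeline named in the abstract (digram-based skipping over a pattern collection, then an n-gram hash test that selects candidate keywords for conventional matching) can be illustrated with a minimal sketch. This is not the patented method itself: the function names `build_tables` and `scan`, the default `n=3`, and the use of a Python dict keyed on each keyword's first n bytes as the "hash test" are all assumptions made for illustration, and the skip table follows the general set-wise Horspool idea rather than any detail taken from the patent's claims.

```python
from collections import defaultdict


def build_tables(keywords, n=3):
    """Build a digram skip table and an n-gram bucket index (illustrative only)."""
    m = min(len(k) for k in keywords)      # window length = shortest keyword
    if m < 2:
        raise ValueError("every keyword needs at least 2 bytes for digram skipping")
    n = min(n, m)                          # the n-gram must fit inside the window
    default = m - 1                        # safe skip for a digram seen in no keyword
    shift = {}
    buckets = defaultdict(list)
    for kw in keywords:
        # Only the first m bytes of each keyword take part in skipping.
        for end in range(1, m):
            digram = kw[end - 1:end + 1]
            # Distance from this digram to the end of the m-byte window.
            shift[digram] = min(shift.get(digram, default), m - 1 - end)
        # Assumed hash test: bucket keywords by their first n bytes.
        buckets[kw[:n]].append(kw)
    return m, n, shift, buckets


def scan(text, keywords, n=3):
    """Return (offset, keyword) pairs for every keyword occurrence in text."""
    m, n, shift, buckets = build_tables(keywords, n)
    hits = []
    i = m - 1                              # index of the last byte of the window
    while i < len(text):
        skip = shift.get(text[i - 1:i + 1], m - 1)
        if skip:
            i += skip                      # no keyword window can end before i + skip
            continue
        start = i - m + 1                  # putative match location
        # The n-gram lookup narrows the dictionary to a few feasible keywords,
        # which are then verified by conventional comparison.
        for kw in buckets.get(text[start:start + n], ()):
            if text.startswith(kw, start):
                hits.append((start, kw))
        i += 1
    return hits
```

Calling `scan(b"the quick brown fox jumps over the lazy dog", [b"quick", b"brown", b"lazy", b"the l"])` reports each keyword with its byte offset. Because the inputs are `bytes`, the sketch works on arbitrary byte sequences, as the abstract requires; a dictionary of hundreds of thousands of entries would need a more compact table layout than Python dicts.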

Inventors: Cohen; Jonathan Drew (Hanover, MD)

Assignee: The United States of America as represented by the Director of National Security Agency

International Classification: G06F 17/30 (20060101)

Expiration Date: 10/30/2018