Patent Number: 7,769,742

Title: Web crawler scheduler that utilizes sitemaps from websites

Abstract: Methods and systems for a web crawler scheduler that utilizes sitemaps from websites are described. A web crawler scheduling system receives a notification from a website or web server. In response to the notification, the system accesses one or more sitemap(s) for documents associated with the website or web server. The system schedules crawls of the documents based on information identified from the sitemaps. The system crawls at least a subset of the documents scheduled for crawling.
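The flow in the abstract (notification, sitemap access, scheduling, crawling a subset) can be sketched roughly as follows. This is a minimal illustration and not the patented method: it assumes the sitemap follows the public sitemaps.org XML format, and the ordering rule (higher priority first, then more recent lastmod), the crawl budget, and the names parse_sitemap, schedule_crawls, handle_notification and fetch are all hypothetical choices made for this example.

```python
import xml.etree.ElementTree as ET

# XML namespace used by the public sitemaps.org protocol.
NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def parse_sitemap(xml_text):
    """Return a list of (url, priority, lastmod) tuples from sitemap XML."""
    root = ET.fromstring(xml_text)
    entries = []
    for url in root.findall("sm:url", NS):
        loc = url.findtext("sm:loc", default="", namespaces=NS).strip()
        if not loc:
            continue  # skip entries without a location
        # The sitemaps.org protocol gives 0.5 as the default priority.
        priority = float(url.findtext("sm:priority", default="0.5", namespaces=NS))
        lastmod = url.findtext("sm:lastmod", default="", namespaces=NS).strip()
        entries.append((loc, priority, lastmod))
    return entries


def schedule_crawls(entries):
    """Order entries by priority, then by lastmod, both descending.

    Assumed rule for illustration only. ISO 8601 dates sort correctly
    as plain strings; a missing lastmod sorts last within a priority.
    """
    return sorted(entries, key=lambda e: (e[1], e[2]), reverse=True)


def handle_notification(sitemap_xml, fetch, budget):
    """On a notification, schedule the sitemap's documents and crawl a subset.

    `fetch` is a caller-supplied callable (hypothetical) that crawls one URL;
    `budget` caps how many scheduled documents are actually crawled.
    """
    crawled = []
    for loc, _priority, _lastmod in schedule_crawls(parse_sitemap(sitemap_xml))[:budget]:
        fetch(loc)
        crawled.append(loc)
    return crawled


SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://example.com/a</loc><lastmod>2005-01-01</lastmod><priority>0.8</priority></url>
  <url><loc>http://example.com/b</loc><lastmod>2005-06-01</lastmod><priority>0.8</priority></url>
  <url><loc>http://example.com/c</loc></url>
</urlset>"""

if __name__ == "__main__":
    # A stand-in fetcher that only prints; a real crawler would issue HTTP requests.
    print(handle_notification(SAMPLE, fetch=print, budget=2))
```

With the sample sitemap and a budget of 2, the sketch crawls /b and then /a, leaving /c scheduled but not crawled, which mirrors the abstract's "at least a subset" wording.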

Inventors: Brawer; Sascha B. (Berne, CH), Ibel; Maximilian (Pfaeffikon, CH), Keller; Ralph Michael (Zumikon, CH), Shivakumar; Narayanan (Kirkland, WA)

Assignee: Google Inc.

International Classification: G06F 7/00 (20060101)

Expiration Date: 8/03/12018