I was looking for search engines and found an open source engine called "Nutch" (Java, Tomcat), which is sponsored by Yahoo! Research and the Internet Archive(?).
Interestingly, the Apache Lucene site advertising Nutch uses Google to search the site locally...