Google has upgraded its Search Appliance to index more documents, do so more intelligently and perform more queries per minute. The new version includes improved security and allows for collections of indexed documents to be partitioned.
"We launched this product quietly in 2002 and it has grown nicely and become a successful business for Google,"said Dave Girouard, Google's enterprise unit general manager. "This is our first major new upgrade of the product." The appliance is aimed at companies, educational institutions and government agencies that want to make their sites searchable using Google technology.
Providing good search functionality for websites is complicated for many organisations, said David Schatsky, an analyst with Jupiter Research. "The difficulties are related to various elements, such as technology, operational processes and user understanding. As a result, many companies feel they don't have the internal competencies to make search an effective tool and are thus attracted by the brand name and reputation of Google in a box."
In terms of performance enhancements, the new version can index as many as 1.5 million documents, which is five times as many as the first version, and execute 300 queries per minute, also a five-fold improvement.
The new version also features more intelligent and efficient document crawling. The first version crawled documents in batch fashion, meaning it would scan and index the entire collection of documents every time the administrator scheduled a refresh. The new version only scans and indexes documents that have changed since the last crawl, an improvement that speeds up the process and reduces consumption of bandwidth and processing power, Google said.
In addition, administrators don't have to schedule the updates, since the new version is continuously crawling the collection, which results in changes being indexed more promptly. Thus, with the first version, the Search Appliance would be configured to run a batch update once a day, or once every two days, which could delay changes until the update was run, while the new version detects changes soon after they're made, Girouard said.
Users can also be prevented from viewing documents they're not authorised to access. After executing a query, the upgraded product rounds up all the documents that contain the keywords and then filters those documents based on the user who made the query, showing only the documents that the user has permission to view, he said.
Another new feature is the ability to create different collections of documents, whereas the first version allowed only for the creation of one collection of documents. Therefore, with the new version, a company might create a collection of searchable documents for its sales and marketing employees, a different one for its call-center employees, and so on.
The new version of the Search Appliance is twice as tall as the first version because it has more powerful hardware, which in turn generates more heat and requires more space for cooling, Girouard said. That means it is 2U (3.5 inches) high, and 19 inches wide. Google doesn't reveal which vendor makes the appliance's hardware. "It's commodity hardware - the same general hardware we use in our Google data centers," he said.
A basic installation of the Search Appliance can be completed in as little as 30 minutes, allowing an IS department to have it up and running in a matter of hours, he said. Installations that involve deeper customization will take longer to complete.
The Search Appliance can crawl and index documents in more than 250 file formats, as long as the documents are accessible via HTTP. It supports 28 languages, and search formats such as natural language, keywords and Boolean, he said.
It delivers cached page results, allows for document sorting by date, lets users search within results and features a self-learning spell checker that suggests alternate spellings for queries. For administrators, the product generates usage reports and crawl analysis.
The product is sold as a stand-alone device under its GB 1001 model number. A GB 1001 with a capacity of 150,000 documents starts at $32,000, while one with the maximum capacity of 1.5 million documents costs $175,000. The new version is available now. Included in the price are two years of customer support.
The Search Appliance is also sold in pre-configured stacks of multiple GB 1001s. The GB 5005 is a stack of five devices, while the GB 8008 is a stack of 12 devices. (In the first version, the GB 8008 was a stack of eight devices.) Google pre-configures these stacked devices to work together, he said.