Sunday, June 13, 2010

Web indexing delivers new content more quickly – Google Caffeine launch

Wednesday Caffeine, Google's new web indexing system, went live. Announcing the worldwide launch of Caffeine, Google said its search engine technology makes more fresh minted web content available and produces that new content much faster than before. Searchers and web content developers don’t have to change the way they use Google. But links to a broader range of relevant content are now presented much sooner following the content is published. The Caffeine overhaul of the web indexing technology also provides Google a lot more flexibility to keep pace with a web that is evolving at an accelerating rate.

Article Resource: Google Caffeine launch – web indexing delivers new content faster By Personal Money Store

Google Caffeine release explains speed isn't everything

Caffeine offers 50 percent of freshwater search results. The common Google user might have a hard time translating that into a benefit. PCWorld tested a comparison of web indexing systems when Caffeine was in development and found that results took .15 seconds on the regular Google search and .09 seconds on Caffeine. Since Caffeine is the regular Google search now, the test can't be repeated. And .06 seconds probably won’t make much of a difference for searchers, regardless how tight the deadline seems to be. Nevertheless, what shows up .06 seconds faster will make a difference for those who work with content publishing.

Real time information publishing

The urgent benefit of the Google Caffeine launch to the average user is fresher content, and more of it. Google’s Matt Cutts explained to Search Engine Land that that “Caffeine benefits both searchers and content owners because it means that all content (and not just content deemed "real time") could be searchable within seconds after it’s crawled." As outlined by Search Engine Land, the old Google would crawl a set of pages, process those pages and add them to the index. Instead of one page at a time proceeding, the whole batch had to go at the exact same time. Now Google crawls and processes pages individually and instantly.

Caffeine has astronomical storage capacity

For Caffeine to eliminate the delay between when it finds a page and makes it available needs an astronomical amount of storage. Carrie Grimes said Caffeine indexed web pages on an enormous scale. Caffeine processes thousands and thousands of pages in parallel — every second. Paper pages processed at that rate would stack 3 miles high — every second. Caffeine takes up nearly 100 million gigabytes of storage in a single database and adds new data at a rate of hundreds of thousands of gigabytes per day. You would need to have 625,000 of the largest iPods to store that much info; if these were stacked end-to-end they would go for a lot more than 40 miles. PC World adds the bill from Apple would be $155,625,000.

Attempting to keep up with Caffeine

The Google Caffeine launch doesn’t change content publishing or web search. But some essential information were pointed out by Resource Shelf. Data will be changing locations daily. The Cache and the page is refreshed a lot more often. If a searcher needs content on a page the way it looked at noon on Wednesday, it's smart to make a copy with something like Zotero, a Firefox extension because by 12:15 p.m. on Wednesday the content on the page might change when the cache is updated.

Additional information at these websites

PC World

pcworld.com/article/198384/google_jolts_search_with_fresher_results_with_caffeine.html?tk=hp_blg

Searchengineland.com

searchengineland.com/googles-new-indexing-infrastructure-caffeine-now-live-43891?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed: searchengineland (Search Engine Land)&utm_content=Google Reader

Official Google Blog

googleblog.blogspot.com



No comments: