Google Gets Caffeinated

Google’s latest web indexing system is promising “50 percent fresher results for web searches.”  It’s called Caffeine.

The main difference with Caffeine is this:  The old index would search one group of sites, then another, then another, then another, with each group being less prioritized than the previous group.  The Caffeine algorithm is indexing everything constantly.  Caffeine is what Google needs to follow ever-changing services like Twitter.  Our world is becoming digitalized at a very high rate, and Caffeine will help Google keep up.  

Google, in a blog post, used these metaphors:

Caffeine lets us index web pages on an enormous scale. In fact, every second Caffeine processes hundreds of thousands of pages in parallel. If this were a pile of paper it would grow three miles taller every second. Caffeine takes up nearly 100 million gigabytes of storage in one database and adds new information at a rate of hundreds of thousands of gigabytes per day. You would need 625,000 of the largest iPods to store that much information; if these were stacked end-to-end they would go for more than 40 miles.



Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s