Developers Arena

Why Facebook Uses Apache Hadoop and HBase

Dhruba Borthakur, a Hadoop engineer at Facebook, has published part of a paper he co-authored with several other Facebook engineers on Apache Hadoop at Facebook. The first part explains Facebook’s requirements and non-requirements for a data store for its revamped Facebook Messages application, and why it chose Apache Hadoop and HBase to power it. The paper will be published at SIGMOD 2011.

The requirements:

Elasticity
High write throughput
Efficient and low-latency strong consistency semantics within a data center
Efficient random reads from disk
High Availability and Disaster Recovery
Fault Isolation
Atomic read-modify-write primitives
Range Scans
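
Two of these requirements, atomic read-modify-write primitives and range scans, are easier to picture with a small example. What follows is only a rough sketch in Python of what the two terms mean, using a toy in-memory store. The class `ToyRowStore`, its methods, and the row keys are all hypothetical names made up for illustration; this is not HBase's API and not code from the paper.

```python
import bisect
import threading


class ToyRowStore:
    """Toy in-memory stand-in for a sorted key-value store (not HBase)."""

    def __init__(self):
        self._lock = threading.Lock()
        self._keys = []    # row keys, kept in sorted order
        self._values = {}  # row key -> integer counter

    def increment(self, key, amount=1):
        """Atomic read-modify-write: read, add, and write back under one lock."""
        with self._lock:
            if key not in self._values:
                bisect.insort(self._keys, key)
                self._values[key] = 0
            self._values[key] += amount
            return self._values[key]

    def scan(self, start, stop):
        """Range scan: return (key, value) pairs with start <= key < stop."""
        with self._lock:
            lo = bisect.bisect_left(self._keys, start)
            hi = bisect.bisect_left(self._keys, stop)
            return [(k, self._values[k]) for k in self._keys[lo:hi]]


store = ToyRowStore()


def worker():
    # Each thread bumps the same (hypothetical) unread-message counter.
    for _ in range(1000):
        store.increment("user1:unread")


threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

store.increment("user2:unread")

# Scan only the rows whose key starts with "user1:" (";" sorts right after ":").
rows = store.scan("user1:", "user1;")
print(rows)
```

Because every increment happens under the lock, no update from the four threads is lost. Keeping the keys sorted is what lets the scan return a contiguous slice of rows rather than checking every key.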

The non-requirements:

Tolerance of network partitions within a single data center
Zero Downtime in case of individual data center failure
Active-active serving capability across different data centers

You can find out much more by reading the paper. It was written by Dhruba Borthakur, Kannan Muthukkaruppan, Karthik Ranganathan, Samuel Rash, Joydeep Sen Sarma, Jonathan Gray, Nicolas Spiegelberg, Hairong Kuang, Dmytro Molkov, Aravind Menon, Rodrigo Schmidt and Amitanand Aiyer.

By Klint Finley – May 21, 2011
