Data storage mechanism in FacebookGautam Goswami
We are all almost familiar with social media mainly Facebook where photo uploads total 300 million per day. Daily generates 4.5 billion likes. Every 60 seconds, 510 comments are posted, 293,000 statuses are updated. It’s really curious to know how Facebook stores such a huge volume of data that’s totally impossible using traditional database management systems (RDBMS) . Facebook is using distributed database management system called Cassandra.
Cassandra was initially developed by Facebook to enhance their inbox search feature. In July, 2008, it was released as open source project on Google code. Eventually, it became an incubator project for Apache community in 2009. It has been developed by using Java programming language and coming under the NoSQL Database management system. Cassandra has been designed to achieve high scalable, extreme performance and to hold very large volume of data across many commodity servers in a cluster. This is a schema less database and maintain column family concept to hold data. Cassandra is a proven fault-tolerance on commodity hardware or cloud infrastructure.