Tag - storing data streams

Real-time Distributed Data-streaming with Kafka

Written in Scala and Java, Apache Kafka is a fast, horizontally scalable, fault-tolerant messaging platform for distributed data streaming that originated at LinkedIn. It provides a publish-subscribe mechanism for processing and storing data streams in a fault-tolerant way. It is used to build real-time data pipelines that stream social data, geospatial data, or sensor data from various devices. Kafka integrates with Spark, Hadoop, Storm, HBase, Flink and many other systems for big data analytics. Using...
