article thumbnail

A Detailed Guide of Interview Questions on Apache Kafka

Analytics Vidhya

Introduction Apache Kafka is an open-source publish-subscribe messaging application initially developed by LinkedIn in early 2011. It is a message broker application and a logging service that is distributed, segmented, and […] The post A Detailed Guide of Interview Questions on Apache Kafka appeared first on Analytics Vidhya.

Kafka 201
article thumbnail

The Good and the Bad of Apache Kafka Streaming Platform

AltexSoft

Kafka can continue the list of brand names that became generic terms for the entire type of technology. In this article, we’ll explain why businesses choose Kafka and what problems they face when using it. In this article, we’ll explain why businesses choose Kafka and what problems they face when using it. What is Kafka?

Kafka 93
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Hadoop vs Spark: Main Big Data Tools Explained

AltexSoft

Hadoop and Spark are the two most popular platforms for Big Data processing. To come to the right decision, we need to divide this big question into several smaller ones — namely: What is Hadoop? To come to the right decision, we need to divide this big question into several smaller ones — namely: What is Hadoop? scalability.

article thumbnail

The Good and the Bad of Hadoop Big Data Framework

AltexSoft

Depending on how you measure it, the answer will be 11 million newspaper pages or… just one Hadoop cluster and one tech specialist who can move 4 terabytes of textual data to a new location in 24 hours. The Hadoop toy. So the first secret to Hadoop’s success seems clear — it’s cute. What is Hadoop?

Hadoop 59
article thumbnail

Unapologetically Technical Episode 8 – Tom Scott

Jesse Anderson

We discuss the key features and how they enable analytics uses of data stored in Kafka. We go in-depth into Streambased. We cover how it works and the ease of use. Don’t forget to subscribe to my YouTube channel to get the latest on Unapologetically Technical!

Kafka 100
article thumbnail

Brief History of Data Engineering

Jesse Anderson

Doug Cutting took those papers and created Apache Hadoop in 2005. They were the first companies to commercialize open source big data technologies and pushed the marketing and commercialization of Hadoop. Hadoop was hard to program, and Apache Hive came along in 2010 to add SQL. We lacked a scalable pub/sub system.

article thumbnail

Unapologetically Technical Episode 10 – Michael Drogalis

Jesse Anderson

In this episode, I interview Michael Drogalis, the founder and CEO of ShadowTraffic where we talked about the early Hadoop era and how he saw the need for Kafka in the industry. And just like that, we’re down to the 10th episode of Unapologetically Technical!

Hadoop 100