Sat. Jun 06, 2020 - Fri. Jun 12, 2020

What, why, when to use Apache Kafka, with an example

Start Data Engineering

I have seen, heard, and been asked questions and comments like “What is Kafka, and when should I use it?” and “I don’t understand why we have to use Kafka.” The objective of this post is to get you up to speed with what Apache Kafka is, when to use it, and its foundational concepts, with a simple example. What is Apache Kafka? First, let’s understand what Apache Kafka is.

Kafka 130

EC2 & Session Manager (Toronto Project)

Team Data Science

Welcome back to this Toronto-specific data engineering project. We left off last time concluding that finance has the largest demand for data engineers with AWS skills, and sketched out what our data ingestion pipeline will look like. I began building out the data ingestion pipeline by launching an EC2 instance. I should note that if you have created an AWS account, but have not yet created an Identity and Access Management (IAM) admin role, and are therefore still using root credentials, I am s

Project 130

My Python/Java/Spring/Go/Whatever Client Won’t Connect to My Apache Kafka Cluster in Docker/AWS/My Brother’s Laptop. Please Help!

Confluent

tl;dr When a client wants to send or receive a message from Apache Kafka®, there are two types of connection that must succeed: The initial connection to a broker (the […].

Kafka 123

Rising from the Ashes

Teradata

Teradata's own Sir Freek Cox on dedicating one's life to charity and good works.

105

Data Management Trends From An Investor Perspective

Data Engineering Podcast

Summary The landscape of data management and processing is rapidly changing and evolving. There are certain foundational elements that have remained steady, but as the industry matures new trends emerge and gain prominence. In this episode Astasia Myers of Redpoint Ventures shares her perspective as an investor on which categories she is paying particular attention to for the near to medium term.

More Trending

Scaling Apache Kafka to 10+ GB Per Second in Confluent Cloud

Confluent

Apache Kafka® is the de facto standard for event streaming today. The semantics of the partitioned consumer model that Kafka pioneered have enabled scale at a level and at a cost […].

Kafka 101

The Lure and the Fallacy of the New Bright Shiny Object

Teradata

Teradata has an extraordinary legacy; yet only in the IT world is legacy looked at as a negative. Learn how Teradata's legacy gives it the ultimate competitive advantage.

IT 75

Why are Scala Type Classes Useful?

Rock the JVM

FP fans discuss the challenge of type classes in pure functional programming with Scala: why are they difficult, and why do we really need them?

Scala 52

MongoDB Performance Tuning - Top 5 Resources

Rockset

In the course of implementing the Rockset connector to MongoDB, we did a fair amount of research on the MongoDB user experience, both online and through user interviews. We learned a lot about how organizations operated MongoDB in production and found that many of our discussions invariably touched upon what it took to achieve performance at scale.

MongoDB 52

Announcing the MongoDB Atlas Sink and Source Connectors in Confluent Cloud

Confluent

We are excited to announce the preview release of the fully managed MongoDB Atlas source and sink connectors in Confluent Cloud, our fully managed event streaming service based on Apache […].

MongoDB 77

Teradata’s Differentiators – And Why They Matter

Teradata

Why is Teradata uniquely positioned to help the modern enterprise unlock the business value of data, quickly and coherently?

Data 69

Getting Started - Build your first Charts

Preset

Create a first chart with Superset

Confluent Hack Day 2020: Hack from Home

Confluent

At Confluent, every now and then we like to take a day away from our normal sprint tasks to hack. There are a ton of benefits to hack days, including: […].

66

Today, I Join Teradata

Teradata

New Teradata President and CEO Steve McMillan on the reasons why he joined the company as chief executive and the opportunities he sees for the future.

69