Remove Events Remove Metadata Remove Process
article thumbnail

Functional Data Engineering — a modern paradigm for batch data processing

Maxime Beauchemin

Batch data processing  — historically known as ETL —  is extremely challenging. In this post, we’ll explore how applying the functional programming paradigm to data engineering can bring a lot of clarity to the process. The greater the claim made using analytics, the greater the scrutiny on the process should be.

article thumbnail

Interesting startup idea: benchmarking cloud platform pricing

The Pragmatic Engineer

Results are stored in git and their database, together with benchmarking metadata. Code and raw data repository:   Version control: GitHub Heavily using GitHub Actions for things like getting warehouse data from vendor APIs, starting cloud servers, running benchmarks, processing results, and cleaning up after tuns.

Cloud 278
article thumbnail

Sysmon Security Event Processing in Real Time with KSQL and HELK

Confluent

During a recent talk titled Hunters ATT&CKing with the Right Data , which I presented with my brother Jose Luis Rodriguez at ATT&CKcon, we talked about the importance of documenting and modeling security event logs before developing any data analytics while preparing for a threat hunting engagement. Yeah…I can do that already!

Process 83
article thumbnail

Journey to Event Driven – Part 4: Four Pillars of Event Streaming Microservices

Confluent

Event-first thinking enables us to build a new atomic unit: the event. Four pillars of event streaming. Pillar 1 – Business function: Payment processing pipeline. Pillar 4 – Operational plane: Event logging, DLQs and automation. Journey to Event Driven – Part 2: Programming Models for the Event-Driven Architecture.

Kafka 94
article thumbnail

Bringing The Power Of The DataHub Real-Time Metadata Graph To Everyone At Acryl Data

Data Engineering Podcast

Summary The binding element of all data work is the metadata graph that is generated by all of the workflows that produce the assets used by teams across the organization. What are the convenience features that you are building to augment the capabilities and integration process for DataHub?

Metadata 100
article thumbnail

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

Netflix Tech

By Abhinaya Shetty , Bharath Mummadisetty In the inaugural blog post of this series, we introduced you to the state of our pipelines before Psyberg and the challenges with incremental processing that led us to create the Psyberg framework within Netflix’s Membership and Finance data engineering team.

article thumbnail

Using Graph Processing for Kafka Stream Visualizations

Confluent

Stream processing engines like KSQL furthermore give you the ability to manipulate all of this fluently. So we can improve a portion of just about any event streaming application by adding graph abilities to it. You can use this as an example of how to add graph abilities to any event streaming application. Here we go!

Kafka 55