Remove 2022 Remove Big Data Tools Remove Project
article thumbnail

Data Engineering Annotated Monthly – June 2022

Big Data Tools

ShardingSphere – One more thing I learned while preparing this installment is that there is an entire top-level project to convert traditional databases into distributed ones. InLong 1.2.0 – This is one of the more interesting projects I hadn’t already heard of before preparing this installment.

article thumbnail

Data Engineering Annotated Monthly – June 2022

Big Data Tools

ShardingSphere – One more thing I learned while preparing this installment is that there is an entire top-level project to convert traditional databases into distributed ones. InLong 1.2.0 – This is one of the more interesting projects I hadn’t already heard of before preparing this installment.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data Engineering Annotated Monthly – April 2022

Big Data Tools

YuniKorn 1.0.0 – If you’ve been anxiously waiting for Kubernetes to come to data engineering, your wishes have been granted. A top-level ASF project, YuniKorn 1.0 is a scheduler targeting big data and ML workflows, and of course, it is cloud-native. That wraps up April’s Data Engineering Annotated.

article thumbnail

Data Engineering Annotated Monthly – April 2022

Big Data Tools

YuniKorn 1.0.0 – If you’ve been anxiously waiting for Kubernetes to come to data engineering, your wishes have been granted. A top-level ASF project, YuniKorn 1.0 is a scheduler targeting big data and ML workflows, and of course, it is cloud-native. That wraps up April’s Data Engineering Annotated.

article thumbnail

Data Engineering Annotated Monthly – September 2022

Big Data Tools

This time I learned about Brooklin, a LinkedIn service for streaming data in a heterogeneous environment. The official GitHub for the project says that it is characterized by high reliability and throughput, claiming that Brooklin can run hundreds of streaming pipelines simultaneously. Nevertheless, the project looks very interesting.

article thumbnail

Data Engineering Annotated Monthly – September 2022

Big Data Tools

This time I learned about Brooklin, a LinkedIn service for streaming data in a heterogeneous environment. The official GitHub for the project says that it is characterized by high reliability and throughput, claiming that Brooklin can run hundreds of streaming pipelines simultaneously. Nevertheless, the project looks very interesting.

article thumbnail

Data Engineering Annotated Monthly – October 2022

Big Data Tools

One of the great things about ASF projects is that they usually work nicely together, and this is no exception. Apache Age 1.1.0 – Sometimes, we data engineers do work that doesn’t deal directly with big data. That wraps up October’s Data Engineering Annotated. For example, the current 1.1.3