Useful classes for data engineers - Scala & Java

Waitingforcode

In this blog post I'll share a list of Java and Scala classes I use in almost every data engineering project. The Python part will follow next week! We all have our habits and, as programmers, libraries and frameworks are definitely part of them.

Scala 130
Delivering Modern Enterprise Data Engineering with Cloudera Data Engineering on Azure

Cloudera

After the launch of CDP Data Engineering (CDE) on AWS a few months ago, we are thrilled to announce that CDE, the only cloud-native service purpose-built for enterprise data engineers, is now available on Microsoft Azure. Prerequisites for deploying CDP Data Engineering on Azure can be found here. Easy job deployment.

Using Trino And Iceberg As The Foundation Of Your Data Lakehouse

Data Engineering Podcast

In this episode Dain Sundstrom, CTO of Starburst, explains how the combination of the Trino query engine and the Iceberg table format offers the ease of use and execution speed of data warehouses with the infinite storage and scalability of data lakes. Go to dataengineeringpodcast.com/dagster today to get started. Your first 30 days are free!

Data Lake 262
Data News — Week 24.11

Christophe Blefari

yato is a small Python library that I've developed; the name stands for "yet another transformation orchestrator". Obviously Benoit prefers Kestra, at the expense of writing YAML and running a Java application. New Apache Arrow engines: Arrow has become one of the most used libraries when it comes to building in-memory engines.

Metadata 272
Brief History of Data Engineering

Jesse Anderson

Apache Spark came in 2009 and gave us a unified batch and streaming engine. Apache Flink came in 2011 and gave us our first real streaming engine. As evident as that sounds now, my writing about the need for data engineering went heavily against the grain of everything that was written at the time.

Enhancing The Abilities Of Software Engineers With Generative AI At Tabnine

Data Engineering Podcast

Generative AI has accelerated the ability of developer tools to provide useful suggestions that speed up the work of engineers. Tabnine is one of the main platforms offering an AI-powered assistant for software engineers. Python, Java, C++, etc.) With Materialize, you can! How does that translate into computer languages?

How to learn data engineering

Christophe Blefari

Learn data engineering, all the references (credits). This is a special edition of the Data News. Right now I'm on holiday, finishing a hiking week in Corsica 🥾, so I wrote this special edition about how to learn data engineering in 2024. The idea is to create a living reference about data engineering.