Useful classes for data engineers - Scala & Java

Waitingforcode

We all have our habits, and for programmers, favorite libraries and frameworks are definitely among them. In this blog post I'll share a list of Java and Scala classes I use in almost every data engineering project. The Python part will follow next week!

Scala 130

Ready-to-go sample data pipelines with Dataflow

Netflix Tech

Workflow Definitions: Below you can see a typical file structure of a sample workflow package written in SparkSQL. In every sample workflow package there are three workflow definition files that work together to provide flexible functionality. See an example high water mark job from the main workflow definition.

A guide to UDP in Scala with FS2

Rock the JVM

Setting Up: Let’s create a new Scala 3 project and add the following to your build.sbt file: val scala3Version = "3.3.1" lazy val root = project. The UDP Server: Create Fs2Udp.scala in the following path, src/main/scala/com/rockthejvm/fs2Udp/Fs2Udp.scala, and add the following code: package com.rockthejvm.fs2Udp import cats.effect.

Scala 90

Value Classes in Scala Explained

Rock the JVM

Discover a powerful technique for eliminating hard-to-trace bugs with ad-hoc type definitions: learn how Scala 2's newtypes and Scala 3's opaque types can enhance your code's safety and maintainability

Scala 52

Scala CLI Tutorial: Creating a CLI Sudoku Solver

Rock the JVM

Antonio is an alumnus of Rock the JVM, now a senior Scala developer with his own contributions to Scala libraries and junior devs under his mentorship. Which brings us to this article: Antonio originally started from my Sudoku backtracking article and built a Scala CLI tutorial for the juniors he’s mentoring.

Scala 52

Data pipeline asset management with Dataflow

Netflix Tech

By external assets we simply mean some executable carrying the actual business logic of the job. It could be a JAR compiled from Scala, a Python script or module, or a simple SQL file. The important thing is that this business logic can be built in a separate repository and maintained independently from the workflow definition.