Brief History of Data Engineering
Jesse Anderson
DECEMBER 12, 2022
Apache Spark came in 2009 and gave a unified batch and streaming engine. At various times it’s been Java, Scala, and Python. Hadoop didn’t support doing things in real-time, and Apache Storm was open sourced in 2011. It didn’t get wide adoption as it was a bit early for real-time, and the API was difficult to wield.
Let's personalize your content