Remove 2004 Remove Hadoop Remove Project
article thumbnail

Brief History of Data Engineering

Jesse Anderson

They created MapReduce and GFS in 2004. Doug Cutting took those papers and created Apache Hadoop in 2005. They were the first companies to commercialize open source big data technologies and pushed the marketing and commercialization of Hadoop. Hadoop was hard to program, and Apache Hive came along in 2010 to add SQL.

article thumbnail

A Prequel to Data Mesh

Towards Data Science

My personal take on justifying the existence of Data Mesh A senior stakeholder at one my projects mentioned that they wanted to decentralise their data platform architecture and democratise data across the organisation. Result: Hadoop & NoSQL frameworks emerged. The concept of `Data Marts` was introduced.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Q&A with Greg Rahn – The changing Data Warehouse market

Cloudera

I was part of this migration project, and then after undergrad, I went on to be a software engineer for a utility company, who was using DB2 on the mainframe and migrating to Oracle on Unix. Greg Rahn: Toward the end of that eight-year stint, I saw this thing coming up called Hadoop and an engine called Hive. Michael Moreno: Nice!

article thumbnail

Data Analysis with Spark

Zalando Engineering

For the sake of comparison, let’s recap the Hadoop way of working: Hadoop saves intermediate states to disk and communicates over a network. In fact, in a 2004 mapReduce research paper the designer states that key-value pairs is a key choice in designing mapReduce. Provides in memory storage for cached RDD’s.

article thumbnail

Industry Interview Series- How Big Data is Transforming Business Intelligence?

ProjectPro

Solocal has taken big data to the next stage of BI by designing a novel vision of BI with the open source distributed computing framework Hadoop. It replaced its traditional BI structure by integrating big data and Hadoop."-April For example, say we get a project on analyzing Twitter data. So what is BI? So what is BI?

article thumbnail

How to Become a Data Engineer in 2024?

Knowledge Hut

Facebook It is a social media platform created originally by Mark Zuckerberg for college students in 2004. Most of the Data engineers working in the field enroll themselves in several other training programs to learn an outside skill, such as Hadoop or Big Data querying, alongside their Master's degree and PhDs.