Simplifying Data Integration Through Eventual Connectivity

Data Engineering Podcast

If you are struggling to maintain a tangle of data pipelines, then you might find some new ideas for reducing your workload. What is eventual connectivity and how does it address the problems with ETL in the current data landscape? How much up-front modeling is necessary to make this a viable approach to data integration?

Data Serialization Formats with Doug Cutting and Julien Le Dem - Episode 8

Data Engineering Podcast

To help other people find the show, you can leave a review on iTunes or Google Play Music, and tell your friends and co-workers. This is your host Tobias Macey, and today I’m interviewing Julien Le Dem and Doug Cutting about data serialization formats and how to pick the right one for your systems.

Setting The Stage For The Next Chapter Of The Cassandra Database

Data Engineering Podcast

Summary: The Cassandra database is one of the first open source options for globally scalable storage systems. Since its introduction in 2008 it has been powering systems at every scale. Cassandra is primarily used as a system of record. How did you get involved in the Cassandra project and how would you characterize your role?

Do You Manage Your Data Debt Alongside Your Technical Debt?

The Modern Data Company

Like technical debt, data debt represents the liability accrued over time by inefficient and outdated methods and technologies for handling corporate data. It is important to note that data debt has methodological and technological components. Data management technologies grow old and out of date.

Exploring The Insights And Impact Of Dan Delorey's Distinguished Career In Data

Data Engineering Podcast

In this episode he takes a trip down memory lane to weave an interesting and informative narrative about the broader themes throughout his work and their echoes in the modern data ecosystem. Can you start by sharing what your current relationship to the data ecosystem is and the cliffs-notes version of how you ended up there?

Why Your Master Data Management Needs Data Governance

Precisely

With cloud computing, the capacity to extract value from data is greater than ever. As this realization grows, businesses are shifting their investments from hardware to technologies that optimize data assets. Master Data Management systems (MDM) play an important role in harmonizing data assets across large and midsize enterprises.

In AI we trust? Why we Need to Talk About Ethics and Governance (part 2 of 2)

Cloudera

In 2019, the Gradient Institute published a white paper outlining the practical challenges for Ethical AI. They identified four main categories: capturing intent, system design, human judgement and oversight, and regulations. An AI system trained on data has no context outside of that data. System Design.