Sat.Sep 05, 2020 - Fri.Sep 11, 2020

article thumbnail

Data Champions: Balancing IT and Business Needs

Cloudera

Digital transformation has been on the agenda for a long time, but the sudden need to respond to the unprecedented challenges of 2020, has meant the buzzword has become an executable reality for many enterprises. I recently came across a KPMG report that revealed that 80% of executives are increasing investments on emerging technologies now, to drive higher realized value in the future.

IT 105
article thumbnail

The Cause and Effect of Supply Chain Fragility, and How to Fix It

Teradata

The fragility of your supply chain existed long before COVID-19 brought it into sharp relief. Discover the secret to true supply chain resilience.

IT 105
article thumbnail

Simplify Your Data Architecture With The Presto Distributed SQL Engine

Data Engineering Podcast

Summary Databases are limited in scope to the information that they directly contain. For analytical use cases you often want to combine data across multiple sources and storage locations. This frequently requires cumbersome and time-consuming data integration. To address this problem Martin Traverso and his colleagues at Facebook built the Presto distributed query engine.

article thumbnail

Seamlessly Swapping the API backend of the Netflix Android app

Netflix Tech

How we migrated our Android endpoints out of a monolith into a new microservice by Rohan Dhruva , Ed Ballot As Android developers, we usually have the luxury of treating our backends as magic boxes running in the cloud, faithfully returning us JSON. At Netflix, we have adopted the Backend for Frontend (BFF) pattern : instead of having one general purpose “backend API”, we have one backend per client (Android/iOS/TV/web).

Java 97
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Cloudera Named Leader in The Forrester Wave: Notebook-Based Predictive Analytics and Machine Learning, Q3 2020

Cloudera

Cloudera has been named a Leader in The Forrester Wave : Notebook-Based Predictive Analytics and Machine Learning, Q3 2020. At Cloudera, we are committed to always staying at the forefront of data and analytics innovation — enabling enterprises to more optimally work with data to deliver analytic results across the business quickly and securely. For enterprise machine learning teams, this means having the right platform, tools, and processes that streamline end-to-end ML to tackle once-impossibl

article thumbnail

How Teradata Vantage with Native Object Store Decreases Costs, Increases Business Value

Teradata

The latest release of Teradata Vantage with Native Object Store enables companies to not only drive down costs by leveraging object store technologies, but also improve manageability and drive business insights with the power of Vantage.

More Trending

article thumbnail

Top Marketing Challenges for Tech Companies

Grouparoo

Martech Challenges in 2020 In the process of starting Grouparoo, we interviewed a hundred people who work in Marketing at various levels and roles. They spanned levels from independent contributors to executives and covered a wide range of marketing disciplines including Marketing Ops, Marketing Automation, Product Marketing, and more. Across our interviews, we heard about a diversity of experiences, but we heard a few common themes: Marketing’s scope is increasing Marketing is becoming more and

article thumbnail

Covid-19 Accelerates The Need for Retail, Manufacturing Supply Chains To Adapt

Cloudera

The ongoing disruption to critical supply chains in both the manufacturing and retail space has seen businesses having to respond quickly, turning to data, analytics, and new technologies to better predict and manage ‘real-time’ business disruptions. . To find out more about how COVID-19 has impacted the manufacturing and retail industries Vijay Raja, Director of Industry & Solutions Marketing at Cloudera sat down for a round-table discussion with Michael Ger , Managing Director of Manufactu

article thumbnail

To Integrate or Not to Integrate Data? That is the Question.

Teradata

Learn why a data-centric organization requires an objective approach to manage and integrate its data.

Data 59
article thumbnail

Deploy a Scala Application to AWS Lambda

Rock the JVM

Deploying Scala Code to AWS Lambda Is a Breeze: Discover Our Step-by-Step Tutorial to Guide You Through the Process

Scala 52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Test-data management  support in Test Automation Development

Data Science Blog: Data Engineering

Data is centric in testing of several applications because data is critical to organizations. Businesses are becoming more data-driven, and hence it is imperative that as Automation Test developers, the value of the test-data is understood and completely harnessed during Test Automation development. The test-data involved in both Manual/Automation testing encompasses the test-data inputs, test-data outputs, and the test-data flow.

article thumbnail

How-to: Index Data from S3 Using CDP Data Hub

Cloudera

This blog post will present a simple “hello world” kind of example on how to get data that is stored in S3 indexed and served by an Apache Solr service hosted in a Data Discovery and Exploration cluster in CDP. For the curious: DDE is a pre-templeted Solr-optimized cluster deployment option in CDP, and recently released in tech preview. We will only cover AWS and S3 environments in this blog.

AWS 85
article thumbnail

Teradata: An Enduring Legacy

Teradata

Teradata’s legacy of success is based upon three building blocks: People – Technology --Partnership. Learn how how a small piece of that legacy began and grew.

article thumbnail

Deploying Confluent Operator on Red Hat OpenShift Container Platform on AWS

Confluent

Confluent Operator allows you to deploy and manage Confluent Platform as a cloud-native, stateful container application on Kubernetes and OpenShift. The automation provided by Kubernetes, Operator, and Helm greatly simplifies […].

AWS 52
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Refined Types in Scala Quickly Explained

Rock the JVM

Explore how to impose constraints on values at compile time using the Refined library

Scala 52
article thumbnail

The Future Of The Telco Industry And Impact Of 5G & IoT – Part 1

Cloudera

Technology like IoT, edge computing and 5G are changing the face of CSPs. Communication Service Providers (CSPs) are in the middle of a data-driven transformation. The current scale and pace of change in the Telecommunications sector is being driven by the rapid evolution of new technologies like the Internet of Things (IoT), 5G, advanced data analytics, and edge computing.

article thumbnail

Accelerate AI & ML projects using Databricks

RandomTrees

Databricks and its role in Data Prep for AI solutions Databricks is a buzzword in Data Science. It is so due to a lot of reasons. In order to work with massive amounts of data in petabytes or even more, Apache Spark is widely used. Apache Spark is an open-source, fast cluster computing system and a highly popular framework for big data analysis. This framework processes the data in parallel that helps to boost the performance.

Project 52
article thumbnail

Implementing Message Prioritization in Apache Kafka

Confluent

Users of messaging technologies such as JMS and AMQP often use message prioritization so that messages can be processed in a different order based on their importance. It doesn’t take […].

Kafka 45
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Meet Boris Malensek, Our Head Of Engineering In Merchant Operations

Zalando Engineering

We spoke about his professional journey within Zalando, the evolution of Merchant Operations, and the engineering culture within the company. The interview was initially conducted for Zalando’s External Talent Community. Boris, let’s go back to the start. What attracted you to Zalando in the first place? The main reason for my attraction to Zalando was how quickly the company was able to adapt to change.

article thumbnail

Operational Database Security – Part 1

Cloudera

In this blog post, we are going to take a look at some of the OpDB related security features of a CDP Private Cloud Base deployment. We are going to talk about encryption, authentication and authorization. . Data-at-rest encryption. Transparent data-at-rest encryption is available through the Transparent Data Encryption (TDE) feature in HDFS. . TDE provides the following features: Transparent, end-to-end encryption of data.

article thumbnail

Accelerate your Data Migration to Snowflake

RandomTrees

Snowflake Overview A data warehouse is a critical part of any business organization. Lot of cloud-based data warehouses are available in the market today, out of which let us focus on Snowflake. Snowflake is an analytical data warehouse that is provided as Software-as-a-Service (SaaS). Built on new SQL database engine, it provides a unique architecture designed for the cloud.

article thumbnail

Building an effective data approach in a hybrid cloud world – part 3

Cloudera

In our last two posts, we talked with Deloitte’s Marc Beierschoder and Martin Mannion respectively about the requirement organizations have to deploy their data and analytics , quickly, into a hybrid environment. On top of that, there is the fundamental aspect of consistent security and governance of your enterprise data cloud and need for multiple users with different requirements to access data flexibly.

Cloud 67
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.