Sat.Sep 05, 2020 - Fri.Sep 11, 2020

article thumbnail

Simplify Your Data Architecture With The Presto Distributed SQL Engine

Data Engineering Podcast

Summary Databases are limited in scope to the information that they directly contain. For analytical use cases you often want to combine data across multiple sources and storage locations. This frequently requires cumbersome and time-consuming data integration. To address this problem Martin Traverso and his colleagues at Facebook built the Presto distributed query engine.

article thumbnail

Data Champions: Balancing IT and Business Needs

Cloudera

Digital transformation has been on the agenda for a long time, but the sudden need to respond to the unprecedented challenges of 2020, has meant the buzzword has become an executable reality for many enterprises. I recently came across a KPMG report that revealed that 80% of executives are increasing investments on emerging technologies now, to drive higher realized value in the future.

IT 103
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Seamlessly Swapping the API backend of the Netflix Android app

Netflix Tech

How we migrated our Android endpoints out of a monolith into a new microservice by Rohan Dhruva , Ed Ballot As Android developers, we usually have the luxury of treating our backends as magic boxes running in the cloud, faithfully returning us JSON. At Netflix, we have adopted the Backend for Frontend (BFF) pattern : instead of having one general purpose “backend API”, we have one backend per client (Android/iOS/TV/web).

Java 95
article thumbnail

The Cause and Effect of Supply Chain Fragility, and How to Fix It

Teradata

The fragility of your supply chain existed long before COVID-19 brought it into sharp relief. Discover the secret to true supply chain resilience.

IT 105
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

How to introduce Data Science at your company

DareData

Machine Learning, Data Science and Artificial Intelligence are three terms that have been intertwined and used in multiple conversations during the past decade. Probably, in the business world, no other theme has caused so many questions, doubts, eyebrow raises and el dorado hopes. If you are reading this post you might have some level of interest in understanding what Data Science / Machine Learning or Artificial Intelligence are and trust me, you are not alone in the world.

article thumbnail

Cloudera Named Leader in The Forrester Wave: Notebook-Based Predictive Analytics and Machine Learning, Q3 2020

Cloudera

Cloudera has been named a Leader in The Forrester Wave : Notebook-Based Predictive Analytics and Machine Learning, Q3 2020. At Cloudera, we are committed to always staying at the forefront of data and analytics innovation — enabling enterprises to more optimally work with data to deliver analytic results across the business quickly and securely. For enterprise machine learning teams, this means having the right platform, tools, and processes that streamline end-to-end ML to tackle once-impossibl

More Trending

article thumbnail

How Teradata Vantage with Native Object Store Decreases Costs, Increases Business Value

Teradata

The latest release of Teradata Vantage with Native Object Store enables companies to not only drive down costs by leveraging object store technologies, but also improve manageability and drive business insights with the power of Vantage.

article thumbnail

Test-data management  support in Test Automation Development

Data Science Blog: Data Engineering

Data is centric in testing of several applications because data is critical to organizations. Businesses are becoming more data-driven, and hence it is imperative that as Automation Test developers, the value of the test-data is understood and completely harnessed during Test Automation development. The test-data involved in both Manual/Automation testing encompasses the test-data inputs, test-data outputs, and the test-data flow.

article thumbnail

Covid-19 Accelerates The Need for Retail, Manufacturing Supply Chains To Adapt

Cloudera

The ongoing disruption to critical supply chains in both the manufacturing and retail space has seen businesses having to respond quickly, turning to data, analytics, and new technologies to better predict and manage ‘real-time’ business disruptions. . To find out more about how COVID-19 has impacted the manufacturing and retail industries Vijay Raja, Director of Industry & Solutions Marketing at Cloudera sat down for a round-table discussion with Michael Ger , Managing Director of Manufactu

article thumbnail

Deploying Confluent Operator on Red Hat OpenShift Container Platform on AWS

Confluent

Confluent Operator allows you to deploy and manage Confluent Platform as a cloud-native, stateful container application on Kubernetes and OpenShift. The automation provided by Kubernetes, Operator, and Helm greatly simplifies […].

AWS 52
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Accelerate AI & ML projects using Databricks

RandomTrees

Databricks and its role in Data Prep for AI solutions Databricks is a buzzword in Data Science. It is so due to a lot of reasons. In order to work with massive amounts of data in petabytes or even more, Apache Spark is widely used. Apache Spark is an open-source, fast cluster computing system and a highly popular framework for big data analysis. This framework processes the data in parallel that helps to boost the performance.

Project 52
article thumbnail

Teradata: An Enduring Legacy

Teradata

Teradata’s legacy of success is based upon three building blocks: People – Technology --Partnership. Learn how how a small piece of that legacy began and grew.

article thumbnail

How-to: Index Data from S3 Using CDP Data Hub

Cloudera

This blog post will present a simple “hello world” kind of example on how to get data that is stored in S3 indexed and served by an Apache Solr service hosted in a Data Discovery and Exploration cluster in CDP. For the curious: DDE is a pre-templeted Solr-optimized cluster deployment option in CDP, and recently released in tech preview. We will only cover AWS and S3 environments in this blog.

AWS 84
article thumbnail

Implementing Message Prioritization in Apache Kafka

Confluent

Users of messaging technologies such as JMS and AMQP often use message prioritization so that messages can be processed in a different order based on their importance. It doesn’t take […].

Kafka 45
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Accelerate your Data Migration to Snowflake

RandomTrees

Snowflake Overview A data warehouse is a critical part of any business organization. Lot of cloud-based data warehouses are available in the market today, out of which let us focus on Snowflake. Snowflake is an analytical data warehouse that is provided as Software-as-a-Service (SaaS). Built on new SQL database engine, it provides a unique architecture designed for the cloud.

article thumbnail

To Integrate or Not to Integrate Data? That is the Question.

Teradata

Learn why a data-centric organization requires an objective approach to manage and integrate its data.

Data 59
article thumbnail

The Future Of The Telco Industry And Impact Of 5G & IoT – Part 1

Cloudera

Technology like IoT, edge computing and 5G are changing the face of CSPs. Communication Service Providers (CSPs) are in the middle of a data-driven transformation. The current scale and pace of change in the Telecommunications sector is being driven by the rapid evolution of new technologies like the Internet of Things (IoT), 5G, advanced data analytics, and edge computing.

article thumbnail

Deploy a Scala Application to AWS Lambda

Rock the JVM

Deploying Scala Code to AWS Lambda Is a Breeze: Discover Our Step-by-Step Tutorial to Guide You Through the Process

Scala 52
article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

Meet Boris Malensek, Our Head Of Engineering In Merchant Operations

Zalando Engineering

We spoke about his professional journey within Zalando, the evolution of Merchant Operations, and the engineering culture within the company. The interview was initially conducted for Zalando’s External Talent Community. Boris, let’s go back to the start. What attracted you to Zalando in the first place? The main reason for my attraction to Zalando was how quickly the company was able to adapt to change.

article thumbnail

Operational Database Security – Part 1

Cloudera

In this blog post, we are going to take a look at some of the OpDB related security features of a CDP Private Cloud Base deployment. We are going to talk about encryption, authentication and authorization. . Data-at-rest encryption. Transparent data-at-rest encryption is available through the Transparent Data Encryption (TDE) feature in HDFS. . TDE provides the following features: Transparent, end-to-end encryption of data.

article thumbnail

Building an effective data approach in a hybrid cloud world – part 3

Cloudera

In our last two posts, we talked with Deloitte’s Marc Beierschoder and Martin Mannion respectively about the requirement organizations have to deploy their data and analytics , quickly, into a hybrid environment. On top of that, there is the fundamental aspect of consistent security and governance of your enterprise data cloud and need for multiple users with different requirements to access data flexibly.

Cloud 66
article thumbnail

Refined Types in Scala Quickly Explained

Rock the JVM

Explore how to impose constraints on values at compile time using the Refined library

Scala 52
article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.