Sat. Oct 03, 2020 - Fri. Oct 09, 2020

Self Service Real Time Data Integration Without The Headaches With Meroxa

Data Engineering Podcast

Summary: Analytical workloads require a well-engineered and well-maintained data integration process to ensure that your information is reliable and up to date. Building a real-time pipeline for your data lakes and data warehouses is a non-trivial effort, requiring a substantial investment of time and energy. Meroxa is a new platform that aims to automate the heavy lifting of change data capture, monitoring, and data loading.

Project Metamorphosis Month 6: Secure Apache Kafka in Confluent Cloud

Confluent

The cloud opens up exciting new opportunities for information gathering, analysis, and sharing that can make every organization’s products and services better. Thanks to the cloud and its decentralized nature, […].

Cloud 111

7 New Ways Cloudera Is Investing in Our Culture

Cloudera

As Cloudera offices around the world continue to cope with the impact of COVID-19, we have worked hard to ease stress and adapt to remote working. People are the heart of our company and we’re investing in creative, new ways to make every Clouderan feel valued and appreciated. Clouderans are superstars at work and at home, and burn-out is unhealthy for employees, their families, and the company.

Designing 103

Announcing Vantage on Google Cloud

Teradata

Teradata Vantage on Google Cloud is now generally available! Vantage on Google Cloud is an as-a-service offer in which customers can get the most analytic value from their data. Read more.

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

The Curse of Dimensionality

Domino Data Lab: Data Engineering

Danger of Big Data: Big data is all the rage. This could be lots of rows (samples) and few columns (variables), like credit card transaction data, or lots of columns (variables) and few rows (samples), like genomic sequencing in life sciences research. The Curse of Dimensionality, or the Large P, Small N (P >> N) problem, applies to the latter case: many variables measured on relatively few samples.

Restoring Balance to the Cluster: Self-Balancing Clusters in Confluent Platform 6.0

Confluent

Apache Kafka® scales well. A Kafka cluster can grow to tens or hundreds of brokers and easily sustain tens of GB per second of read and write traffic. But scaling […].

Kafka 103

More Trending

Zero Down – and Pay Only for What You Use with Teradata Consumption Pricing

Teradata

Consumption Pricing is a usage-based option with automatic elasticity in which you pay only for compute resources consumed for successful queries, plus storage. Learn more.


Scala 3: Type Lambdas Quickly Explained

Rock the JVM

Exploring type lambdas in Scala 3: a quick yet powerful structure that will elevate your high-level thinking

Scala 52

Introducing Cluster Linking in Confluent Platform 6.0

Confluent

With the release of Confluent Platform 6.0 comes a preview of Confluent Cluster Linking available to self-managed customers and in Confluent Cloud for our early access partners. Cluster Linking is […].

Cloud 95

Building a Simple CRUD web application and image store using Cloudera Operational Database and Flask

Cloudera

The Cloudera Operational Database (COD) is a managed dbPaaS solution available as an experience in Cloudera Data Platform (CDP). It offers multi-modal client access with NoSQL key-value using Apache HBase APIs and relational SQL with JDBC (via Apache Phoenix). The latter makes COD accessible to developers who are used to building applications that use MySQL, Postgres, etc.

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

Accelerating Innovation in the Analytic Ecosystem: Accessibility

Teradata

In the final part of this 3-part series on reducing conflict between business & IT to accelerate innovation, we focus on enabling accessibility to data with security & governance.

Types, Kinds, and Type Constructors in Scala

Rock the JVM

Discover Scala's powerful type system: explore type organization, type constructors, and their significance

Scala 52

Getting Started with Kafka Connect for New Relic

Confluent

It’s 3:00 am and PagerDuty keeps firing you alerts about your application being down. You need to figure out what the issue is, if it’s impacting users, and resolve it […].

Kafka 49

3 Tools to Help Debug Slow Queries in MongoDB

Rockset

Regardless of what database you pick to run your application—MongoDB, Postgres, Oracle, or Cassandra—you will eventually encounter the same issue: slow queries. Slow queries can be the result of inefficient query design, inefficient table design, or general infrastructure problems. Although it may be tempting to add more machines or further complicate your data infrastructure to speed up your queries, improving the queries themselves is usually the best place to start when you want to improve da

MongoDB 40

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

Retailers - Don't be a Data Zombie!

Teradata

The retailer of the future is like a live organism - it will use a data brain to develop new agility and effective responses to rapidly evolving situations. Read more.

Retail 52

Build A StackOverflow Dashboard (Part 2): Crafting BigQuery Views and Superset Charts

Preset

In part 2, we'll start to visualize trends using Superset charts.

How Zalando prepares for Cyber Week

Zalando Engineering

Introduction: Cyber Week has become an increasingly important time of the year in e-commerce. In 2019, we attracted 840,000 new customers and our sales (Gross Merchandise Volume) increased by 32% compared to the previous year. During the event we grew faster as a business than over the rest of the year, when we grow at a 20-25% rate. Our peak orders per minute reached 7,200 compared to 4,200 the year before (+71% YoY).

7 Requirements for Digital Transformation

Cloudera

Digital transformation is not just about the technological transformation of an organization; it’s about transforming the organization’s culture. It’s not enough to bolt technology onto an existing strategy and consider it transformed. That’s the message from our Chief Marketing Officer Mick Hollison discussing digital transformation with Charlene Li at Cloudera Now.

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

The Superset REST API

Preset

A high level tour of Apache Superset's REST API


Collaboration is Key to Reducing Pain and Finding Value in Data

Cloudera

This is a guest blog post, authored by John Zantey, Director and Co-founder, Qabsu. When it comes to cloud, being an early adopter does not necessarily put you ahead of the game. I know of companies that have been perpetually “doing cloud” for 10 years, but very few that have “done cloud” in a way that democratises and makes data accessible, with minimal pain points.

Bytes 67