Sat.Nov 14, 2020 - Fri.Nov 20, 2020

article thumbnail

Analysing historical and live data with ksqlDB and Elastic Cloud

Confluent

Building data pipelines isn’t always straightforward. The gap between the shiny “hello world” examples of demos and the gritty reality of messy data and imperfect formats is sometimes all too […].

Cloud 139
article thumbnail

Announcing the 2020 Data Impact Award Winners

Cloudera

What a fantastic 24-hours it has been here at Cloudera. During the first-ever virtual broadcast of our annual Data Impact Awards (DIA) ceremony, we had the great pleasure of announcing this year’s finalists and winners. Streamed to hundreds of people around the globe, we were able to come together to celebrate some incredible successes. . In a year marked by unusual events, and disruption to our “normal” lives, it was a pleasure to recognize our customers’ most impressive achievements.

Medical 124
article thumbnail

Self Service Data Management From Ingest To Insights With Isima

Data Engineering Podcast

Summary The core mission of data engineers is to provide the business with a way to ask and answer questions of their data. This often takes the form of business intelligence dashboards, machine learning models, or APIs on top of a cleaned and curated data set. Despite the rapid progression of impressive tools and products built to fulfill this mission, it is still an uphill battle to tie everything together into a cohesive and reliable platform.

article thumbnail

Web Scraping Using R.!

Data Science Blog: Data Engineering

In this blog, I’ll show you, How to Web Scrape using R.? What is R.? R is a programming language and its environment built for statistical analysis, graphical representation & reporting. R programming is mostly preferred by statisticians, data miners, and software programmers who want to develop statistical software. R is also available as Free Software under the terms of the Free Software Foundation’s GNU General Public License in source code form.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Use Cases and Architectures for HTTP and REST APIs with Apache Kafka

Confluent

This blog post presents the use cases and architectures of REST APIs and Confluent REST Proxy, and explores a new management API and improved integrations into Confluent Server and Confluent […].

article thumbnail

How a modern data platform supports government fraud detection

Cloudera

November 15-21 marks International Fraud Awareness Week – but for many in government, that’s every week. From bogus benefits claims to fraudulent network activity, fraud in all its forms represents a significant threat to government at all levels. Some experts estimate the U.S. government loses nearly 150 billion dollars due to potential fraud each year, McKinsey & Company reports.

More Trending

article thumbnail

Scala 3: Givens vs Implicits Quickly Explained

Rock the JVM

Building on the previous article's insights into givens, let's explore how they stack up against the traditional Scala implicits

Scala 52
article thumbnail

How Real-Time Stream Processing Safely Scales with ksqlDB, Animated

Confluent

Software engineering memes are in vogue, and nothing is more fashionable than joking about how complicated distributed systems can be. Despite the ribbing, many people adopt them. Why? Distributed systems […].

Process 122
article thumbnail

Fraud Detection using Deep Learning

Cloudera

One of the many areas where machine learning has made a large difference for enterprise business is in the ability to make accurate predictions in the realm of fraud detection. Knowing that a transaction is fraudulent is a critical requirement for financial services companies, but knowing that a transaction that was flagged by a rules-based system as fraudulent is a valid transaction, can be equally important.

article thumbnail

NLP Heroes, Pinot, Data Testing, and More: Top 10 Links From Across the Web

Data Council

Here's our November 2020 roundup of good reads and podcast episodes that might be relevant for your career in data: 1. Heroes of NLP: Quoc Le (Deeplearning.ai) NLP researcher Quoc Le was recently Andrew Ng’s guest as part of the ‘Heroes of NLP’ video series. Their discussion covered Le’s impressive journey, from growing up in Vietnam and developing his first basic chatbot in high school to becoming Google Brain’s first intern, and everything that followed.

Data 52
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Grouparoo Raises $3M Seed Round

Grouparoo

We are excited to announce that Grouparoo has raised $3M in seed funding to make SaaS integrations easier for engineering. This round was led by Eniac Ventures and Fuel Capital. We’re also honored and humbled to have great participants in the round including Hack VC , Liquid2 , SCM Advisors , Stacy Brown-Philpot , J Zac Stein , Meka Asonye , Jonathan Grant , and others with experience that will be helpful in our journey.

article thumbnail

The Cloud-Native Evolution of Apache Kafka on Kubernetes

Confluent

It’s almost KubeCon! Let’s talk about the state of cloud-native Apache Kafka® and other distributed systems on Kubernetes. Over the last decade, our industry has seen the rise of container […].

Kafka 75
article thumbnail

Combating Fraud in Insurance with Data

Cloudera

Well, it is International Fraud Awareness Week, focused on promoting fraud prevention and education. A fantastic initiative! Maybe I am naïve but I feel a bit sad that there is a need for “fraud week”. The insurance industry has a long and intimate relationship with fraud in many different ways. Insurance fraud can take place at a process or business function level, most notably in claims or underwriting.

Insurance 106
article thumbnail

Is Skepticism Thwarting Your Grandiose AI Plans?

Teradata

The currency of Trust is taking on a new form. As insurance companies rely more on artificial intelligence to make decisions, humans must now trust machines as much as Humans.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Scala 3: Given and Using Clauses

Rock the JVM

Explore Scala 3's given/using clauses: a crucial feature for modern Scala programming, and learn how to leverage them effectively

Scala 52
article thumbnail

Kafka Summit 2021 – Double the Fun

Confluent

My own sense of the passage of time in 2020 is no sure guide, but honestly it seems like Kafka Summit just happened—yet here we are, deep into planning for […].

Kafka 69
article thumbnail

Fraud Prevention – 3 Data Strategies for Financial Services

Cloudera

Fraud awareness in the Financial Services industry is more important than ever. According to the September 2020 benchmarking report conducted by the Association of Certified Fraud Examiners (ACFE) in response to the coronavirus, 77% of survey respondents, representing a range of industries, have observed an increase in the overall level of fraud as of August, compared with 68% in May.

Banking 106
article thumbnail

Brinker International, Inc.

Teradata

Cloud data analytics is helping to assist in the delivery of serving quality, hot food, with a passionate customer experience that keeps visitors craving seconds.

Food 52
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Smart Schema: Enabling SQL Queries on Semi-Structured Data

Rockset

Rockset is a real-time indexing database in the cloud for serving low-latency, high-concurrency queries at scale. It is particularly well-suited for serving the real-time analytical queries that power apps, such as personalization or recommendation engines, location search, and so on. In this blog post, we show how Rockset’s Smart Schema feature lets developers use real-time SQL queries to extract meaningful insights from raw semi-structured data ingested without a predefined schema.

article thumbnail

#ClouderaLife Spotlight: Teresa Morris, Sr. Manager, Technical Partner Support

Cloudera

Meet Teresa Morris! A 3.5 year Clouderan working as a Sr. Manager, Technical Partner Support. Her role entails building and managing support partnerships – it’s one she finds rewarding. “It’s not a one project kind of thing, it’s a whole experience of managing partnerships that bring more business. Being a part of a digital transformation and all the things that drive customers experience is so fulfilling.” .

article thumbnail

With great technology, comes great responsibility

Cloudera

On November 18th, we kicked off our EMEA Influential Women in Data series. As a company, diversity and inclusion is at the very core of our leadership and our company. We’re lucky enough to interact with incredible talent on a daily basis amongst our clients and teams – talent that this series aims to showcase. Our first interviewee was Helen Davis, Assistant Director of IT and Digital at West Midlands Police (WMP).

article thumbnail

What Banks Can Learn From Disney

Teradata

Being the best bank isn't good enough! In order to win, the Bank of the Future needs to take a page from Big Tech’s playbook & use data to drive personalization of services.

Banking 52
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.