Sat. Oct 31, 2020 - Fri. Nov 06, 2020

The Journey Begins

Team Data Science

Week 1: 10/9/20 - 10/16/20 In my quest to further improve my overall data science skills, I pulled the trigger on October 9th, 2020, and enrolled in a Data Engineering boot camp led by Andreas Kretz. First, a little bit about myself. I have a background in Aerospace Engineering and have been in the industry for close to 15 years now. A little more than a year ago, I decided to pivot to Machine Learning and Data Science.

How insurers can better deliver at “The Moment of Truth”

Cloudera

It’s all about the Customer. Customers today expect services to be highly personalized. In a digital world tuned to understand your likes, dislikes, interests and preferences, we expect a similar level of customization in all aspects of our lives. Insurance is no different. Insurance is not something the average consumer thinks about every day, but when a life-changing event happens, insurance becomes extremely important.

Insurance 120

Keeping Netflix Reliable Using Prioritized Load Shedding

Netflix Tech

How viewers are able to watch their favorite show on Netflix while the infrastructure self-recovers from a system failure By Manuel Correa, Arthur Gonigberg, and Daniel West Getting stuck in traffic is one of the most frustrating experiences for drivers around the world. Everyone slows to a crawl, sometimes for a minor issue or sometimes for no reason at all.

Add Version Control To Your Data Lake With LakeFS

Data Engineering Podcast

Summary Data lakes are gaining popularity due to their flexibility and reduced cost of storage. Along with the benefits there are some additional complexities to consider, including how to safely integrate new data sources or test out changes to existing pipelines. In order to address these challenges the team at Treeverse created LakeFS to introduce version control capabilities to your storage layer.

Data Lake 100

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Branding Yourself

Team Data Science

Week 2: 10/16/20 - 10/23/20 Week 2 of the course consists of Modules 3 & 4. If you have not read my first blog go here. Module 3 focuses on creating a professional LinkedIn profile. Your LinkedIn profile is the world's access to you and how you want to be seen professionally. Below is a screenshot. So here, I have a professionally taken photograph, what I am interested in below, and the 'About' section that summarizes me in a professional sense.

Cloudera’s Pivot to a Virtual Internship Program

Cloudera

Typically, running smooth and successful internship programs requires in-person interactions with high touchpoints. From onboarding and regular meetings to coffee chats and welcome events to meet the team – it takes a lot to integrate a new intern. They’re not only new to the organization but new to the workforce, after all. Yet, with most tech companies going fully remote, Early Talent teams had to consider their options.

More Trending

Connect Teradata Vantage to Salesforce Data With Azure Data Factory

Teradata

This "how-to" guide will help you to connect Teradata Vantage using the Native Object Store feature to query Salesforce data sourced by Microsoft Azure Data Factory.

Data 59

Build a Slack Dashboard (Part 3): Transforming Data and Creating Cross Channel Visualizations

Preset

Build a beautiful Slack dashboard using open source tools Meltano and Superset. Part 3 of 3.

An Overview of Real Time Data Warehousing on Cloudera

Cloudera

Users today are asking ever more from their data warehouse. This is resulting in advancements of what is provided by the technology, and a resulting shift in the art of the possible. As an example of this, in this post we look at Real Time Data Warehousing (RTDW), which is a category of use cases customers are building on Cloudera and which is becoming more and more common amongst our customers.

What’s New in Confluent Cloud Security

Confluent

Today, the ability to capture and harness the value of data in real time is critical for businesses to remain competitive in a data-driven world. Apache Kafka®, a scalable, open-source, […].

Cloud 52

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Liquidity Monitoring: Dislocation

Ripple Engineering

In a recent post, my teammate Jennifer Xia outlined our motivation and initial direction for tracking XRP liquidity in support of RippleNet’s On-Demand Liquidity (ODL) service. ODL leverages the digital asset XRP to facilitate cross-border payments by sourcing destination currencies right at the time of payment. Jennifer’s post introduces the concept of order books and defines the implied FX rate or the FX rate implied by a pair of trades bridged through XRP.

Finance 52

Data Quality at Airbnb

Airbnb Tech

Part 1 —  Rebuilding at Scale Authors: Jonathan Parks, Vaughn Quoss, Paul Ellwood Introduction At Airbnb, we’ve always had a data-driven culture. We’ve assembled top-notch data science and engineering teams, built industry-leading data infrastructure, and launched numerous successful open source projects, including Apache Airflow and Apache Superset.

Coffee with Cloudera Partners: IBM

Cloudera

Featuring: Jerry Green, World Wide Open Source Sales and Strategy Leader at IBM. IBM and Cloudera joined forces to bring the best of both companies to enterprises seeking advanced data and AI solutions. Jerry Green, World Wide Open Source Sales and Strategy Leader at IBM, has been instrumental with the relationship since its inception. We wanted to probe deeper into the man, the myth, the legend, Jerry Green!

Scala 3: Indentation Quickly Explained

Rock the JVM

Some people love it, some hate it: Scala 3's indented syntax might surprise you with its potential to enhance your code structure

Scala 52

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

What Happened to the CEO in Waiting?

Teradata

Since the 2008 financial crisis the CFO's role has turned inward, & they have lost influence. What role should they play in the Bank of the Future, and can data be their savior?

Banking 52

Responding to Security Vulnerabilities in Open Source Project

Preset

Preset's commitment to security in Apache Superset™

Project 40

The Security Challenges of Data Warehousing in the Cloud

Cloudera

Many organizations struggle to meet growing and variable data warehouse demands. No matter how much they pad their annual IT budgets, there never seems to be enough capacity to cover unexpected business requests. This leads to resource restrictions for the various business units that use the platform. When business units are not well served by central IT, “shadow IT” emerges.

Cloud 76

Power BI Template App for SalesForce

FreshBI

So, what is a Power BI Template App? A Power BI Template App is a published Power BI solution that can be used by any company that has the data platform for which the Template App was created. Wouldn’t it be nice to pick your entire Power BI Solution off the shelf - one crafted for your specific business needs and your specific data structure? Power BI Template Apps are designed to be such an out-of-the-box solution, and this blog post presents one such Power BI Solution for Salesforce.

BI 52

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Cloudera at BioData World Congress 2020 – Use Cases at Top 5 Pharmaceutical Organizations

Cloudera

BioData World Congress 2020 is next week, and I am looking forward to the opportunity to meet with decision makers and thought leaders working in omics, diagnostics and R&D from across Europe and beyond. Cloudera’s work with BioPharma organizations helps them link clinical and business knowledge with analytics expertise to drive patient-level insights and operational decision making in a dynamic environment.