Sat.Oct 29, 2022 - Fri.Nov 04, 2022

article thumbnail

9 Skills You Need to Become a Data Engineer

KDnuggets

A data engineer is a fast-growing profession with amazing challenges and rewards. Which skills do you need to become a data engineer? In this post, we’ll take a look at both hard and soft skills.

article thumbnail

The Scoop: Turmoil at Twitter

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of six topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. On Wednesday, 26 October, Elon Musk entered Twitter’s headquarters in San Francisco with a sink, marking his arrival at the company he’d just bought.

article thumbnail

See, Build, Test, Experiment: Using Data Science to Change the World with Erick Webbe

Jesse Anderson

My guest this week is Erick Webbe , Head of Data Science at bol.com. Bol.com is the biggest online retailer in northwestern Europe, serving about 12 million customers, as a general retailer similar to Amazon.com. Erick has a Master’s degree in Applied Physics. His background in physics forms a basis for his philosophy on life and work. That’s a “philosophy that I still apply to my work every single day […] we think about how we can best help them overcome that problem or solve it, and then

article thumbnail

Expanding The Reach of Business Intelligence Through Ubiquitous Embedded Analytics With Sisense

Data Engineering Podcast

Summary Business intelligence has grown beyond its initial manifestation as dashboards and reports. In its current incarnation it has become a ubiquitous need for analytics and opportunities to answer questions with data. In this episode Amir Orad discusses the Sisense platform and how it facilitates the embedding of analytics and data insights in every aspect of organizational and end-user experiences.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

30 Resources for Mastering Data Visualization

KDnuggets

Want to master data visualization? This list of 30 resources and tools will help you get started on your path toward mastering data visualization.

Data 160
article thumbnail

When Private Cloud is the Right Fit for Public Sector Missions

Cloudera

It’s no secret that IT modernization is a top priority for the US federal government. A quick trip in the congressional time machine to revisit 2017’s Modernizing Government Technology Act surfaces some of the most salient points regarding agencies’ challenges: The federal government spends nearly 75% of its annual information technology funding on operating and maintaining existing legacy information technology systems.

Cloud 95

More Trending

article thumbnail

Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt

Data Engineering Podcast

Summary One of the most impactful technologies for data analytics in recent years has been dbt. It’s hard to have a conversation about data engineering or analysis without mentioning it. Despite its widespread adoption there are still rough edges in its workflow that cause friction for data analysts. To help simplify the adoption and management of dbt projects Nandam Karthik helped create Optimus.

article thumbnail

15 Free Machine Learning and Deep Learning Books

KDnuggets

Check out this list of 15 FREE ebooks for learning machine learning and deep learning.

article thumbnail

Cloudera Partner Network: Poised to Heat up Channel Growth

Cloudera

You asked. We delivered: Introducing the Cloudera Partner Network. Cloudera’s channel programs team constantly strives to improve how we engage, reward, and collaborate with our partners. And based on your feedback, we’re replacing Cloudera Connect with a new program, Cloudera Partner Network (CPN). It provides more comprehensive tools and support to help you go to market faster, as well as industry-leading incentives and promotions that we’ve aligned with your business and sales models. .

article thumbnail

Diagnose and Debug Apache Kafka Issues: Understanding Reduced Message Throughput

Confluent

Learn how to pinpoint common Kafka issues, which producer metrics to monitor, and how to optimize Kafka to keep latency low and throughput high.

Kafka 72
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Can Web3 beat public cloud? by Colin Eberhardt

Scott Logic

There are a growing number of voices heralding Web3 as the future of the internet, and this technology (concept?) is receiving considerable coverage at conferences, in the technology press, and internet forums. I decided it was time to put Web3 to the test and see how it fares against the contemporary approach to building apps - the cloud. Unfortunately I found Web3 to be very lacking.

Cloud 59
article thumbnail

The Gap Between Deep Learning and Human Cognitive Abilities

KDnuggets

How do we bridge this gap between deep learning and human cognitive ability?

article thumbnail

Re-Imagining Data Observability

Databand.ai

Re-Imagining Data Observability Ryan Yackel 2022-11-04 10:36:35 Data observability has become one of the hottest topics of the year – and for good reason. Data observability provides an end-to-end view into exactly what’s happening with data pipelines across an organization’s data fabric. And it does so in real time. That means instead of the CEO getting a frantic call at 3 am that something is broken, teams can fix issues proactively before they become a bigger problem.

Data 52
article thumbnail

Vue 2 vs. Vue 3: What Are the Differences and Which Version Should You Choose?

Trio

Vue.js 2 or Vue 2 has been powering interactive web development for quite a few years now. ‘The progressive JavaScript framework’ is one of the most preferred technologies for developing web interfaces, as evident in the 2022 Stack Overflow Developer Survey.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

How Pinterest Leverages Realtime User Actions in Recommendation to Boost Homefeed Engagement Volume

Pinterest Engineering

Xue Xia, Software Engineer, Homefeed Ranking; Neng Gu, Software Engineer, Content & User Understanding; Dhruvil Deven Badani, Engineering Manager, Homefeed Ranking; Andrew Zhai, Software Engineer, Advanced Technologies Group Image from [link] In this blog post, we will demonstrate how we improved Pinterest Homefeed engagement volume from a machine learning model design perspective — by leveraging realtime user action features in Homefeed recommender system.

article thumbnail

Should I Learn Julia?

KDnuggets

Do you think learning Julia is better for your data science career? Let’s find out.

article thumbnail

Demystifying event streams: Transforming events into tables with dbt

dbt Developer Hub

Let’s discuss how to convert events from an event-driven microservice architecture into relational tables in a warehouse like Snowflake. Here are a few things we’ll address: Why you may want to use an architecture like this How to structure your event messages How to use dbt macros to make it easy to ingest new event streams Event Streams at Merit ​ At Merit, we’re building the leading verified identity platform.

Kafka 52
article thumbnail

Node.js vs. Go In 2023: Side-By-Side Comparison

Trio

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Learn How to Secure Your Kafka Cluster Automatically

Confluent

Kafka security is crucial, yet complex. We'll show you how to automatically secure a Kafka cluster for easy authentication and encryption to meet compliance objectives.

Kafka 52
article thumbnail

365 Data Science courses free until November 21

KDnuggets

The unlimited access initiative provides a risk-free way to break into data science.

article thumbnail

On Demand Webinar: Map And Monitor Your Data Journey

DataKitchen

Chris Bergh shares how to do a Data Journey in the on-demand webinar! The post On Demand Webinar: Map And Monitor Your Data Journey first appeared on DataKitchen.

Data 52
article thumbnail

Internet Egress Filtering of Services at Lyft

Lyft Engineering

Using Envoy as an Explicit CONNECT and Transparent Proxy Photo from Dan Meyers Unrestricted egress traffic from services poses a significant security risk as it allows external threats to exfiltrate data and download arbitrary payloads from untrusted, dangerous hosts. While ingress filtering from the Internet is ubiquitous using firewalls, it is far less common that companies are controlling and observing traffic leaving their network.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

5 Steps for Migrating from Elasticsearch to Rockset for Real-Time Analytics

Rockset

Nothing to Fear Migration is often viewed as a 4 letter word in IT. Something to avoid, something to fear and definitely not something to do on a whim. It’s an understandable position given the risk and horror stories associated with “Migration Projects”. This blog outlines best practices from customers I have helped migrate from Elasticsearch to Rockset , reducing risk and avoiding common pitfalls.

article thumbnail

The AI Education Gap and How to Close It

KDnuggets

AI education is broken, how do we solve it? Individuals end up learning a specific tool or tactic in a vacuum. They are missing the real-world applicability and collaboration that is critical to building impactful AI solutions in line with the organization’s strategy.

Education 122
article thumbnail

Real-time Data Integration from Oracle to Google BigQuery Using Striim

Striim

Hosted on the Google Cloud Blog, read on to learn how relational databases like Oracle store data but Striim and Google Cloud BigQuery ensure timely and accurate analytics at scale.

article thumbnail

From Dealership to Concierge – Leveraging Vehicle Data to Transform the Car Buying Experience

Teradata

Analyzing vehicle data should be transforming customer experience in the auto industry. Unfortunately, it is behind many others in terms of delivering individualized experiences based on insight.

Data 52
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

The New Rockset Query Editor Experience

Rockset

Developing SQL queries is an essential part of the Rockset product experience. We're excited to announce the release of a new query editor in the Rockset Console with improved performance and an updated design. Upgraded Performance The main motivation for the new query editor experience was to resolve the performance issues of our old query editor. While it was generally usable, typing in the old query editor would lag when working on large queries and become sluggish after long periods of time.

SQL 52
article thumbnail

How to Create a Sampling Plan for Your Data Project

KDnuggets

When simple random sampling is not that simple.

Project 116
article thumbnail

Tapping Into New Depths Of Business Analytics With IIM Indore!

U-Next

. According to a report by techjury , 7 out of 10 business rate data discovery as very important. Every organization/ small business/ large business etc., today employs Business Analytics on different levels to propel their businesses ahead. Data is at the core of every technological advancement made in recent times. . It’s been over a decade since the domain of data, and its analytics had been declared the hottest jobs of the 21 st century, and yet it shows no signs of slowing down or l

article thumbnail

Hard user separation with NixOS

Tweag

This guide explains how to install NixOS on a computer, with a twist. If you use the same computer in different contexts, let’s say for work and for your private life, you may wish to install two different operating systems to protect your private life data from mistakes or hacks from your work. For instance a cryptolocker you got from a compromised work email won’t lock out your family photos.

Coding 52
article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.