Sat.Oct 29, 2022 - Fri.Nov 04, 2022

article thumbnail

The Scoop: Turmoil at Twitter

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of six topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. On Wednesday, 26 October, Elon Musk entered Twitter’s headquarters in San Francisco with a sink, marking his arrival at the company he’d just bought.

article thumbnail

See, Build, Test, Experiment: Using Data Science to Change the World with Erick Webbe

Jesse Anderson

My guest this week is Erick Webbe , Head of Data Science at bol.com. Bol.com is the biggest online retailer in northwestern Europe, serving about 12 million customers, as a general retailer similar to Amazon.com. Erick has a Master’s degree in Applied Physics. His background in physics forms a basis for his philosophy on life and work. That’s a “philosophy that I still apply to my work every single day […] we think about how we can best help them overcome that problem or solve it, and then

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Expanding The Reach of Business Intelligence Through Ubiquitous Embedded Analytics With Sisense

Data Engineering Podcast

Summary Business intelligence has grown beyond its initial manifestation as dashboards and reports. In its current incarnation it has become a ubiquitous need for analytics and opportunities to answer questions with data. In this episode Amir Orad discusses the Sisense platform and how it facilitates the embedding of analytics and data insights in every aspect of organizational and end-user experiences.

article thumbnail

9 Skills You Need to Become a Data Engineer

KDnuggets

A data engineer is a fast-growing profession with amazing challenges and rewards. Which skills do you need to become a data engineer? In this post, we’ll take a look at both hard and soft skills.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Consistent caching mechanism in Titus Gateway

Netflix Tech

by Tomasz Bak and Fabio Kung Introduction Titus is the Netflix cloud container runtime that runs and manages containers at scale. In the time since it was first presented as an advanced Mesos framework, Titus has transparently evolved from being built on top of Mesos to Kubernetes, handling an ever-increasing volume of containers. As the number of Titus users increased over the years, the load and pressure on the system increased substantially.

Systems 91
article thumbnail

When Private Cloud is the Right Fit for Public Sector Missions

Cloudera

It’s no secret that IT modernization is a top priority for the US federal government. A quick trip in the congressional time machine to revisit 2017’s Modernizing Government Technology Act surfaces some of the most salient points regarding agencies’ challenges: The federal government spends nearly 75% of its annual information technology funding on operating and maintaining existing legacy information technology systems.

Cloud 88

More Trending

article thumbnail

30 Resources for Mastering Data Visualization

KDnuggets

Want to master data visualization? This list of 30 resources and tools will help you get started on your path toward mastering data visualization.

Data 160
article thumbnail

Diagnose and Debug Apache Kafka Issues: Understanding Reduced Message Throughput

Confluent

Learn how to pinpoint common Kafka issues, which producer metrics to monitor, and how to optimize Kafka to keep latency low and throughput high.

Kafka 72
article thumbnail

Cloudera Partner Network: Poised to Heat up Channel Growth

Cloudera

You asked. We delivered: Introducing the Cloudera Partner Network. Cloudera’s channel programs team constantly strives to improve how we engage, reward, and collaborate with our partners. And based on your feedback, we’re replacing Cloudera Connect with a new program, Cloudera Partner Network (CPN). It provides more comprehensive tools and support to help you go to market faster, as well as industry-leading incentives and promotions that we’ve aligned with your business and sales models. .

article thumbnail

Can Web3 beat public cloud? by Colin Eberhardt

Scott Logic

There are a growing number of voices heralding Web3 as the future of the internet, and this technology (concept?) is receiving considerable coverage at conferences, in the technology press, and internet forums. I decided it was time to put Web3 to the test and see how it fares against the contemporary approach to building apps - the cloud. Unfortunately I found Web3 to be very lacking.

Cloud 59
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

15 Free Machine Learning and Deep Learning Books

KDnuggets

Check out this list of 15 FREE ebooks for learning machine learning and deep learning.

article thumbnail

Re-Imagining Data Observability

Databand.ai

Re-Imagining Data Observability Ryan Yackel 2022-11-04 10:36:35 Data observability has become one of the hottest topics of the year – and for good reason. Data observability provides an end-to-end view into exactly what’s happening with data pipelines across an organization’s data fabric. And it does so in real time. That means instead of the CEO getting a frantic call at 3 am that something is broken, teams can fix issues proactively before they become a bigger problem.

Data 52
article thumbnail

Vue 2 vs. Vue 3: What Are the Differences and Which Version Should You Choose?

Trio

Vue.js 2 or Vue 2 has been powering interactive web development for quite a few years now. ‘The progressive JavaScript framework’ is one of the most preferred technologies for developing web interfaces, as evident in the 2022 Stack Overflow Developer Survey.

article thumbnail

Demystifying event streams: Transforming events into tables with dbt

dbt Developer Hub

Let’s discuss how to convert events from an event-driven microservice architecture into relational tables in a warehouse like Snowflake. Here are a few things we’ll address: Why you may want to use an architecture like this How to structure your event messages How to use dbt macros to make it easy to ingest new event streams Event Streams at Merit ​ At Merit, we’re building the leading verified identity platform.

Kafka 52
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Top Posts October 24-30: How to Select Rows and Columns in Pandas

KDnuggets

How to Select Rows and Columns in Pandas Using [ ],loc, iloc,at and.iat • Decision Tree Algorithm, Explained • Graphs: The natural way to understand data • 7 Techniques to Handle Imbalanced Data • A Data Science Portfolio That Will Land You The Job in 2022.

Portfolio 108
article thumbnail

How Pinterest Leverages Realtime User Actions in Recommendation to Boost Homefeed Engagement Volume

Pinterest Engineering

Xue Xia, Software Engineer, Homefeed Ranking; Neng Gu, Software Engineer, Content & User Understanding; Dhruvil Deven Badani, Engineering Manager, Homefeed Ranking; Andrew Zhai, Software Engineer, Advanced Technologies Group Image from [link] In this blog post, we will demonstrate how we improved Pinterest Homefeed engagement volume from a machine learning model design perspective — by leveraging realtime user action features in Homefeed recommender system.

article thumbnail

5 Steps for Migrating from Elasticsearch to Rockset for Real-Time Analytics

Rockset

Nothing to Fear Migration is often viewed as a 4 letter word in IT. Something to avoid, something to fear and definitely not something to do on a whim. It’s an understandable position given the risk and horror stories associated with “Migration Projects”. This blog outlines best practices from customers I have helped migrate from Elasticsearch to Rockset , reducing risk and avoiding common pitfalls.

article thumbnail

Internet Egress Filtering of Services at Lyft

Lyft Engineering

Using Envoy as an Explicit CONNECT and Transparent Proxy Photo from Dan Meyers Unrestricted egress traffic from services poses a significant security risk as it allows external threats to exfiltrate data and download arbitrary payloads from untrusted, dangerous hosts. While ingress filtering from the Internet is ubiquitous using firewalls, it is far less common that companies are controlling and observing traffic leaving their network.

article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

Getting Started with spaCy for NLP

KDnuggets

In this blog, we will explore how to get started with spaCy right from the installation to explore the various functionalities it provides.

IT 112
article thumbnail

From Dealership to Concierge – Leveraging Vehicle Data to Transform the Car Buying Experience

Teradata

Analyzing vehicle data should be transforming customer experience in the auto industry. Unfortunately, it is behind many others in terms of delivering individualized experiences based on insight.

Data 52
article thumbnail

The New Rockset Query Editor Experience

Rockset

Developing SQL queries is an essential part of the Rockset product experience. We're excited to announce the release of a new query editor in the Rockset Console with improved performance and an updated design. Upgraded Performance The main motivation for the new query editor experience was to resolve the performance issues of our old query editor. While it was generally usable, typing in the old query editor would lag when working on large queries and become sluggish after long periods of time.

SQL 52
article thumbnail

Tapping Into New Depths Of Business Analytics With IIM Indore!

U-Next

. According to a report by techjury , 7 out of 10 business rate data discovery as very important. Every organization/ small business/ large business etc., today employs Business Analytics on different levels to propel their businesses ahead. Data is at the core of every technological advancement made in recent times. . It’s been over a decade since the domain of data, and its analytics had been declared the hottest jobs of the 21 st century, and yet it shows no signs of slowing down or l

article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

article thumbnail

The Gap Between Deep Learning and Human Cognitive Abilities

KDnuggets

How do we bridge this gap between deep learning and human cognitive ability?

article thumbnail

Hard user separation with NixOS

Tweag

This guide explains how to install NixOS on a computer, with a twist. If you use the same computer in different contexts, let’s say for work and for your private life, you may wish to install two different operating systems to protect your private life data from mistakes or hacks from your work. For instance a cryptolocker you got from a compromised work email won’t lock out your family photos.

Coding 52
article thumbnail

Real-time Data Integration from Oracle to Google BigQuery Using Striim

Striim

Hosted on the Google Cloud Blog, read on to learn how relational databases like Oracle store data but Striim and Google Cloud BigQuery ensure timely and accurate analytics at scale.

article thumbnail

Transitioning from aiohttp to FastAPI

Picnic Engineering

While we love using Java in Picnic, we also adore Python. Over the years, our tech landscape has been enriched with numerous Python services, for instance to distribute product information across our backend ecosystem. We started small, adopting aiohttp as our HTTP server framework of choice. For many years, our Python services flourished as part of the bigger Picnic machine.

Python 52
article thumbnail

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

article thumbnail

The AI Education Gap and How to Close It

KDnuggets

AI education is broken, how do we solve it? Individuals end up learning a specific tool or tactic in a vacuum. They are missing the real-world applicability and collaboration that is critical to building impactful AI solutions in line with the organization’s strategy.

article thumbnail

JavaScript vs. TypeScript: What Are the Differences?

Trio

JavaScript has been powering web development since the early 2000s. A mainstay of interactive web development, JavaScript has been the recipient of many language enhancements over the years. One such enhancement is TypeScript.

article thumbnail

Data Engineering Weekly #105

Data Engineering Weekly

Data Engineering Weekly Is Brought to You by RudderStack RudderStack provides data pipelines that make it easy to collect data from every application, website, and SaaS platform, then activate it in your warehouse and business tools. Sign up free to test out the tool today. Editor’s Note: The current state of the Data Catalog The results are out for our poll on the current state of the Data Catalogs.

article thumbnail

Learn How to Secure Your Kafka Cluster Automatically

Confluent

Kafka security is crucial, yet complex. We'll show you how to automatically secure a Kafka cluster for easy authentication and encryption to meet compliance objectives.

Kafka 52
article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.