Sat.Aug 31, 2019 - Fri.Sep 06, 2019

article thumbnail

I wasn’t getting hired as a Data Scientist. So I sought data on who is.

KDnuggets

Instead of focusing on skills thought to be required of data scientists, we can look at what they have actually done before.

Data 123
article thumbnail

Introducing Derivative Event Sourcing

Confluent

First, what is event sourcing? Here’s an example. Consider your bank account: viewing it online, the first thing you notice is often the current balance. How many of us drill down to see how we got there? We probably all ask similar questions such as: What payments have cleared? Did my direct deposit hit yet? Why am I spending so much money at Sephora?

Kafka 22
article thumbnail

Building A Community For Data Professionals at Data Council

Data Engineering Podcast

Summary Data professionals are working in a domain that is rapidly evolving. In order to stay current we need access to deeply technical presentations that aren’t burdened by extraneous marketing. To fulfill that need Pete Soderling and his team have been running the Data Council series of conferences and meetups around the world. In this episode Pete discusses his motivation for starting these events, how they serve to bring the data community together, and the observations that he has ma

Building 100
article thumbnail

Taking Analytics to the 4th Dimension

Teradata

4D analytics combines geospatial, temporal and time series data to do advanced analysis of time and space. Learn how to uncover new insights today.

Data 56
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Advice on building a machine learning career and reading research papers by Prof. Andrew Ng

KDnuggets

This blog summarizes the career advice/reading research papers lecture in the CS230 Deep learning course by Stanford University on YouTube, and includes advice from Andrew Ng on how to read research papers.

article thumbnail

How to Use Schema Registry and Avro in Spring Boot Applications

Confluent

TL;DR. Following on from How to Work with Apache Kafka in Your Spring Boot Application , which shows how to get started with Spring Boot and Apache Kafka ® , here I will demonstrate how to enable usage of Confluent Schema Registry and Avro serialization format in your Spring Boot applications. Using Avro schemas, you can establish a data contract between your microservices applications.

Kafka 20

More Trending

article thumbnail

How Reinforcement Learning is Changing Customer Engagement

Teradata

Companies are increasingly exploring opportunities to apply reinforcement learning to their most challenging problems. Learn what applications work the best.

56
article thumbnail

Automated Machine Learning: Just How Much?

KDnuggets

This is an interview between Rosaria Silipo and data scientists Paolo Tamagnini, Simon Schmid and Christian Dietz, asking a few questions on the topic of automated machine learning from their point of view, and some interesting examples of its practical use.

article thumbnail

Real-Time Analytics in the World of Virtual Reality and Live Streaming

Rockset

"A fast-moving technology field where new tools, technologies and platforms are introduced very frequently and where it is very hard to keep up with new trends." I could be describing either the VR space or Data Engineering, but in fact this post is about the intersection of both. Virtual Reality – The Next Frontier in Media I work as a Data Engineer at a leading company in the VR space, with a mission to capture and transmit reality in perfect fidelity.

article thumbnail

AsyncTask, Rx, and Coroutines… Oh My!

Pandora Engineering

Credit: Sally Anscombe An Android Apprentice’s journey to understand Pandora’s migration from AsyncTask to newer APIs During my second month as an Android Engineer Apprentice, I was tasked with migrating AsyncTask to newer APIs. Early on, I was asked, “Do you know why we are migrating from AsyncTask?” I wracked my brain and answered shyly, “It has something to do with memory leaks?

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Taking Analytics to the 4th Dimension

Teradata

4D analytics combines geospatial, temporal and time series data to do advanced analysis of time and space. Learn how to uncover new insights today.

Data 40
article thumbnail

Python Libraries for Interpretable Machine Learning

KDnuggets

In the following post, I am going to give a brief guide to four of the most established packages for interpreting and explaining machine learning models.

article thumbnail

An Overview of Topics Extraction in Python with Latent Dirichlet Allocation

KDnuggets

A recurring subject in NLP is to understand large corpus of texts through topics extraction. Whether you analyze users’ online reviews, products’ descriptions, or text entered in search bars, understanding key topics will always come in handy.

Python 123
article thumbnail

An Easy Introduction to Machine Learning Recommender Systems

KDnuggets

Recommender systems are an important class of machine learning algorithms that offer "relevant" suggestions to users. Categorized as either collaborative filtering or a content-based system, check out how these approaches work along with implementations to follow from example code.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

TensorFlow vs PyTorch vs Keras for NLP

KDnuggets

These three deep learning frameworks are your go-to tools for NLP, so which is the best? Check out this comparative analysis based on the needs of NLP, and find out where things are headed in the future.

article thumbnail

Automate your Python Scripts with Task Scheduler: Windows Task Scheduler to Scrape Alternative Data

KDnuggets

In this tutorial, you will learn how to run task scheduler to web scrape data from Lazada (eCommerce) website and dump it into SQLite RDBMS Database.

Python 123
article thumbnail

Top 10 Data Science Use Cases in Energy and Utilities

KDnuggets

In this article, we will consider the most vivid data science use cases in the industry of energy and utilities.

Utilities 122
article thumbnail

Build Your First Voice Assistant

KDnuggets

Hone your practical speech recognition application skills with this overview of building a voice assistant using Python.

Building 114
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

What’s the difference between analytics and statistics?

KDnuggets

From asking the best questions about data to answering those questions with certainty, understanding the value of these two seemingly different professions is clarified when you see how they should work together.

Data 107
article thumbnail

6 Tips for Building a Training Data Strategy for Machine Learning

KDnuggets

Without a well-defined approach for collecting and structuring training data, launching an AI initiative becomes an uphill battle. These six recommendations will help you craft a successful strategy.

article thumbnail

3 Ways to Manage Human Bias in the Analytics Process

KDnuggets

Managing human bias is an important part of the analytics process. Learn about three areas to watch out for to ensure your models as unbiased as possible.

Process 105
article thumbnail

Beyond Neurons: Five Cognitive Functions of the Human Brain that we are Trying to Recreate with Artificial Intelligence

KDnuggets

The quest for recreating cognitive capabilities of the brain in deep neural networks remains one of the elusive goals of AI. Let’s explore some human cognitive skills that are serving as inspiration to a new generation of AI techniques.

105
105
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Learn Quantum Computing with Python and Q#, Get Programming with Python, Data Science with Python and Dask

KDnuggets

Save 40% on Get Programming with Python, Data Science with Python and Dask, and Learn Quantum Computing with Python and Q# with code nlpython40.

Python 100
article thumbnail

Top KDnuggets tweets, Aug 28 – Sep 03: The 8 Neural Network Architectures #MachineLearning Researchers Need to Learn

KDnuggets

Also: The secret sauce for growing from a data analyst to a data scientist; 4 Tips for Advanced Feature Engineering and Preprocessing; R Users’ Salaries from the 2019 Stackoverflow Survey; Emoji Analytics.

article thumbnail

Starting out in Data Science? Top tips and advice from DataScienceGO Speakers

KDnuggets

DataScienceGO returns to San Diego Sep 27-29, for a three-day career-focused conference designed to unite newcomers, practitioners, managers and executives under one umbrella, speakers weigh in on how to forge the best teams, increase your hiring chances, and prepare for the future.

article thumbnail

Designing Dashboards that Users Actually Like – Free Webcast

KDnuggets

See how creating a system of purpose-specific displays enables users to quickly get answers to their data-related questions.

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Cartoon: Labor Day in the age of AI

KDnuggets

KDnuggets cartoon looks at how AI will impact Labor Day in the year 2050.

69
article thumbnail

KDnuggets™ News 19:n33, Sep 4: Data Science Skills Poll; Object-oriented Programming for Data Scientists

KDnuggets

This week: Object-oriented programming for data scientists; Deep Learning Next Step: Transformers and Attention Mechanism; R Users' Salaries from the 2019 Stackoverflow Survey; Types of Bias in Machine Learning; 4 Tips for Advanced Feature Engineering and Preprocessing; and much more!

article thumbnail

TensorFlow Optimization Showdown: ActiveState vs. Anaconda

KDnuggets

In this TensorFlow tutorial, you’ll learn the impact of optimizing both operators and entire graphs, how to efficiently organize data in training and testing datasets to minimize data shuffling, and how to identify a well-optimized model using Anaconda and ActivePython.

article thumbnail

Top Stories, Aug 26 – Sep 1: Object-oriented programming for data scientists; Why Data Visualization Is The Most Important Skill in a Data Analyst Arsenal

KDnuggets

Also: Types of Bias in Machine Learning; Deep Learning Next Step: Transformers and Attention Mechanism; New Poll: Data Science Skills; R Users Salaries from the 2019 Stackoverflow Survey; How to Sell Your Boss on the Need for Data Analytics.

article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.