Sat.Jul 20, 2019 - Fri.Jul 26, 2019

article thumbnail

Straining Your Data Lake Through A Data Mesh

Data Engineering Podcast

Summary The current trend in data management is to centralize the responsibilities of storing and curating the organization’s information to a data engineering team. This organizational pattern is reinforced by the architectural pattern of data lakes as a solution for managing storage and access. In this episode Zhamak Dehghani shares an alternative approach in the form of a data mesh.

Data Lake 100
article thumbnail

Convolutional Neural Networks: A Python Tutorial Using TensorFlow and Keras

KDnuggets

Different neural network architectures excel in different tasks. This particular article focuses on crafting convolutional neural networks in Python using TensorFlow and Keras.

Python 119
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

What Should Your Enterprise Expect from its Cloud Analytics Vendor?

Teradata

Large enterprises are investing heavily in cloud-based analytics technologies. What qualities should they be looking for in these cloud vendors? Find out more.

Cloud 69
article thumbnail

Fault Tolerance in Distributed Systems: Tracing with Apache Kafka and Jaeger

Confluent

Using Jaeger tracing, I’ve been able to answer an important question that nearly every Apache Kafka ® project that I’ve worked on posed: how is data flowing through my distributed system? Quick disclaimer: if you’re simply looking for an answer to that question, this post won’t provide that answer directly. Instead, in this post I will point you to an earlier blog post where I already answered that question and then I will focus on what should be your next question: now that I’m relying on Jaege

Kafka 54
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Operational Analytics: What every software engineer should know about low-latency queries on large data sets

Rockset

Introduction to Operational Analytics Operational analytics is a very specific term for a type of analytics which focuses on improving existing operations. This type of analytics, like others, involves the use of various data mining and data aggregation tools to get more transparent information for business planning. The main characteristic that distinguishes operational analytics from other types of analytics is that it is “analytics on the fly," which means that signals emanating from the vari

article thumbnail

This New Google Technique Help Us Understand How Neural Networks are Thinking

KDnuggets

Recently, researchers from the Google Brain team published a paper proposing a new method called Concept Activation Vectors (CAVs) that takes a new angle to the interpretability of deep learning models.

More Trending

article thumbnail

Why I Can’t Wait for Kafka Summit San Francisco

Confluent

The Kafka Summit Program Committee recently published the schedule for the San Francisco event, and there’s quite a bit to look forward to. For starters, it is a two-day event, which means we get to attend 14 talks, miss out on 42 talks (that we’ll later watch on video), and spend two days hanging out with our favorite community friends. While the keynotes have not been announced yet (they will be soon!

Kafka 18
article thumbnail

Is SQL needed to be a data scientist?

KDnuggets

As long as there is ‘data’ in data scientist, Structured Query Language (or see-quel as we call it) will remain an important part of it. In this blog, let us explore data science and its relationship with SQL.

SQL 110
article thumbnail

Top 13 Skills To Become a Rockstar Data Scientist

KDnuggets

Education, coding, SQL, big data platforms, storytelling and more. These are the 13 skills you need to master to become a rockstar data scientist.

Education 120
article thumbnail

Fantastic Four of Data Science Project Preparation

KDnuggets

This article takes a closer look at the four fantastic things we should keep in mind when approaching every new data science project.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Top Certificates and Certifications in Analytics, Data Science, Machine Learning and AI

KDnuggets

Here are the top certificates and certifications in Analytics, AI, Data Science, Machine Learning and related areas.

article thumbnail

A Gentle Introduction to Noise Contrastive Estimation

KDnuggets

Find out how to use randomness to learn your data by using Noise Contrastive Estimation with this guide that works through the particulars of its implementation.

article thumbnail

50% ends Friday – Research Frontiers, AI Kick-start, BootCamp, and Career Expo

KDnuggets

ODSC focuses on research at its conferences and invites the experts pushing the boundaries of AI to speak. Between the two upcoming conferences, researchers from more than 20 of the top research institutes in the country (Open AI, NASA’s JPL, Google, MIT CSAIL, BAIR, The Turing Institute, and Max Planck and more) will deliver talks and lead trainings at ODSC West 2019.

IT 58
article thumbnail

How to Share Data Science Secrets Without Sacrificing Security

KDnuggets

Learn how to incorporate security into your practices without slowing down your project. Read this ActiveState blog post to learn more.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

High-Quality AI And Machine Learning Data Labeling At Scale: A Brief Research Report

KDnuggets

Analyst firm Cognilytica estimates that as much as 80% of machine learning project time is spent on aggregating, cleaning, labeling, and augmenting machine learning model data. So, how do innovative machine learning teams prepare data in such a way that they can trust its quality, cost of preparation, and the speed with which it’s delivered?

article thumbnail

Neural Code Search: How Facebook Uses Neural Networks to Help Developers Search for Code Snippets

KDnuggets

Developers are always searching for answers to questions about their code. But how do they ask the right questions? Facebook is creating new NLP neural networks to help search code repositories that may advance information retrieval algorithms.

Coding 47
article thumbnail

Top KDnuggets tweets, Jul 17-23: Papers with Code: A Fantastic GitHub Resource for Machine Learning

KDnuggets

Also: Data Science Jobs Report 2019: Python Way Up, TensorFlow Growing Rapidly, R Use Double SAS; The Hundred-Page Machine Learning Book Book Review; The Evolution of a ggplot; Notes on Feature Preprocessing: The What, the Why, and the How.

article thumbnail

Wake Forest University: Executive Director, Business Analytics Programs, School of Business [Winston Salem, NC]

KDnuggets

Responsible for operational leadership and management of the Master of Science in Business Analytics programs. Serves as a thought partner with the program Associate Dean to develop and execute program strategy.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?