Sat.Mar 21, 2020 - Fri.Mar 27, 2020

article thumbnail

Coronavirus Data and Poll Analysis – yes, there is hope, if we act now

KDnuggets

We examine the growth of coronavirus daily cases in most affected countries, and show evidence that social distancing works in reducing the rate of spread. We also analyze KDnuggets Poll results - the scale of change to online and how Data Science work is likely to increase or drop in different regions. Stay Healthy and practice social distancing!

article thumbnail

Behind The Scenes Of The Linode Object Storage Service

Data Engineering Podcast

Summary There are a number of platforms available for object storage, including self-managed open source projects. But what goes on behind the scenes of the companies that run these systems at scale so you don’t have to? In this episode Will Smith shares the journey that he and his team at Linode recently completed to bring a fast and reliable S3 compatible object storage to production for your benefit.

Media 100
article thumbnail

ksqlDB: The Missing Link Between Real-Time Data and Big Data Streaming

Confluent

Is event streaming or batch processing more efficient in data processing? Is an IoT system the same as a data analytics system, and a fast data system the same as […].

article thumbnail

Five Books Every CX Leader Should Read in this Time of Social Distancing

Teradata

Check out this curated reading list of books on customer experience. From updated classics to new research and insights into how large enterprises can drive business outcomes from a CX initiative.

59
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Exploring TensorFlow Quantum, Google’s New Framework for Creating Quantum Machine Learning Models

KDnuggets

TensorFlow Quantum allow data scientists to build machine learning models that work on quantum architectures.

article thumbnail

Learn to Optimize Algorithms in Our New Algorithm Complexity Course

Dataquest

Algorithms are at the center of almost any programming job. And particularly in the world of data engineering, using efficient algorithms is important enough that it’s a common topic to be quizzed about in job interviews. That’s why we’ve just launched a new course! Algorithm Complexity is the latest course in our Data Engineer career path.

More Trending

article thumbnail

Nordea Bank

Teradata

Applying cognitive automation to keep pace with increasingly complex financial regulations and consumer expectations.

Banking 52
article thumbnail

Why BERT Fails in Commercial Environments

KDnuggets

The deployment of large transformer-based models in dynamic commercial environments often yields poor results. This is because commercial environments are usually dynamic, and contain continuous domain shifts between inference and training data.

Data 155
article thumbnail

Made With ML: Discover, build, and showcase machine learning projects

KDnuggets

This is a short introduction to Made With ML, a useful resource for machine learning engineers looking to get ideas for projects to build, and for those looking to share innovative portfolio projects once built.

article thumbnail

Want to Build an AI Model for Your Business? Read this

KDnuggets

The best approach for AI production is similar to what venture capitalists (VC’s) do when they evaluate and invest in startups.

Building 136
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Graph Neural Network model calibration for trusted predictions

KDnuggets

In this article, we’ll talk about calibration in graph machine learning, and how it can help to build trust in these powerful new models.

article thumbnail

Evaluating Ray: Distributed Python for Massive Scalability

KDnuggets

If your team has started using ?Ray? and you’re wondering what it is, this post is for you. If you’re wondering if Ray should be part of your technical strategy for Python-based applications, especially ML and AI, this post is for you.

Python 118
article thumbnail

Diffusion Map for Manifold Learning, Theory and Implementation

KDnuggets

This article aims to introduce one of the manifold learning techniques called Diffusion Map. This technique enables us to understand the underlying geometric structure of high dimensional data as well as to reduce the dimensions, if required, by neatly capturing the non-linear relationships between the original dimensions.

article thumbnail

Top AI Resources – Directory for Remote Learning

KDnuggets

Whether you are just learning Data Science, a current professional, or just interested, it's crucial to keep the mind stimulated and stay current. With conferences, schools, and travel largely canceled because of #coronavirus, these remote resources will help you stay engaged.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Alternative Data, Text Analytics, and Sentiment Analysis in Trading and Investing

KDnuggets

Different types of data beyond your typical dollars and cents have been used in the finance industry for many years. By leveraging machine learning, sentiment data is expected to play an increasingly dominant role in the investment industry, and this article highlights some special challenges of its use in trading models.

Finance 108
article thumbnail

Top Stories, Mar 16-22: 24 Best (and Free) Books To Understand Machine Learning

KDnuggets

Also: Time Series Classification Synthetic vs Real Financial Time Series; Nine lessons learned during my first year as a Data Scientist; What is the most effective policy response to the new coronavirus pandemic?; Nine lessons learned during my first year as a Data Scientist; Five Interesting Data Engineering Projects.

article thumbnail

How to Make Remote Work Effective for Data Science Teams

KDnuggets

This post aims to highlight some work from home best practices, both general and data science-specific, in order to help data scientists and teams remain productive, connected and happy while working remotely.

article thumbnail

KDnuggets™ News 20:n12, Mar 25: 24 Best (and Free) Books To Understand Machine Learning; Coronavirus Daily Change and Poll Analysis; 9 lessons learned during 1st year as a Data Scientist

KDnuggets

Read our analysis of coronavirus data and poll results; Use your time indoors to learn with 24 best and free books to understand Machine Learning; Study the 9 important lessons from the first year as a Data Scientist; Understand the SVM, a top ML algorithm; check a comprehensive list of AI resources for online learning; and more.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Top KDnuggets tweets, Mar 18-24: Advice to Data Scientists: don’t post a blog on #coronavirus based on your ad-hoc data analysis without reading something on epidemiology – here are some useful links

KDnuggets

#Coronavirus growth in Western countries: March 19 update - Spain and US cases; If you need a break from #coronavirus news, here is #AlphaGo - The Movie; Which Country Has Flattened the Curve for the #Coronavirus?; Excellent source of #Coronavirus info - FT.

article thumbnail

How to Make Your Open Source Apache Kafka Connector Available on Confluent Hub

Confluent

Do you have data you need to get into or out of Apache Kafka®? Kafka connectors are perfect for this. There are many connectors out there, usually for well-known and […].

Kafka 59
article thumbnail

Don’t let panic worsen the COVID-19 crisis: Let data run the supply chain

Teradata

Understanding the supply chain and how panic buying not only worsens the situation but also has long term impact on forecasting and demand models.

Data 45