Introduction to Natural Language Processing (NLP)
KDnuggets
OCTOBER 25, 2019
Have you ever wondered how your personal assistant (e.g: Siri) is built? Do you want to build your own? Perfect! Let’s talk about Natural Language Processing.
KDnuggets
OCTOBER 25, 2019
Have you ever wondered how your personal assistant (e.g: Siri) is built? Do you want to build your own? Perfect! Let’s talk about Natural Language Processing.
Data Engineering Podcast
OCTOBER 21, 2019
Summary The scale and complexity of the systems that we build to satisfy business requirements is increasing as the available tools become more sophisticated. In order to bridge the gap between legacy infrastructure and evolving use cases it is necessary to create a unifying set of components. In this episode Dipti Borkar explains how the emerging category of data orchestration tools fills this need, some of the existing projects that fit in this space, and some of the ways that they can work to
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Confluent
OCTOBER 24, 2019
I’ve written an event sourcing bank simulation in Clojure (a lisp build for Java virtual machines or JVMs) called open-bank-mark , which you are welcome to read about in my previous blog post explaining the story behind this open source example. As a next step, specifically for this article I’ve added SSL and combined some topics together, using the subject name strategy option of Confluent Schema Registry , making it more production like, adding security, and making it possible to put multiple
Teradata
OCTOBER 21, 2019
With four new offerings, Teradata is helping companies move from analytics to answers wherever they are on their cloud journey. Read more.
Speaker: Tamara Fingerlin, Developer Advocate
Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.
KDnuggets
OCTOBER 24, 2019
In this post, you will see 3 different techniques of how to do Feature Selection to your datasets and how to build an effective predictive model.
Rockset
OCTOBER 21, 2019
At Rockset, our singular focus is to be the shortest (and most efficient) path from data to applications for our users. We recognize and truly believe that our success lies in the success of our users. We constantly think about improving our workflows, coming up with new ones and iterating on them in ways that takes the user experience to a whole new level.
Data Engineering Digest brings together the best content for data engineering professionals from the widest variety of industry thought leaders.
Teradata
OCTOBER 22, 2019
With our renewed focus on user experience, we’re applying user-centered design principles & conducting ethnographic research on key personas, starting with developers.
KDnuggets
OCTOBER 22, 2019
For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions.
Netflix Tech
OCTOBER 23, 2019
Jeremy Smith , Jonathan Indig , Faisal Siddiqi We are pleased to announce the open-source launch of Polynote : a new, polyglot notebook with first-class Scala support, Apache Spark integration, multi-language interoperability including Scala, Python, and SQL, as-you-type autocomplete, and more. Polynote provides data scientists and machine learning researchers with a notebook environment that allows them the freedom to seamlessly integrate our JVM-based ML platform ?
KDnuggets
OCTOBER 21, 2019
If you are interested in learning more about the latest Youtube recommendation algorithm paper, read this post for details on its approach and improvements.
Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage
There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.
KDnuggets
OCTOBER 21, 2019
One way to process data faster and more efficiently is to detect abnormal events, changes or shifts in datasets. Anomaly detection refers to identification of items or events that do not conform to an expected pattern or to other items in a dataset that are usually undetectable by a human expert.
KDnuggets
OCTOBER 23, 2019
The task: train and evaluate a simple time series model using a random forest of regression trees and the NYC Yellow taxi dataset.
KDnuggets
OCTOBER 24, 2019
See how Deep Learning can help in solving one of the most commonly diagnosed cancer in women.
KDnuggets
OCTOBER 23, 2019
In this crash course on GANs, we explore where they fit into the pantheon of generative models, how they've changed over time, and what the future has in store for this area of machine learning.
Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives
Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri
KDnuggets
OCTOBER 22, 2019
Convert your Data Science Projects into cool apps easily without knowing any web frameworks.
KDnuggets
OCTOBER 25, 2019
The pandas library offers core functionality when preparing your data using Python. But, many don't go beyond the basics, so learn about these lesser-known advanced methods that will make handling your data easier and cleaner.
KDnuggets
OCTOBER 22, 2019
AI is becoming more mainstream. The fact that computers/robots will learn after being built and will surpass a human's intelligence is terrifying.
KDnuggets
OCTOBER 23, 2019
You need to know how many people visit your store now and what sort of audience you're acquiring. Foot traffic data is going to be invaluable to the success of your business.
Advertisement
With Airflow being the open-source standard for workflow orchestration, knowing how to write Airflow DAGs has become an essential skill for every data engineer. This eBook provides a comprehensive overview of DAG writing features with plenty of example code. You’ll learn how to: Understand the building blocks DAGs, combine them in complex pipelines, and schedule your DAG to run exactly when you want it to Write DAGs that adapt to your data at runtime and set up alerts and notifications Scale you
KDnuggets
OCTOBER 22, 2019
To address the current difficulties in hiring data scientists due to their short supply, many companies can benefit from retraining existing analytically minded employees.
KDnuggets
OCTOBER 24, 2019
While there is much excitement today around implementing AI at the enterprise level, the financial costs of this process are often unexpected and underappreciated. These seven myths are crucial lessons learned that executives should know before heading down the road to AI.
KDnuggets
OCTOBER 23, 2019
Read useful advice on how to become a good data scientist; see how you can write your 1st neural net in under 30 lines of Keras code; Understand why AI salaries are heading skywards and what skills you need for them; and read about key ideas and methods in anomaly detection.
KDnuggets
OCTOBER 25, 2019
Semiotics helps us understand the importance of context to determining the meaning of a term and discourse communities provide us with the background context (mental model) by which to correctly interpret its meaning correctly.
Speaker: Tamara Fingerlin, Developer Advocate
In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!
KDnuggets
OCTOBER 23, 2019
Also: The 5 Classification Evaluation Metrics Every Data Scientist Must Know; How to Recognize a Good Data Scientist Job From a Bad One; How to Easily Deploy Machine Learning Models Using Flask.
KDnuggets
OCTOBER 21, 2019
Recently, a group of AI experts from Microsoft Research published a paper proposing a method for scene understanding that combines two key tasks: image captioning and visual question answering (VQA).
KDnuggets
OCTOBER 21, 2019
Also: The 5 Classification Evaluation Metrics Every Data Scientist Must Know; Artificial Intelligence: Salaries Heading Skyward; Writing Your First Neural Net in Less Than 30 Lines of Code with Keras; How to select rows and columns in Pandas using [ ],loc, iloc,at and.iat; The Last SQL Guide for Data Analysis You'll Ever Need.
KDnuggets
OCTOBER 23, 2019
Samsung’s Tech Day event showcases processor and memory advances for 5G, AI, Cloud and Edge Computing, Automotive, IoT, and more.
Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage
When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m
Teradata
OCTOBER 20, 2019
A new study by Teradata and research firm, Vanson Bourne, shines a light on the market forces impacting the world's largest companies. Find out more.
Netflix Tech
OCTOBER 21, 2019
Open Sourcing Mantis: A Platform For Building Cost-Effective, Realtime, Operations-Focused Applications By Jeff Chao on behalf of the Mantis team Today we’re excited to announce that we’re open sourcing Mantis , a platform that helps Netflix engineers better understand the behavior of their applications to ensure the highest quality experience for our members.
Netflix Tech
OCTOBER 23, 2019
Jeremy Smith , Jonathan Indig , Faisal Siddiqi We are pleased to announce the open-source launch of Polynote : a new, polyglot notebook with first-class Scala support, Apache Spark integration, multi-language interoperability including Scala, Python, and SQL, as-you-type autocomplete, and more. Polynote provides data scientists and machine learning researchers with a notebook environment that allows them the freedom to seamlessly integrate our JVM-based ML platform ?
Netflix Tech
OCTOBER 23, 2019
Jeremy Smith , Jonathan Indig , Faisal Siddiqi We are pleased to announce the open-source launch of Polynote : a new, polyglot notebook with first-class Scala support, Apache Spark integration, multi-language interoperability including Scala, Python, and SQL, as-you-type autocomplete, and more. Polynote provides data scientists and machine learning researchers with a notebook environment that allows them the freedom to seamlessly integrate our JVM-based ML platform ?
Advertisement
Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.
Let's personalize your content