A Gentle Introduction to PyTorch 1.2
KDnuggets
SEPTEMBER 20, 2019
This comprehensive tutorial aims to introduce the fundamentals of PyTorch building blocks for training neural networks.
Confluent
SEPTEMBER 20, 2019
As a distributed system for collecting, storing, and processing data at scale, Apache Kafka® comes with its own deployment complexities. Luckily for on-premises scenarios, a myriad of deployment options are available, such as the Confluent Platform, which can be deployed on bare metal, virtual machines, containers, etc. But deployment is just the tip of the iceberg.
Data Engineering Podcast
SEPTEMBER 18, 2019
Summary: The conventional approach to analytics involves collecting large amounts of data that can be cleaned, followed by a separate step for analysis and interpretation. Unfortunately, this strategy is not viable for handling real-time, real-world use cases such as traffic management or supply chain logistics. In this episode Simon Crosby, CTO of Swim Inc., explains how the SwimOS kernel and the enterprise data fabric built on top of it enable brand new use cases for instant insights.
Teradata
SEPTEMBER 17, 2019
Learn how to better classify data & analytics within the analytic ecosystem by analyzing the various states of data & analytics within organizations. Read more.
KDnuggets
SEPTEMBER 17, 2019
The article contains a brief introduction of Bioinformatics and how a machine learning classification algorithm can be used to classify the type of cancer in each patient by their gene expressions.
Confluent
SEPTEMBER 16, 2019
Running a single Apache Kafka® cluster across multiple datacenters (DCs) is a common, yet somewhat taboo architecture. This architecture, referred to as a stretch cluster, provides several operational benefits and unlocks the door to many use cases. Stretch clusters provide better durability guarantees and make disaster recovery much easier by avoiding the problem of offset translation and restarting clients.
Data Engineering Digest brings together the best content for data engineering professionals from the widest variety of industry thought leaders.
Teradata
SEPTEMBER 15, 2019
Want scale? Without multitasking capabilities, Teradata Vantage would not be able to support hundreds or thousands of user queries at the same time. Learn more.
KDnuggets
SEPTEMBER 17, 2019
We identify two main groups of Data Science skills: A: 13 core, stable skills that most respondents have and B: a group of hot, emerging skills that most do not have (yet) but want to add. See our detailed analysis.
Confluent
SEPTEMBER 19, 2019
When people ask me the very top-level question “why do people use Kafka,” I usually lead with the story in my last post, where I talked about how Apache Kafka® is helping us deliver on the promises the cloud made to us a decade ago. But I follow it up quickly with a second and potentially unrelated pattern: real-time data pipelines. These provide a different set of motivations for using an event streaming platform than scaling and microservices: specifically, the need to produce analytics resu
KDnuggets
SEPTEMBER 17, 2019
Lately, varying improvements over BERT have been shown — and here I will contrast the main similarities and differences so you can choose which one to use in your research or application.
KDnuggets
SEPTEMBER 19, 2019
“I want to learn machine learning and artificial intelligence, where do I start?” Here.
KDnuggets
SEPTEMBER 16, 2019
The career path of the Data Scientist remains a hot target for many with its continuing high demand. Becoming one requires developing a broad set of skills including statistics, programming, and even business acumen. Learn more about one person's experience making this journey, and discover the many resources available to help you find your way into a world of data science.
KDnuggets
SEPTEMBER 20, 2019
With recent advances in AI being enabled through access to so much “Big Data” and cheap computing power, there is incredible momentum in the field. Can big data really deliver on all this hype, and what can go wrong?
KDnuggets
SEPTEMBER 18, 2019
Algorithms are at the core of data science, and sampling is a critical technique that can make or break a project. Learn more about the most common sampling techniques used, so you can select the best approach while working with your data.
KDnuggets
SEPTEMBER 17, 2019
For some people, anything below 60% is acceptable, and for certain others, even a correlation of 30% to 40% is considered too high because one variable may just end up exaggerating the performance of the model or completely messing up parameter estimates.
KDnuggets
SEPTEMBER 17, 2019
What other creative tools for data science beyond Python and R can you use to make an impression? It's not about the tool -- it's about its impact.
KDnuggets
SEPTEMBER 20, 2019
When we create our machine learning models, a common task that falls on us is how to tune them. So that brings us to the quintessential question: Can we automate this process?
KDnuggets
SEPTEMBER 16, 2019
The new emerging field that wants to study AI agents the way social scientists study humans.
KDnuggets
SEPTEMBER 14, 2019
New KDnuggets Cartoon looks at one of the hottest directions in Machine Learning and asks: can Machine Learning be too unsupervised?
KDnuggets
SEPTEMBER 18, 2019
Also: Cartoon: Unsupervised #MachineLearning?; Cartoon: Unsupervised Machine Learning?; How to Become More Marketable as a Data Scientist; Ensemble Methods for Machine Learning: AdaBoost.
KDnuggets
SEPTEMBER 19, 2019
While mature algorithms and extensive open-source libraries are widely available for machine learning practitioners, sufficient data to apply these techniques remains a core challenge. Discover how to leverage scikit-learn and other tools to generate synthetic data appropriate for optimizing and fine-tuning your models.
KDnuggets
SEPTEMBER 16, 2019
How to turn a typical PyTorch script into a scalable d6tflow DAG for faster research & development.
KDnuggets
SEPTEMBER 19, 2019
Check out this detailed tutorial on applying data science to the cybersecurity domain, written by an individual with backgrounds in both fields.
KDnuggets
SEPTEMBER 18, 2019
Read about how one data scientist copes with his boring days of deploying machine learning.
KDnuggets
SEPTEMBER 18, 2019
This article covers the implementation of a data scraping and natural language processing project which had two parts: scrape as many posts from Reddit’s API as allowed, and then use classification models to predict the origin of the posts.
KDnuggets
SEPTEMBER 16, 2019
Also: The 5 Graph Algorithms That Data Scientists Should Know; Many Heads Are Better Than One: The Case For Ensemble Learning; BERT is changing the NLP landscape; I wasn't getting hired as a Data Scientist; There is No Free Lunch in Data Science.
KDnuggets
SEPTEMBER 18, 2019
Support for Python 2 will expire on Jan. 1, 2020, after which the Python core language and many third-party packages will no longer be supported or maintained. Take this survey to help determine and share your level of preparation.
KDnuggets
SEPTEMBER 17, 2019
Join this technical webinar on Oct 3, where Domino Chief Data Scientist Josh Poduska will dive into popular open source and proprietary AutoML tools, and walk through hands-on examples of how to install and use these tools, so you can start using these technologies in your work right away.
KDnuggets
SEPTEMBER 19, 2019
Whether it’s demand forecasting, supply chain management, or any other application, getting it right requires balancing the need for performance with the constraints of implementation and complexity. Learn more in this free webinar, Data-Driven Approaches to Forecasting, Sep 26.
KDnuggets
SEPTEMBER 16, 2019
The UC Center for Business Analytics will present the Data Science Symposium 2019 on Oct 10 & 11, featuring 3 keynote speakers and 16 tech talks/tutorials on a wide range of data science topics and tools.