Sat. Mar 05, 2022 - Fri. Mar 11, 2022

Top Posts Feb 28 – Mar 6: The Complete Collection of Data Science Cheat Sheets – Part 2

KDnuggets

Also: Calculus: The hidden building block of machine learning; Decision Tree Algorithm, Explained; Telling a Great Data Story: A Visualization Decision Tree; The Complete Collection of Data Science Cheat Sheets – Part 1.

How to make Apache Kafka clients go fast(er) on Confluent Cloud

Confluent

Imagine your team wants to design a data streaming architecture and you’re in charge of creating the prototype. Within a few minutes, you provision a fully managed Apache Kafka® cluster […].

Developer Friendly Application Persistence That Is Fast And Scalable With HarperDB

Data Engineering Podcast

Summary Databases are an important component of application architectures, but they are often difficult to work with. HarperDB was created with the core goal of being a developer friendly database engine. In the process they ended up creating a scalable distributed engine that works across edge and datacenter environments to support a variety of novel use cases.

Women Leaders in Data Discuss Breaking Bias on International Women’s Day

Cloudera

As an official sponsor of International Women’s Day, Cloudera is excited to celebrate Women’s History Month and International Women’s Day, and to take up the mantle of this year’s theme, #BreakTheBias. Even in industries where women are underrepresented, like tech, women have made a lot of progress. Progress over many decades has slowly transformed the workplace into an environment where women’s strengths are recognized and valued.

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Boost Your AI and ML Skills for Free at NVIDIA Conference

KDnuggets

Four-day conference offers hundreds of learning and development opportunities in AI, ML, DL, robotics, data science and high performance computing for developers at all levels.

Connect Microsoft Azure Services to Vantage

Teradata

This getting started guide describes high-level Teradata Vantage connectivity options for Microsoft Azure services.

More Trending

#BreakTheBias: It’s a Journey

Cloudera

Bias is everywhere. We’re surrounded by it. And it’s natural. We are alive today as a species because of biases. But it has a tangible impact on our personal and professional lives. Biases shape us and our experience. As primary caregivers, women have felt the impact of biases and expectations more keenly during the pandemic. Last year women in my network felt like they were being expected to do everything at home and at work.

8 Women in AI Who Are Striving to Humanize the World

KDnuggets

Some exceptional female researchers and engineers are working on projects to make the world a better place with the help of AI, data science, and machine learning.

An Introduction to Data Mesh

Confluent

Decentralized architectures continue to flourish as engineering teams look to unlock the potential of their people and systems. From Git, to microservices, to cryptocurrencies, these designs look to decentralization as […].

Women of Teradata: Hillary Ashton

Teradata

In honor of Women's History Month & Int'l Women's Day, we are spotlighting Hillary Ashton, Teradata's Chief Product Officer, as she looks back on her career and gives advice to young women in tech.

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

New Features in Cloudera Streams Messaging for CDP Public Cloud 7.2.14

Cloudera

With the launch of CDP Public Cloud 7.2.14, Cloudera Streams Messaging for Data Hub deployments has gotten some powerful new features! In this release, the Streams Messaging templates in Data Hub will come with Apache Kafka 2.8 and Cruise Control 2.5, providing new core features and fixes. KConnect has been added and gains additional capabilities with new connectors and Stateless Apache NiFi capabilities, which can run NiFi Flows as connectors.

The Significance of Data Quality in Making a Successful Machine Learning Model

KDnuggets

Good quality data becomes imperative and a basic building block of an ML pipeline. The ML model can only be as good as its training data.

Confluent’s Data Streaming Platform Can Save Over $2.5M vs. Self-Managing Apache Kafka

Confluent

If you’re reading this, it’s likely because you are leveraging (or considering) Apache Kafka® in your organization—especially as it has become the de facto standard for data streaming. Adopted by […].

Why Mutability Is Essential for Real-Time Data Analytics

Rockset

This is the first post in a series by Rockset's CTO Dhruba Borthakur on Designing the Next Generation of Data Systems for Real-Time Analytics. We'll be publishing more posts in the series in the near future, so subscribe to our blog so you don't miss them! Posts published so far in the series: Why Mutability Is Essential for Real-Time Data Analytics; Handling Out-of-Order Data in Real-Time Analytics Applications; Handling Bursty Traffic in Real-Time Analytics Applications; SQL and Complex Queries A

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

#ClouderaLife Spotlight: Vicki Zingiris

Cloudera

March 8 marks International Women’s Day and as we celebrate the accomplishments of dynamic women across the world, I sat across from one such Clouderan, Vicki Zingiris, Director of Value-Based Services. We discussed important initiatives at Cloudera, the influence that martial arts has had on how she leads, collaborates, and mentors, and concluded with some valuable advice for women in the workforce.

Build a Machine Learning Web App in 5 Minutes

KDnuggets

In this article, you will learn to export your models and use them outside a Jupyter Notebook environment. You will build a simple web application that is able to feed user input into a machine learning model, and display an output prediction to the user.

Monte Carlo Named To Enterprise Tech 30 For Second Consecutive Year

Monte Carlo

For the second consecutive year, Monte Carlo was today named to the Enterprise Tech 30 (ET30), an exclusive list of the most promising companies in enterprise technology, as determined by some of the world’s top venture capitalists. Sponsored by Wing Venture Capital and Nasdaq, more than 15,000 private venture-backed companies are considered. The list is then narrowed to 10 early stage ($25 million or less raised), 10 mid stage ($25 to $100 million raised), and 10 late stage ($100 million or mor

dbt + Machine Learning: What makes a great baton pass?

dbt Developer Hub

Special Thanks: Emilie Schario, Matt Winkler. dbt has done a great job of building an elegant, common interface between data engineers, analytics engineers, and any data-y role, by uniting our work on SQL. This unification of tools and workflows creates interoperability between what would normally be distinct teams within the data organization. I like to call this interoperability a “baton pass.

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

Why I Joined Rockset (With Six Months Hindsight)

Rockset

I’ve found that every startup today fits into one of two categories: (1) a solution that focuses mainly on enhancing a larger solution or platform; (2) a solution that has arrived in the wake of those that have come before. The first category is a symbiotic relationship — think of the shark and remora fish. Even though there is a mutual benefit, one of the parties has a more significant dependency on the other.

How a Level System can Help Forecast AI Costs

KDnuggets

To forecast costs for AI systems, it can be useful to talk about their “level” just like SAE has levels for self-driving cars. Adopting a level system can help organizations plan and prepare for AI systems that scale in complexity over time.

Treat Your Data Like An Engineering Problem: An Interview with Snowflake Director of Product Management Chris Child

Monte Carlo

Monte Carlo’s Barr Moses sat down with Snowflake Director of Product Management Chris Child to talk about building data platforms at scale, how awesome data teams approach data quality, the role of data observability tools in the modern data stack, and more. To put it simply, to understand modern data engineering, you need to understand Snowflake. And as your data platform becomes productized, you need to get serious about data quality.

Best Workplaces for Women in Ireland 2022

Cloudera

Over the past decade, Cloudera has matured to become a leading-edge technology company, supporting a diverse range of customers across the globe. At Cloudera, we are passionate about helping our customers identify opportunities for innovation and growth, enabling them to accelerate their digital transformation, and aiding them to solve some of society’s largest challenges.

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

How Long Does It Take to Learn Data Science Fundamentals?

KDnuggets

This article discusses 2 levels of data science learning, and the amount of time that will need to go into each. From 6 months to 4 years, this write-up covers a number of skills and how long it takes to acquire them.

Building a Tractable, Feature Engineering Pipeline for Multivariate Time Series

KDnuggets

A time series feature engineering pipeline requires different transformations, such as imputation and window aggregation, which follow a sequence of stages. This article demonstrates the building of a pipeline to derive multivariate time series features such that the features can then be easily tracked and validated.

5 Data Science Projects to Learn 5 Critical Data Science Skills

KDnuggets

Learn these to take any data science project idea from brainstorm to deployment.

How To Use Synthetic Data To Overcome Data Shortages For Machine Learning Model Training

KDnuggets

It takes time and considerable resources to collect, document, and clean data before it can be used. But there is a way to address this challenge – by using synthetic data.

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

Open Data and Why it is Necessary

KDnuggets

Open data improves accessibility and encourages universal participation, which allows companies to create cutting-edge, data-driven technologies and make the world a better place.

New Ways of Sharing Code Blocks for Data Scientists

KDnuggets

Share interactive code blocks to impress your colleagues, or post them on social media.

Data Science: Reality vs Expectations

KDnuggets

In the majority of companies, the executives in charge of data science and of the decision-making process that relies on it have little or no education in, or understanding of, actual data science. Where does this leave you, the data scientist?

Using Data Science to Make Clean Energy More Equitable

KDnuggets

Here are some lessons inspired by a recent panel the author moderated about how data scientists can help put equity into practice.

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.