Sat.Mar 05, 2022 - Fri.Mar 11, 2022

article thumbnail

8 Women in AI Who Are Striving to Humanize the World

KDnuggets

Some exceptional female researchers and engineers are working on projects to make the world a better place with the help of AI, data science, and machine learning.

article thumbnail

How to make Apache Kafka clients go fast(er) on Confluent Cloud

Confluent

Imagine your team wants to design a data streaming architecture and you’re in charge of creating the prototype. Within a few minutes, you provision a fully managed Apache Kafka® cluster […].

Kafka 126
article thumbnail

Women Leaders in Data Discuss Breaking Bias on International Women’s Day

Cloudera

As an official sponsor of International Women’s Da y, Cloudera is excited to celebrate Women’s History Month and International Women’s Day, and to take up the mantle of this year’s theme #BreakTheBias. . Even in industries where women are underrepresented, like tech, women have made a lot of progress. Progress over many decades has slowly transformed the workplace into an environment where women’s strengths are recognized and valued.

Big Data 119
article thumbnail

Developer Friendly Application Persistence That Is Fast And Scalable With HarperDB

Data Engineering Podcast

Summary Databases are an important component of application architectures, but they are often difficult to work with. HarperDB was created with the core goal of being a developer friendly database engine. In the process they ended up creating a scalable distributed engine that works across edge and datacenter environments to support a variety of novel use cases.

article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Boost Your AI and ML Skills for Free at NVIDIA Conference

KDnuggets

Four-day conference offers hundreds of learning and development opportunities in AI, ML, DL, robotics, data science and high performance computing for developers at all levels.

article thumbnail

Connect Microsoft Azure Services to Vantage

Teradata

This getting started guide describes ‘high-level’ Teradata Vantage connectivity options with the Microsoft Azure Services. Find out more.

98

More Trending

article thumbnail

Move Your Database To The Data And Speed Up Your Analytics With DuckDB

Data Engineering Podcast

Summary When you think about selecting a database engine for your project you typically consider options focused on serving multiple concurrent users. Sometimes what you really need is an embedded database that is blazing fast for single user workloads. DuckDB is an in-process database engine optimized for OLAP applications to speed up your analytical queries that meets you where you are, whether that’s Python, R, Java, even the web.

Database 100
article thumbnail

5 Data Science Projects to Learn 5 Critical Data Science Skills

KDnuggets

Learn these to take any data science project idea from brainstorm to deployment.

article thumbnail

An Introduction to Data Mesh

Confluent

Decentralized architectures continue to flourish as engineering teams look to unlock the potential of their people and systems. From Git, to microservices, to cryptocurrencies, these designs look to decentralization as […].

article thumbnail

New Features in Cloudera Streams Messaging for CDP Public Cloud 7.2.14

Cloudera

With the launch of CDP Public Cloud 7.2.14, Cloudera Streams Messaging for Data Hub deployments has gotten some powerful new features! In this release , the Streams Messaging templates in Data Hub will come with Apache Kafka 2.8 and Cruise Control 2.5 providing new core features and fixes. KConnect has been added and gains additional capabilities with new connectors and Stateless Apache NiFi capabilities which can run NiFi Flows as connectors.

Cloud 112
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Women of Teradata: Hillary Ashton

Teradata

In honor of Women's History Month & Int'l Women's Day, we are spotlighting Hillary Ashton, Teradata's Chief Product Officer, as she looks back on her career and gives advice to young women in tech.

59
article thumbnail

The Significance of Data Quality in Making a Successful Machine Learning Model

KDnuggets

Good quality data becomes imperative and a basic building block of an ML pipeline. The ML model can only be as good as its training data.

article thumbnail

Confluent’s Data Streaming Platform Can Save Over $2.5M vs. Self-Managing Apache Kafka

Confluent

If you’re reading this, it’s likely because you are leveraging (or considering) Apache Kafka® in your organization—especially as it has become the de facto standard for data streaming. Adopted by […].

Kafka 64
article thumbnail

#ClouderaLife Spotlight: Vicki Zingiris

Cloudera

March 8 marks International Women’s Day and as we celebrate the accomplishments of dynamic women across the world, I sat across from one such Clouderan, Vicki Zingiris, Director of Value-Based Services. We discussed important initiatives at Cloudera, the influence that Martial Arts has had on how she leads, collaborates, and mentors, and concluded with some valuable advice for women in the workforce. .

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Why Mutability Is Essential for Real-Time Data Analytics

Rockset

This is the first post in a series by Rockset's CTO Dhruba Borthakur on Designing the Next Generation of Data Systems for Real-Time Analytics. We'll be publishing more posts in the series in the near future, so subscribe to our blog so you don't miss them! Posts published so far in the series: Why Mutability Is Essential for Real-Time Data Analytics Handling Out-of-Order Data in Real-Time Analytics Applications Handling Bursty Traffic in Real-Time Analytics Applications SQL and Complex Queries A

article thumbnail

Top Posts Feb 28 – Mar 6: The Complete Collection of Data Science Cheat Sheets – Part 2

KDnuggets

Also: Calculus: The hidden building block of machine learning; Decision Tree Algorithm, Explained; Telling a Great Data Story: A Visualization Decision Tree; The Complete Collection of Data Science Cheat Sheets – Part 1.

article thumbnail

Monte Carlo Named To Enterprise Tech 30 For Second Consecutive Year

Monte Carlo

For the second consecutive year, Monte Carlo was today named to the Enterprise Tech 30 (ET30), an exclusive list of the most promising companies in enterprise technology, as determined by some of the world’s top venture capitalists. Sponsored by Wing Venture Capital and Nasdaq, more than 15,000 private venture-backed companies are considered. The list is then narrowed to 10 early stage ($25 million or less raised), 10 mid stage ($25 to $100 million raised), and 10 late stage ($100 million or mor

article thumbnail

Best Workplaces for Women in Ireland 2022

Cloudera

Over the past decade, Cloudera has matured to become a leading-edge technology company, supporting a diverse range of customers, across the globe. At Cloudera, we are passionate about helping our customers identify opportunities for innovation and growth, enabling them to accelerate their digital transformation, and aiding them to solve some of societies’ largest challenges.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

dbt + Machine Learning: What makes a great baton pass?

dbt Developer Hub

Special Thanks: Emilie Schario, Matt Winkler dbt has done a great job of building an elegant, common interface between data engineers, analytics engineers, and any data-y role, by uniting our work on SQL. This unification of tools and workflows creates interoperability between what would normally be distinct teams within the data organization. I like to call this interoperability a “baton pass.

article thumbnail

New Ways of Sharing Code Blocks for Data Scientists

KDnuggets

Share the interactive code blocks to impress your colleagues or post it on social media.

Coding 157
article thumbnail

Treat Your Data Like An Engineering Problem: An Interview with Snowflake Director of Product Management Chris Child

Monte Carlo

Monte Carlo’s Barr Moses sat down with Snowflake Director of Product Management Chris Child to talk about building data platforms at scale, how awesome data teams approach data quality, the role of data observability tools in the modern data stack, and more. To put it simply, to understand modern data engineering, you need to understand Snowflake. And as your data platform becomes productized, you need to get serious about data quality.

article thumbnail

Why I Joined Rockset (With Six Months Hindsight)

Rockset

Photo by Adil from Pexels I’ve found that every startup today fits into one of two categories: A solution that focuses mainly on enhancing a larger solution or platform. A solution that has arrived in the wake of those that have come before. The first category is a symbiotic relationship — think of the shark and remora fish. Even though there is a mutual benefit, one of the parties has a more significant dependency on the other.

MongoDB 52
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Build a Machine Learning Web App in 5 Minutes

KDnuggets

In this article, you will learn to export your models and use them outside a Jupyter Notebook environment. You will build a simple web application that is able to feed user input into a machine learning model, and display an output prediction to the user.

article thumbnail

Maximize Your Productivity as a Data Scientist by Organizing

KDnuggets

Read some tips on getting organized when it comes to working with data.

Data 144
article thumbnail

How To Use Synthetic Data To Overcome Data Shortages For Machine Learning Model Training

KDnuggets

It takes time and considerable resources to collect, document, and clean data before it can be used. But there is a way to address this challenge – by using synthetic data.

article thumbnail

How a Level System can Help Forecast AI Costs

KDnuggets

To forecast costs for AI systems, it can be useful to talk about their “level” just like SAE has levels for self-driving cars. Adopting a level system can help organizations plan and prepare for AI systems that scale in complexity over time.

Systems 131
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Cloud-Native Super Computing

KDnuggets

NVIDIA BlueField DPUs provide on-demand, simple and secure high-performance computing and AI services.

Cloud 131
article thumbnail

Data Science: Reality vs Expectations

KDnuggets

In the majority of companies, the executives in charge of data science and the decision-making process using data science, have little or no education or understanding in actual data science. Where does this leave you, the data scientist?

article thumbnail

How Long Does It Take to Learn Data Science Fundamentals?

KDnuggets

This article discusses 2 levels of data science learning, and the amount of time that will need to go into each. From 6 months to 4 years, this write-up covers a number of skills and how long it takes to acquire them.

article thumbnail

Using Data Science to Make Clean Energy More Equitable

KDnuggets

Here are some lessons inspired by a recent panel the author moderated about how data scientists can help put equity into practice.

article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.