Sat.Apr 30, 2022 - Fri.May 06, 2022

article thumbnail

Hypothesis Testing Explained

KDnuggets

This brief overview of the concept of Hypothesis Testing covers its classification in parametric and non-parametric tests, and when to use the most popular ones, including means, correlation, and distribution, in the case of one sample and two samples.

IT 160
article thumbnail

AI-First Benefits: 5 Real-World Outcomes

Cloudera

Artificial intelligence (AI) has been a focus for research for decades, but has only recently become truly viable. The availability and maturity of automated data collection and analysis systems is making it possible for businesses to implement AI across their entire operations to boost efficiency and agility. AI has the potential to transform operations by improving three fundamental business requirements: process automation, decision-making based on data insights, and customer interaction.

Insurance 129
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Evolving And Scaling The Data Platform at Yotpo

Data Engineering Podcast

Summary Building a data platform is an iterative and evolutionary process that requires collaboration with internal stakeholders to ensure that their needs are being met. Yotpo has been on a journey to evolve and scale their data platform to continue serving the needs of their organization as it increases the scale and sophistication of data usage. In this episode Doron Porat and Liran Yogev explain how they arrived at their current architecture, the capabilities that they are optimizing for, an

article thumbnail

How to Remove Apache Kafka Brokers the Easy Way

Confluent

The recent release of Confluent Cloud and Confluent Platform 7.0 introduced the ability to easily remove Apache Kafka® brokers and shrink your Confluent Server cluster with just a single command. […].

Kafka 84
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Machine Learning Is Not Like Your Brain Part One: Neurons Are Slow, Slow, Slow

KDnuggets

Artificial intelligence is not all that intelligent. While today’s AI can do some extraordinary things, the functionality underlying its accomplishments has very little to do with the way in which a human brain works to achieve the same tasks.

article thumbnail

Choose Compliance, Choose Hybrid Cloud

Cloudera

As digital transformation accelerates, and digital commerce increasingly becomes the dominant form of all commerce, regulators and governments around the world are recognizing the increased need for consumer protections and data protection measures. The European Union has been at the vanguard for some time (most recently having reached provisional agreement on the Digital Services Act ) but from Australia to Brazil , from South Africa to California (the rest of the US hasn’t quite caught on yet!

Cloud 103

More Trending

article thumbnail

From the Cellar to the Cloud – How Aedifion is Driving Next-Generation Building Automation with Confluent

Confluent

It is no exaggeration that a lot is going wrong in commercial buildings today. The building and construction sector consumes 36% of global final energy and accounts for almost 40% […].

article thumbnail

SQL Notes for Professionals: The Free eBook Review

KDnuggets

The free book is a combination of SQL cheat sheets and practical database examples. It provided bite-size information about every SQL function and attribute with coding samples.

SQL 159
article thumbnail

Winning With Data in the Fight Against Fraud, Waste, and Abuse

Cloudera

Fraud, waste, and abuse (FWA) in government is a constant, multi-billion dollar issue that challenges agency leaders at all levels and across all sectors, from healthcare to education to taxation to Social Security. The scope and scale of public spending — federal outlays alone were approximately $6.6 trillion in fiscal year 2020 according to the Congressional Budget Office — make FWA an inherently difficult problem to solve.

article thumbnail

Monte Carlo Named One of the Best Places to Work in the Bay Area for 2022

Monte Carlo

I’m honored to share that Monte Carlo was just named a Best Place to Work in the Bay Area for 2022 by the San Francisco Business Times and the Silicon Valley Business Journal, placing 6th in the small business category. This recognition is especially meaningful to our leadership team because the results are based directly on employee feedback, collected anonymously from a third-party researcher.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Slim CI/CD with Bitbucket Pipelines

dbt Developer Hub

Continuous Integration (CI) sets the system up to test everyone’s pull request before merging. Continuous Deployment (CD) deploys each approved change to production. “Slim CI” refers to running/testing only the changed code, thereby saving compute. In summary, CI/CD automates dbt pipeline testing and deployment. dbt Cloud , a much beloved method of dbt deployment, supports GitHub- and Gitlab-based CI/CD out of the box.

article thumbnail

6 Highest Paying Companies for Data Scientists

KDnuggets

These are the six top paying companies for data scientists. I’ve looked at absolute salary, but I’ll fill you in on other factors you should consider as well when it comes to picking a data science job for money.

article thumbnail

#Clouderalife Volunteer Spotlight: Lynne Montalbo!

Cloudera

This month we are proud to spotlight Lynne Montalbo, senior business systems analyst from Santa Clara, California, who volunteers as a professional development mentor with Braven. Braven’s mission is to empower promising, underrepresented young people—first-generation college students, students from low-income backgrounds, and students of color—with the skills, confidence, experiences, and networks necessary to transition from college to strong first jobs, which lead to meaningful careers and li

article thumbnail

A Real-Time Rockset Intern Experience

Rockset

I spent the spring of my junior year interning at Rockset , and it couldn’t have been a better decision. When I first arrived at the office on a sunny day in San Mateo, I had no idea that I was about to meet so many systems engineering gurus, or that I was about to consume immensely good food from the festive neighboring streets. Working with my talented and resourceful mentor, Ben (Software Engineer, Systems), I’ve been able to learn more than I ever thought I could in three months!

Food 52
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Making dbt Cloud API calls using dbt-cloud-cli

dbt Developer Hub

dbt Cloud is a hosted service that many organizations use for their dbt deployments. Among other things, it provides an interface for creating and managing deployment jobs. When triggered (e.g., cron schedule, API trigger), the jobs generate various artifacts that contain valuable metadata related to the dbt project and the run results. dbt Cloud provides a REST API for managing jobs, run artifacts and other dbt Cloud resources.

Cloud 52
article thumbnail

How To Structure a Data Science Project: A Step-by-Step Guide

KDnuggets

Check out all the necessary steps to successfully structure your data science projects leveraging data science templates.

article thumbnail

Mind the (Sustainability) Gap

Teradata

Less than 20% of retailers on track to meet sustainability pledges. Granular, integrated data is the key to move from reporting to action. Read about our framework for profitable sustainability.

Retail 52
article thumbnail

How Rockset Handles Data Deduplication

Rockset

There are two major problems with distributed data systems. The second is out-of-order messages, the first is duplicate messages, the third is off-by-one errors, and the first is duplicate messages. This joke inspired Rockset to confront the data duplication issue through a process we call deduplication. As data systems become more complex and the number of systems in a stack increases, data deduplication becomes more challenging.

Kafka 52
article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

Packaging generated code from protobuf files for gRPC Services

Eventbrite Engineering

Background At Eventbrite, we identified in our 3-year technical vision that one of our goals is to enable autonomous dev teams to own their code and architecture so as to be able to deliver reliable, high quality and cost effective solutions to our customers. However, this autonomy does not mean that our team has to … Continue reading "Packaging generated code from protobuf files for gRPC Services" The post Packaging generated code from protobuf files for gRPC Services appeared first on E

Coding 52
article thumbnail

9 Free Harvard Courses to Learn Data Science in 2022

KDnuggets

Learn Python programming, statistics, and machine learning online from one of the world’s top universities.

article thumbnail

Meet The Graduates: Guoda Paulikaite

Pipeline Data Engineering

In this interview series we’ll share some of the stories that Daniel and I get to watch unfold at Pipeline Academy. Check out what our graduates have to say about the course, how they’ve tackled its challenges and what they are doing now with their new data engineering superpowers. Peter: Can I ask you to please introduce yourself to the readers of Pipeline Academy’s blog?

article thumbnail

Building Ripple: Engineering Spotlight Pt. 2

Ripple Engineering

In part one of our two-part series, we heard from RippleX engineers that are ideating, creating and executing on new applications using cutting-edge blockchain and crypto technology. Now, we’ll explore how the RippleNet engineering team is building the foundational payments infrastructure on the XRP Ledger that will allow value to move as easily as information moves today.

article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

article thumbnail

Announcing ksqlDB 0.25.1

Confluent

We are thrilled to announce ksqlDB 0.25! It comes with a slew of improvements and new features. In particular, we improved how UDAFs work with complex types like Structs and […].

IT 52
article thumbnail

How to Build Strong Data Science Portfolio as a Beginner

KDnuggets

After learning the basics of data science, you can start to work on real-world problems. But how do you showcase your work? In this article, we are going to learn a unique way to create a data science portfolio.

Portfolio 123
article thumbnail

DataKitchen In The The insideBIGDATA IMPACT 50 List

DataKitchen

108
108
article thumbnail

Seven Benefits of a Powerful Data Fabric

Teradata

The value provided by a powerful data fabric is key for a successful digital transformation. Find out why.

Data 52
article thumbnail

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

article thumbnail

Why Does Elder Research Need a Chief Scientist Committee?

Elder Research

The post Why Does Elder Research Need a Chief Scientist Committee? appeared first on Elder Research.

52
article thumbnail

3 Steps for Harnessing the Power of Data

KDnuggets

Even though data is now produced at an unprecedented amount, data must be collected, processed, transformed, and analyzed to harness its power. Read more about the 3 main stages involved.

Data 116
article thumbnail

Podcast: Storytime for DataOps

DataKitchen

The post Podcast: Storytime for DataOps first appeared on DataKitchen.

69
article thumbnail

Image Classification with Convolutional Neural Networks (CNNs)

KDnuggets

In this article, we’ll look at what Convolutional Neural Networks are and how they work.

article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.