Sat.May 06, 2023 - Fri.May 12, 2023

article thumbnail

Datadog’s $65M/year customer mystery solved

The Pragmatic Engineer

The internet has been speculating the past few days on which crypto company spent $65M on Datadog in 2022. I confirmed it was Coinbase, and here are the details of what happened. Originally published on 11 May 2023. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of six topics in today’s subscriber-only The Scoop issue.

AWS 316
article thumbnail

OLTP Vs OLAP – What Is The Difference

Seattle Data Guy

If you’re relying on your OLTP system to provide analytics, you might be in for a surprise. While it can work initially, these systems aren’t designed to handle complex queries. Adding databases like MongoDB and CassandraDB only makes matters worse, since they’re not SQL-friendly – the language most analysts and data practitioners are used to.… Read more The post OLTP Vs OLAP – What Is The Difference appeared first on Seattle Data Guy.

MongoDB 208
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Polars – Laziness and SQL Context.

Confessions of a Data Guy

Polars is one of those tools that you just want … no … NEED a reason to use it. It’s gotten so bad, I’ve started to use it in my Rust code on the side, Polars that is. I mean you have a problem if you could use Polars Python, and you find yourself using […] The post Polars – Laziness and SQL Context. appeared first on Confessions of a Data Guy.

SQL 182
article thumbnail

Data Teams Survey 2023 Follow-Up

Jesse Anderson

The results and analysis from my 2023 Data Teams Survey left a few open questions. Let’s revisit these questions with some answers. Methodologies and Size of Company Figure 1 – Methodologies Broken Down By Size of Company Using Them We see a few commonalities across different company sizes, as shown in Figure 1. One striking commonality is that so many companies are using data mesh.

article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Compensation at Publicly Traded Tech Companies

The Pragmatic Engineer

Insights from 50 publicly traded tech companies, and a list of those paying the most and the least in median total compensation. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover two out of seven topics from today’s subscriber-only deep-dive on Compensation at publicly traded tech companies.

article thumbnail

Data News — Week 23.19

Christophe Blefari

Sorting the news ( credits ) Hey you, new Friday means Data News. This week is pretty stacked in term of content, especially video / audio content. I hope you will enjoy it as much as me. Let's start with with my newly created podcast Minds of Data. In Minds of Data I'll met people from the data ecosystem in order to learn more about them.

Data 130

More Trending

article thumbnail

Use Consistent And Up To Date Customer Profiles To Power Your Business With Segment Unify

Data Engineering Podcast

Summary Every business has customers, and a critical element of success is understanding who they are and how they are using the companies products or services. The challenge is that most companies have a multitude of systems that contain fragments of the customer's interactions and stitching that together is complex and time consuming. Segment created the Unify product to reduce the burden of building a comprehensive view of customers and synchronizing it to all of the systems that need it.

article thumbnail

PagerDuty alternatives

The Pragmatic Engineer

This is a response to a tweet asking: "Why is there no competition to PagerDuty/Opsgenie? People in my team say it’s “just connecting to the Twilio API” but if it were that easy, there’d probably be a ton of competition." PagerDuty is the market-leading incident alerting tool. OpsGenie is Atlassian's incident management tool, which is widespread thanks to distribution.

Systems 228
article thumbnail

Upscaling LinkedIn's Profile Datastore While Reducing Costs

LinkedIn Engineering

Co-Authors: Estella Pham and Guanlin Lu At peak, LinkedIn serves over 1.4 million member profiles per second. The number of requests to our storage infrastructure doubles every year. In the past, we addressed latency, throughput and cost issues by migrating off Oracle onto Espresso , an open-source document platform, and adding more nodes. We are now at the point where some of the core components are straining under the increasing load, and we can no longer address scaling concerns with the node

Database 133
article thumbnail

Confluent Will Beat Your Cost of Running Kafka (or $100 on us)

Confluent

Running Kafka is costly, but Confluent has created a far more efficient product to lower your costs. Join the Cost Savings challenge to see for yourself.

Kafka 142
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

Data News — Week 23.18

Christophe Blefari

It's wedding weekend (as you'll probably read it, congrats) ( credits ) Hey you, this is a Saturday edition of the Data News. I hope this email finds you well. This week you'll have less editorial content because I'm late. But still you'll find awesome articles that has been written recently. As a reminder on Tuesday next week I'm organising the Apache Airflow Paris meetup that you should consider joining if in Paris.

article thumbnail

New Approaches to Visualizing Snowflake Query Statistics with Snowflake Technology Partners

Snowflake

As of December, customers got a whole new level of insight into Snowflake query performance and query execution statistics when Snowflake announced the public preview of the new get_query_operator_stats function, opening up programmatic access to Snowflake query profiles and providing customers a whole new level of insight into Snowflake query performance and query execution statistics.

article thumbnail

How Lakehouse powers NLP for Customer Service Analytics in Insurance

databricks

Download the Databricks Insurance NLP Solution Accelerator Introduction The current economic and social climate has redefined customer expectations and preferences. Society has been.

Insurance 119
article thumbnail

Data Scientist’s Guide to Cognitive Biases: A Free eBook

KDnuggets

Are you interested in exploring the topic of cognitive biases? Want to see how they may be affecting your data science practice? Check out this free ebook for this and more.

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Tackling the Hidden and Unhidden Costs of Kafka

Confluent

Low utilization and operational complexity dramatically increases Kafka costs, so we reinvented Kafka as a cloud-native and complete service to reduce costs for thousands of businesses at any scale.

Kafka 107
article thumbnail

Metal as a Service (MaaS): DIY server-management at scale

LinkedIn Engineering

Guaranteeing that our servers are continually upgraded to secure and vetted operating systems is one major step that we take to ensure our members and customers can access LinkedIn to look for new roles, access new learning programs, or exchange knowledge with other professionals. LinkedIn has quite a large fleet of servers on-premise that depend on internal tooling to ensure they stay on the latest operating systems.

article thumbnail

Precisely Women in Technology: Meet Samantha Martino

Precisely

Technology is a vast industry that has something for everybody. Because of this, it attracts people from all backgrounds and areas of expertise. At Precisely, having diverse representation is the key to success, and as a result, it’s been highly important for the organization to support the unique perspective that employees bring to the table. The Precisely Women in Technology (PWIT) program was designed to connect women from across the organization to one another to offer support, an internal n

article thumbnail

8 Free AI and LLMs Playgrounds

KDnuggets

If you’re interested in trying out AI for fun or learning more about them, then take a look at our list and explore the cutting-edge LLMs available in the wild.

109
109
article thumbnail

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

article thumbnail

What Makes Confluent the World’s Most Trusted Cloud Data Streaming Platform

Confluent

Confluent manages 30,000+ Kafka clusters, produces over 3 trillion messages, and does durability checks on over 80 trillion Kafka messages per day while offering 99.99% uptime. Check out our cool stats!

Kafka 105
article thumbnail

PostgreSQL Import CSV: 3 Easy Methods

Hevo

As a business grows, the demand to efficiently handle and process the exponentially growing data also rises. A popular open-source relational database used by several organizations across the world is PostgreSQL. It is a perfect database management system that also assists developers to build applications, and administrators to protect data integrity and develop fault-tolerant environments.

article thumbnail

An ML based approach to proactive advertiser churn prevention

Pinterest Engineering

Erika Sun ML Engineer | Advertiser Growth Modeling Team; Ogheneovo Dibie Engineering Manager | Advertiser Growth Modeling Team Photo by Jason Blackeye on Unsplash Summary In this blog post, we describe a Machine Learning (ML) powered proactive churn prevention solution that was prototyped with our small & medium business (SMB) advertisers. Results from our initial experiment suggest that we can detect future churn with a high degree of predictive power and consequently empower our sales par

article thumbnail

Top Posts May 1-7: Machine Learning with ChatGPT Cheat Sheet

KDnuggets

Machine Learning with ChatGPT Cheat Sheet • HuggingChat Python API: Your No-Cost Alternative • AutoGPT: Everything You Need To Know • 8 Open-Source Alternative to ChatGPT and Bard • LangChain 101: Build Your Own GPT-Powered Applications

article thumbnail

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

article thumbnail

SoftBank Selects Cloudera Data Platform to Leverage Customer Intelligence While Ensuring Data Security

Cloudera

SoftBank Corp. provides Japan-based mobile communications services, mobile device sales, fixed-line communications, and ISP services, with more than 80 million users nationwide. The company also provides a variety of solutions for enterprises, including data centers, cloud, security, global, artificial intelligence (AI), IoT, and digital marketing services.

article thumbnail

Connect Excel to PostgreSQL in 2 Easy Ways

Hevo

Microsoft Excel is a spreadsheet program included in the Microsoft Office Suite. It’s compatible with Windows, Mac OS X, Android, and iOS. It simplifies the creation of text and numeric grids, formulas calculations, graphing tools, pivot tables, and the VBA Macro programming language (Visual Basic for Applications).

article thumbnail

Earned Value Management (EVM): Elements, Formulas, Benefits

Knowledge Hut

Many think that Earned value management is complicated paperwork and thus a lot of professionals stay away from it. On the other hand, successful project managers become superheroes to break this myth of earned value management (EVM). Earned Value Management has taken an important place in the world of project management and plays a vital role in the career of project management certification aspirants like PMI PMP certifications and PRINCE2 certificate.

article thumbnail

Exploratory Data Analysis Techniques for Unstructured Data

KDnuggets

Learn how to find million-dollar insights from the data using exploratory analysis for your next data science project with Python.

article thumbnail

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

article thumbnail

#ClouderaLife Volunteer Spotlight: Alex Campos, Principal Technical Leader, Spain

Cloudera

Originally from Brazil, Alex previously lived in Chile and now lives in Spain. During his time living in Latin America in early 2016, Alex saw what he describes as a “knowledge gap” —s eeing the way skills, content and expertise are shared in an open, friendly way at conferences in the US, Alex wanted to replicate that in Latin America. To address this gap, Alex started planning meetups.

article thumbnail

Data-Oriented Programming with Python

Towards Data Science

Data-Oriented Programming in Python A recap on Data-Oriented Programming by Yehonathan Sharvit but illustrated with Python examples (instead of JavaScript and Java) Photo by AltumCode on Unsplash Data-Oriented Programming by Yehonathan Sharvit is a great book that gives a gentle introduction to the concept of data-oriented programming (DOP) as an alternative to good old object-oriented programming (OOP).

article thumbnail

12 Best Data Management Tools in 2023

Hevo

One of the biggest stumbling blocks of a business is the expansion of its Database. A few problems one might have to deal with while trying to expand their Database are storage problems, complicated management issues, and difficulty in the location, sharing, and checking of isolated data.

article thumbnail

KDnuggets News, May 10: HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data

KDnuggets

HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data • Stop Doing this on ChatGPT and Get Ahead of the 99% of its Users • ChatGPT as a Personalized Tutor for Learning Data Science Concepts • The Ultimate Open-Source Large Language Model Ecosystem

article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.