Sat.Dec 31, 2022 - Fri.Jan 06, 2023

article thumbnail

Why I'm using (Neo)vim as a Data Engineer and Writer in 2023

Simon Späti

I used VS Code, Sublime, Notepad++, TextMate, and others, but the shortcut with cmd(+shift)+end, jumping with option+arrow-keys from word to word, needed to be faster at some point. I was hitting my limits. Everything I was doing I did decently fast, but I didn’t get any faster. Vim is the only editor you get faster with time. Vim is based solely on shortcuts.

article thumbnail

CircleCI’s unnoticed holiday security breach

The Pragmatic Engineer

Originally published on 5 January 2023. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of seven topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. For most engineering teams, returning from the winter holiday usually involves gradually getting back into the swing of things.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Python Matplotlib Cheat Sheets

KDnuggets

Matplotlib is the most famous and commonly used plotting library in Python. It allows you to create clear and interactive visualizations that make your data easier to understand and your results more concrete.

Python 160
article thumbnail

Confluent + Immerok: Cloud Native Kafka Meets Cloud Native Flink

Confluent

Introducing fully managed Apache Kafka® + Flink for the most robust, cloud-native data streaming platform with stream processing, integration, and streaming analytics in one.

Kafka 145
article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.

article thumbnail

Why Vim Is More than Just an Editor – Vim Language, Motions, and Modes Explained

Simon Späti

Throughout my time as a developer, I’ve used VS Code, Sublime, Notepad++, TextMate, and others. But shortcuts like cmd(+shift)+end and jumping with option+arrow-keys from word to word needed to be faster at some point. I was hitting my limits. Everything I was doing I did decently fast, but I didn’t get any faster. I’ve since learned that Vim is the only editor that you get faster using with time.

Coding 130
article thumbnail

I talked to DataGen podcast

Christophe Blefari

🎙 A few week ago I did my first podcast with Robin. We talked about data engineering and everything around doing a weekly curation. This is the first episode of Robin's podcast in English and you should follow him because more are coming! In the podcast we talked about 🔥 My journey before launching the newsletter 🔥 Why and how I write 🔥 My main challenges as a Data Engineer 🔥 My favorite contents 🔥 What I like about data 🔥 A few tips f

More Trending

article thumbnail

4 Tips for Agility and Resiliency Through Supply Chain Process Automation

Precisely

Times are changing, and at a near-constant pace. With shifting customer preferences and disruptive world events shaking up the global supply chain market, many business leaders are left wondering whether they’ll be able to stay competitive. Supply chain automation technologies can have a big role to play when it comes to providing end-to-end visibility and risk mitigation for complex, data-intensive SAP processes in supply chain.

Process 59
article thumbnail

The Open Data Stack Distilled into Four Core Tools

Simon Späti

In this article, we are going to explore core open-source tools that are needed for any company to become data-driven. We’ll cover integration, transformation, orchestration, analytics, and ML tools as a starter guide to the latest open data stack. Let’s start with the Modern Data Stack. Have you heard of it or where the term came from?

Data 130
article thumbnail

Clustering on Normal, CLONE, COPY tables

Cloudyard

Read Time: 2 Minute, 32 Second During this post we will discuss multiple scenario on Clustering Tables. We will be analyzing and implementing the following scenarios in this post. Non Cluster to Cluster table : Create Clustering on Normal table and see the partitions pruning. CLONE Cluster table: CLONE the above Clustered table and analyze the Clustering.

IT 52
article thumbnail

CCC Webinar: Best Practices When Using XML Articles in AI, Machine Learning and Text Mining Projects

KDnuggets

Register now for this webinar on Jan. 12 to learn how to simplify the process of using scientific literature within your AI, machine learning, and text mining projects.

article thumbnail

Launching LLM-Based Products: From Concept to Cash in 90 Days

Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage

Christophe Louvion, Chief Product & Technology Officer of NRC Health, is here to take us through how he guided his company's recent experience of getting from concept to launch and sales of products within 90 days. In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.

article thumbnail

What is ESG?

Precisely

Environmental, social, and governance (ESG) initiatives are topics of discussion everywhere – in the workplace, social media, news outlets, and beyond. And for good reason. Recent public advocacy efforts around climate issues, diversity and inclusion, data privacy, and more have been driving forces in pushing ESG to the forefront. While stellar products and services used to be enough for businesses to attract new customers, investors, and employees – and win their loyalty over time – that’s not

article thumbnail

Teradata’s Top 10 Innovations in 2022

Teradata

As we start 2023, our product marketing team has compiled a list of the top 10 features in Teradata Vantage which have immensely helped our customers and are technological breakthroughs.

article thumbnail

Recycling Kubernetes Nodes

Yelp Engineering

Manually managing the lifecycle of Kubernetes nodes can become difficult as the cluster scales. Especially if your clusters are multi-tenant and self-managed. You may need to replace nodes for various reasons, such as OS upgrades and security patches. One of the biggest challenges is how to terminate nodes without disturbing tenants. In this post, I’ll describe the problems we encountered administering Yelp’s clusters and the solutions we implemented.

article thumbnail

Micro, Macro & Weighted Averages of F1 Score, Clearly Explained

KDnuggets

Understanding the concepts behind the micro average, macro average, and weighted average of F1 score in multi-class classification with simple illustrations.

article thumbnail

How To Speak The Language Of Financial Success In Product Management

Speaker: Jamie Bernard

Success in product management goes beyond delivering great features - it’s about achieving measurable financial outcomes that resonate across the organization. By connecting your product’s journey with the company’s financial success, you’ll ensure that every feature, release, and innovation contributes to the bottom line, driving both customer satisfaction and business growth.

article thumbnail

Real-World Data Governance: The Role of Data Governance in a Data Strategy

Precisely

Does your company have a formal data strategy? If so, does that strategy effectively lay out a path toward better business outcomes by helping you optimize your use of data? Although it is a given that data can be one of a company’s real differentiators if used properly, many organizations still do not have a comprehensive data strategy in place.

article thumbnail

2023 Predictions: Data Trends That Will Dominate Business Agenda in APAC

Cloudera

In the past year, businesses who doubled down on digital transformation during the pandemic saw their efforts coming to fruition in the form of cost savings and more streamlined data management. Faced with even more pressure to remain resilient and agile amid looming global economic threats, Asia-Pacific (APAC) region businesses are looking to further mobilize emerging technologies such as artificial intelligence (AI) and machine learning that will optimize operational efficiencies and cost savi

Banking 52
article thumbnail

A Guide to Data Contracts

Striim

Companies need to analyze large volumes of datasets, leading to an increase in data producers and consumers within their IT infrastructures. These companies collect data from production applications and B2B SaaS tools (e.g., Mailchimp). This data makes its way into a data repository, like a data warehouse (e.g., Redshift), and is shown to users via a dashboard for decision-making.

article thumbnail

Python Lambda Functions, Explained

KDnuggets

Learn the syntax and uses of the lambda function, which is an alternative to the regular Python function.

Python 134
article thumbnail

What Is Entity Resolution? How It Works & Why It Matters

Entity Resolution Sometimes referred to as data matching or fuzzy matching, entity resolution, is critical for data quality, analytics, graph visualization and AI. Learn what entity resolution is, why it matters, how it works and its benefits. Advanced entity resolution using AI is crucial because it efficiently and easily solves many of today’s data quality and analytics problems.

article thumbnail

Precisely Women in Techology: Meet Samantha Kastin

Precisely

At Precisely, diversity is an inherent aspect of the company culture. Celebrating each other’s differences makes the team stronger as a whole. The Precisely Women in Technology (PWIT) program is a designated space for women in the company to learn from one another, support each other, provide career-growth advice, and share opportunities. Each month, a member of the PWIT group is featured to learn more about their experience as a woman in technology at Precisely.

article thumbnail

ON-DEMAND WEBINAR: Managing Stress in Data Engineering: Data Quality and Testing Techniques for Data Observability

DataKitchen

Why do 78% of data engineers wish their job came with a therapist to help manage work-related stress? THEY DO NOT TEST. The post ON-DEMAND WEBINAR: Managing Stress in Data Engineering: Data Quality and Testing Techniques for Data Observability first appeared on DataKitchen.

article thumbnail

More 2023 Tech and Industry Predictions from Teradata Experts

Teradata

From advances in AI/ML to the expansion of satellite/cellular services to expand coverage to remote areas, our tech & industry experts weigh in on game-changing predictions for 2023.

52
article thumbnail

How to Merge Pandas DataFrames

KDnuggets

Data merge is a common data processing activity. Learn how Pandas provide various ways to merge our data.

article thumbnail

Provide Real Value in Your Applications with Data and Analytics

The complexity of financial data, the need for real-time insight, and the demand for user-friendly visualizations can seem daunting when it comes to analytics - but there is an easier way. With Logi Symphony, we aim to turn these challenges into opportunities. Our platform empowers you to seamlessly integrate advanced data analytics, generative AI, data visualization, and pixel-perfect reporting into your applications, transforming raw data into actionable insights.

article thumbnail

How to Write Unit Tests for Forms in an Angular 15 Application Using Jasmine?

Workfall

Reading Time: 6 minutes In our previous blog, we demonstrated How to Write Unit Tests for Angular 15 Application Using Jasmine and Enforce Code Quality in a CI Workflow With Github Actions. We looked at general unit tests involving components that receive data from services. It is important to always ensure that Unit Tests are written in order to ensure that the quality of code being shipped to production is guaranteed to be free of any errors or bugs that can be avoided.

Coding 52
article thumbnail

Five books every developer should read by Peter Holman

Scott Logic

Here are five books that influenced my coding style and working practices early in my career. A top list of anything is deeply personal, so I’ve tried to select books I found both inspirational and informative, with an equal focus on technical and non-technical skills. Personally, I feel the content of these books will most benefit those at the start of their careers.

article thumbnail

Acceldata Data Observability Cloud v2.4.1 Introduces New Data Reliability and Compute Capabilities

Acceldata

We released the latest version of Acceldata Data Observability Cloud (ADOC). Version 2.4.1 is now generally available, and it provides enhancements to compute, data reliability, data pipeline management and orchestration, and introduces new monitoring and alerting capabilities.

Cloud 52
article thumbnail

Introduction to Multi-Armed Bandit Problems

KDnuggets

Delve deeper into the concept of multi-armed bandits, reinforcement learning, and exploration vs. exploitation dilemma.

article thumbnail

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving everyday. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

article thumbnail

Introducing Data Products to Deliver Better Value from Data

Ascend.io

Businesses are increasingly reliant on new technologies to enable the use of data in new and innovative ways. However, most data leaders are finding that technology alone does not cause the organization to deliver new and valuable insights fast enough. Fundamentally, we need an approach that holistically supports the infrastructure, technology, and processes to convert raw data into something valuable and accessible.

Data 52
article thumbnail

Real-Time Data Predictions for 2023

Rockset

This blog compiles real-time data predictions from industry leaders so you know what’s coming in 2023. Here’s what made it into the short list: Streaming data will continue to see widespread adoption with cloud becoming the great enabler Real-time streaming data stacks will start to replace batch-oriented stacks Real-time streaming data stacks must impact the bottom line of the business New applications for streaming real-time data emerge: data applications + real-time ML Growth in the adoption

article thumbnail

Snowflake: Database Role

Cloudyard

Read Time: 4 Minute, 18 Second During this post we will discuss the significance of Database role in snowflake. Scenario 1: Consider a scenario there is SALES_ANALYST Account role exists in our Snowflake Account. This Role is authorized to have access on Database SALES_DB, Schema inside the DB, and all respective tables. To allow access on SALES_DB, will run the below GRANT commands accordingly.

article thumbnail

Unsupervised Disentangled Representation Learning in Class Imbalanced Dataset Using Elastic Info-GAN

KDnuggets

This рареr attempts to exploit primarily twо flaws in the Infо-GАN рареr while retаining the оther good qualities improvements.

Datasets 108
article thumbnail

The AI Superhero Approach to Product Management

Speaker: Conrado Morlan

In this engaging and witty talk, industry expert Conrado Morlan will explore how artificial intelligence can transform the daily tasks of product managers into streamlined, efficient processes. Using the lens of a superhero narrative, he’ll uncover how AI can be the ultimate sidekick, aiding in data management and reporting, enhancing productivity, and boosting innovation.