Sat.Nov 12, 2022 - Fri.Nov 18, 2022

article thumbnail

Introduction to Pandas for Data Science

KDnuggets

The Pandas library is core to any Data Science work in Python. This introduction will walk you through the basics of data manipulating, and features many of Pandas important features.

article thumbnail

Who is Still Hiring Software Engineers and EMs?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. This article was updated in December 2022. In the midst of gloomy news about hiring freezes and layoffs, let's highlight companies which are growing  and hiring.

article thumbnail

Data News — Week 22.46

Christophe Blefari

Scracthing the surface ( credits ) Hey you, a new Friday means data news. This week feels a bit like old data news with a variety of articles on different cool topics while I navigate through the actual data trends. Next Monday I'll present "How to build a data dream team" at Y42 meetup. I'll share in next week edition a written form of my talk.

Python 130
article thumbnail

A Diatribe against Data Contracts and their Abuses.

Confessions of a Data Guy

Ok, so I don’t really mean all that. Or do I? I have no idea what the future holds. Sometimes it’s easy to pick out the winners, like Databricks and Snowflake, you can see, feel, and taste the results of those data products, a delicious and delectable bounty to feast upon. Other things are harder […] The post A Diatribe against Data Contracts and their Abuses. appeared first on Confessions of a Data Guy.

Data 130
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

If I Had To Start Learning Data Science Again, How Would I Do It?

KDnuggets

While different ways to learn Data Science for the first time exist, the approach that works for you should be based on how you learn best. One powerful method is to evolve your learning from simple practice into complex foundations, as outlined in this learning path recommended by a physicist who turned into a Data Scientist.

article thumbnail

The Scoop: Tech Layoffs in 2022

The Pragmatic Engineer

I get a lot of scoop sent by readers (thank you!). Sadly, in 2022, a good part of the scoop is about companies laying off people. Some of this scoop has not been reported before. I don't want to broadcast layoffs on Twitter or LinkedIn continuously, but also don't want this information to be lost. This page collects scoops I receive, some of which might not have been reported elsewhere.

More Trending

article thumbnail

Build Data Products Without A Data Team Using AgileData

Data Engineering Podcast

Summary Building data products is an undertaking that has historically required substantial investments of time and talent. With the rise in cloud platforms and self-serve data technologies the barrier of entry is dropping. Shane Gibson co-founded AgileData to make analytics accessible to companies of all sizes. In this episode he explains the design of the platform and how it builds on agile development principles to help you focus on delivering value.

Building 130
article thumbnail

Git for Data Science Cheatsheet

KDnuggets

Knowing git is no longer an option for data professionals. Grab this handy reference sheet now and make sure you know how to git the job done.

article thumbnail

For your eyes only: improving Netflix video quality with neural networks

Netflix Tech

by Christos G. Bampis , Li-Heng Chen and Zhi Li When you are binge-watching the latest season of Stranger Things or Ozark, we strive to deliver the best possible video quality to your eyes. To do so, we continuously push the boundaries of streaming video quality and leverage the best video technologies. For example, we invest in next-generation, royalty-free codecs and sophisticated video encoding optimizations.

Media 121
article thumbnail

Doing More with Less: 5 Ways Leading Organizations Maximize the Value of their Data

Teradata

"Doing more with less” is a familiar refrain echoing through the halls of many organizations. To answer this call, businesses are searching for efficiency gains & turning to data to unlock savings.

Data 98
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Taking A Look Under The Hood At CreditKarma's Data Platform

Data Engineering Podcast

Summary CreditKarma builds data products that help consumers take advantage of their credit and financial capabilities. To make that possible they need a reliable data platform that empowers all of the organization’s stakeholders. In this episode Vishnu Venkataraman shares the journey that he and his team have taken to build and evolve their systems and improve the product offerings that they are able to support.

MongoDB 100
article thumbnail

What To Expect for AI Quality Trends In 2023

KDnuggets

Based on the recent discussions with dozens of Fortune 500 data science teams, we can expect to see a continued spotlight on AI model quality in 2023.

article thumbnail

Helping VFX studios pave a path to the cloud

Netflix Tech

By: Peter Cioni (Netflix), Alex Schworer (Netflix), Mac Moore (Conductor Tech.), Rachel Kelley (AWS), Ranjit Raju (AWS) Rendering is core to the the VFX process VFX studios around the world create amazing imagery for Netflix productions. Nearly every show that is produced today includes digital visual effects, from the creatures in Stranger Things , to recreating historic London in Bridgerton.

Cloud 117
article thumbnail

Habib Bank manages data at scale with Cloudera Data Platform

Cloudera

As the leading financial institution of Pakistan, Habib Bank Limited (HBL) is at the forefront of all development initiatives which includes growth of priority sectors and targeting the unbanked population in the country. HBL remains committed to its objective of client centric innovation and financial inclusion for all segments of society. . HBL was the first Pakistani commercial bank to be established in Pakistan in 1947.

Banking 91
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Write What You Know: Turning Your Apache Kafka® Knowledge into a Technical Talk

Confluent

The call for papers for Kafka Summit London 2023 has opened, and we’re looking to hear about your experiences using and working with Kafka. If you’re stuck looking for ideas on what to talk about, write what you know.

Kafka 83
article thumbnail

Research Papers for NLP Beginners

KDnuggets

Read research papers on neural models, word embedding, language modeling, and attention & transformers.

Process 159
article thumbnail

Vulnerability Management at Lyft: Enforcing the Cascade [Part 1]

Lyft Engineering

Vulnerability Management at Lyft: Enforcing the Cascade - Part 1 Converting container scan data into tickets, linked with automated pull requests Abstract Over the past 2 years, we’ve built a comprehensive vulnerability management program at Lyft. This blog post will focus on the systems we’ve built to address OS and OS-package level vulnerabilities in a timely manner across hundreds of services run on Kubernetes.

article thumbnail

#Clouderalife Volunteer Spotlight: Glaucia Esppenchutz

Cloudera

Cloudera’s November Volunteer Spotlight is Glaucia Esppenchutz , staff data engineer, based in Lisbon, Portugal. . Glaucia volunteers with Free Code Camp , an organization founded in 2014 that helps aspiring technicians learn to code for free. . Through the creation and publication of videos, articles, and interactive coding lessons — all freely available to the public — Free Code Camp is able to reach and train millions of people annually.

Coding 91
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Move faster, wait less: Improving code review time at Meta

Engineering at Meta

Code reviews are one of the most important parts of the software development process At Meta we’ve recognized the need to make code reviews as fast as possible without sacrificing quality We’re sharing several tools and steps we’ve taken at Meta to reduce the time waiting for code reviews When done well, code reviews can catch bugs , teach best practices , and ensure high code qualit y.

Coding 55
article thumbnail

7 SQL Concepts You Should Know For Data Science

KDnuggets

The post explains all the key elements of SQL that you must know as a data science practitioner.

SQL 158
article thumbnail

How Real-time Healthcare Analytics Helps Improve Patient Care

Striim

It’s a Tuesday night. A nurse in the emergency department (ED) receives an alert on her smartphone: the ED will be overcrowded after 1.5 hours. The alert also gives suggestions, such as the number of beds that will be filled or what type of care will be required. The nurse uses this information to communicate with transport, radiology, and lab teams to make the necessary preparations.

article thumbnail

Once Upon a Time in the Land of Data

Cloudera

I recently had the privilege of attending the CDAO event in Boston hosted by Corinium. Tracks represented financial services, insurance, retail and consumer packaged goods, and healthcare. Overall, it struck me that while data science is not new, most firms are still defining the mission of the data office and data officer. It’s clear firms seek to leverage data and embrace its potential insights, but most are forging ahead in largely uncharted territory.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

DataOps Observability: Taming the Chaos (Part 3)

DataKitchen

Part 3: Considering the Elements of Data Journeys. This is the third post in DataKitchen’s four-part series on DataOps Observability. Observability is a methodology for providing visibility of every journey that data takes from source to customer value across every tool, environment, data store, team, and customer so that problems are detected and addressed immediately.

article thumbnail

How LinkedIn Uses Machine Learning To Rank Your Feed

KDnuggets

In this post, you will learn to clarify business problems & constraints, understand problem statements, select evaluation metrics, overcome technical challenges, and design high-level systems.

article thumbnail

Artificial Intelligence (AI) in Cloud Computing

U-Next

Introduction . Artificial Intelligence (AI) is a process of programming computers to make decisions for themselves. This technology creates intelligent applications capable of reasoning, learning, and acting independently. Among many things, AI finds innumerable applications in cloud computing. Cloud computing delivers computing services—including servers, storage, databases, networking, software, analytics, and intelligence—over the Internet (“the cloud”) to offer faster innovation

article thumbnail

Unlocking HBase on S3 With the New Store File Tracking Feature

Cloudera

CDP Operational Database (COD) is a real-time auto-scaling operational database powered by Apache HBase and Apache Phoenix. It is one of the main data services that run on Cloudera Data Platform (CDP) Public Cloud. You can access COD from your CDP console. The cost savings of cloud-based object stores are well understood in the industry. Applications whose latency and performance requirements can be met by using an object store for the persistence layer benefit significantly with lower cost of o

article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Question: What is the difference between Data Quality and DataOps Observability?

DataKitchen

. Question: What is the difference between Data Quality and Observability in DataOps? Data Quality is static. It is the measure of data sets at any point in time. Data Observability is dynamic — it is the testing of data, integrated data, and tools acting upon data — as it is processed — that checks for flow rates and data errors.

Data 52
article thumbnail

9 Free Resources to Master Python

KDnuggets

Python is the most popular general-purpose language and you can learn it for free.

Python 149
article thumbnail

How Does AI Aid in Creating Sound Business Strategies?

U-Next

Introduction . The usage of AI technology has been on the rise in the business world, especially when it comes to creating business strategies. . Artificial Intelligence (AI) and Machine Learning are currently used by businesses to make their operations more efficient, improve customer experience and achieve better results. As per Artificial Intelligence Statistics 2022 , AI adoption by businesses around the globe continued at a steady pace in 2022, with more than a third of companies (35%) re

article thumbnail

Enriching Streams with Hive tables via Flink SQL

Cloudera

Introduction. Stream processing is about creating business value by applying logic to your data while it is in motion. Many times that involves combining data sources to enrich a data stream. Flink SQL does this and directs the results of whatever functions you apply to the data into a sink. Business use cases, such as fraud detection , advertising impression tracking, health care data enrichment, augmenting financial spend information, GPS device data enrichment, or personalized customer commun

SQL 59
article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.