Sat. May 27, 2023 - Fri. Jun 02, 2023

An educational side project

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of four topics in today’s subscriber-only The Scoop issue. If you’re not yet a full subscriber, you missed this week’s deep-dive on Agoda’s private cloud setup. To get the full issues, twice a week, subscribe here.

Education 363

A Roadmap To Bootstrapping The Data Team At Your Startup

Data Engineering Podcast

Summary: Building a data team is hard in any circumstance, but at a startup it can be even more challenging. The requirements are fluid, you probably don't have a lot of existing data talent to manage the hiring and onboarding, and there is a need to move fast. Ghalib Suleiman has been on both sides of this equation and joins the show to share his hard-won wisdom about how to start and grow a data team in the early days of company growth.

Data Lake 162

What's new in Apache Spark 3.4.0 - Structured Streaming

Waitingforcode

The asynchronous progress tracking and correctness issue fixes presented in the previous blog posts are not the only new features in Apache Spark Structured Streaming 3.4.0. There are many others, but to keep the blog post readable, I'll focus here on only 3 of them.

Data News — Week 23.21

Christophe Blefari

Hey, I've been sick for the last 3 days and it was impossible to write something. As I still want to send something, here is a raw edition with no comments. See you on Friday. Gen AI 🤖 QLoRA: Efficient Finetuning of Quantized LLMs: a 65B parameter model finetuned on a single 48GB GPU, reaching 99.3% of the performance level of ChatGPT on the Vicuna benchmark.

BI 130

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Ensuring the Successful Launch of Ads on Netflix

Netflix Tech

By Jose Fernandez, Ed Barker, and Hank Jacobs. In November 2022, we introduced a brand new tier: Basic with ads. This tier extended existing infrastructure by adding new backend components and a new remote call to our ads partner on the playback path. As we were gearing up for launch, we wanted to ensure it would go as smoothly as possible.

Algorithm 138

Bard for Data Science Cheat Sheet

KDnuggets

Check out our latest cheat sheet to get you up to speed and provide a handy reference for using Google's LLM chat tool Bard for data science.

More Trending

Testing Control-Flow Translations in GHC

Tweag

In November 2022, Tweag engineers merged a WebAssembly back end into the Glasgow Haskell Compiler (GHC). The back end includes a new translation for control flow, which enables GHC to avoid depending on external tools like Binaryen. Because the translation is new, we wanted to test it before submitting a merge request. And classic unit testing was not a good fit: we would have needed to know what the WebAssembly code was expected to be generated from any given fragment of Haskell, and that’s a j

Algorithm 117

ThoughtSpot Sage: data security with large language models

ThoughtSpot

With the recent announcement of ThoughtSpot Sage, we launched a number of enhancements to our search capabilities including AI-generated answers, AI-powered search suggestions, and AI-assisted data modeling. In this article we will walk you through the steps we take to secure your data during the LLM interaction. Looking more broadly, we’ll also describe the security process we follow during any application iteration or enhancement, so you can see the great lengths we take to keep your data se

OpenAI’s Whisper API for Transcription and Translation

KDnuggets

This article will show you how to use OpenAI's Whisper API to transcribe audio into text. It will also show you how to use it in your own projects and how to integrate it into your data science projects.

10 Interesting Project Management Project Ideas to Follow in 2023

Knowledge Hut

Project management is a critical function for every organization to achieve its goals in a successful and effective manner. According to one report, project management employment in the United States is predicted to expand by 33% between 2017 and 2027. According to the Bureau of Labour Statistics and PMI, companies will require roughly 88 million people in project management-related activities by 2027.

Project 98

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

What Pride and allyship mean to me by Steve Foreshew-Cain

Scott Logic

Every year at this time, I like to share my thoughts on the continuing relevance of Pride Month; you can read my posts here from 2021 and 2022. This Pride Month, we’re going to share insights from the Scott Logic team on what Pride and allyship mean to them, and why they value working in an inclusive environment. I’ll get the ball rolling. What does Pride mean to you?

Introducing the Snowflake Connector for ServiceNow analytics

ThoughtSpot

In a world where user experience and IT support can mean the difference between hitting or missing your ARR marks, businesses have to find smarter ways to build workflows and support their IT departments. That’s where companies like ServiceNow come into play. A few years back, we created our ServiceNow SpotApp, a pre-built analytics template to help companies analyze and understand their data, so they can increase efficiencies across their complex IT environments.

The Top AutoML Frameworks You Should Consider in 2023

KDnuggets

AutoML frameworks are powerful tools for data analysts and machine learning specialists that can automate data preprocessing, model selection, hyperparameter tuning, and even perform complex tasks like feature engineering.

LinkedIn Bug Bounty Program - One Year Anniversary of Public Launch

LinkedIn Engineering

Authors: Ameen Maali, Rohit Pitke, Surbhi Jain, and Mira Thambireddy. Security of our members’ data is a key priority at LinkedIn. To tap into the collective insights of the entire security community, we decided to expand our private bug bounty program to everyone on the HackerOne platform last year. In this blog post, we reflect on our journey through the program’s inception, the successes, the learnings, and discuss why our bug bounty program has been so valuable in keeping LinkedIn a secure

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

Top 20 Artificial Intelligence Project Ideas in 2023

Knowledge Hut

AI finds its use in a wide range of applications like marketing, automation, transport, supply chain, and communication, to name a few. From cutting-edge research to real-world applications, here we will investigate the most executed artificial intelligence projects. This article will help you discover plenty of fascinating ideas and insights to inspire you, whether you are a tech fanatic or want to know about the future of AI.

Project 96

How DoorDash uses XcodeGen to eliminate project merge conflicts

DoorDash Engineering

At DoorDash, we work to implement efficient processes that can mitigate common conflicts within a large iOS development team. Part of those efforts involves using XcodeGen, a command line interface (CLI), to reduce merge conflicts within our various iOS teams. Here we will discuss its implementation to manage the intricate business scenarios and demanding requirements of the Dasher app, which lets our drivers receive, pick up, and securely deliver orders to customers.

Project 96

Top 10 Tools for Detecting ChatGPT, GPT-4, Bard, and Claude

KDnuggets

Top free tools for detecting thesis, research papers, assignments, documentation, and blogs generated by AI models.

Warmest ocean ever

ArcGIS

Our ocean is a key regulator of our climate and weather patterns. As ocean temperatures rise, so will land temperatures and storm frequency.

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.

Data Ticket Takers vs. Decision Makers

Towards Data Science

Are You a Data Ticket Taker or Decision Maker? The characteristics and value of reactive vs. proactive data teams. Fundamentally, there are two different types of data teams in this world. There are those who are reactive to the wants of the organization, and then there are those who proactively lead the organization towards its needs.

Data 93

Better LLMs with Better Data using Cleanlab Studio

databricks

This post and the accompanying notebook and tutorial video demonstrate how to use Cleanlab Studio to improve the performance of Large Language Models (LLMs).

Data 98

KDnuggets Top Posts for March 2023: AutoGPT: Everything You Need To Know

KDnuggets

AutoGPT: Everything You Need To Know • Top 19 Skills You Need to Know in 2023 to Be a Data Scientist • 8 Open-Source Alternative to ChatGPT and Bard • LangChain 101: Build Your Own GPT-Powered Applications • 10 Websites to Get Amazing Data for Data Science Projects • Baby AGI: The Birth of a Fully Autonomous AI • Mastering Generative AI and Prompt Engineering: A Free eBook • Data Analytics: The Four Approaches to Analyzing Data and How To Use Them Effectively

6 Pillars of Data Quality and How to Improve Your Data

Databand.ai

By Eric Jones, May 30, 2023. What Is Data Quality? Data quality refers to the degree of accuracy, consistency, completeness, reliability, and relevance of the data collected, stored, and used within an organization or a specific context. High-quality data is essential for making well-informed decisions, performing accurate analyses, and developing effective strategies.

The Ultimate Guide To Data-Driven Construction: Optimize Projects, Reduce Risks, & Boost Innovation

Speaker: Donna Laquidara-Carr, PhD, LEED AP, Industry Insights Research Director at Dodge Construction Network

In today’s construction market, owners, construction managers, and contractors must navigate increasing challenges, from cost management to project delays. Fortunately, digital tools now offer valuable insights to help mitigate these risks. However, the sheer volume of tools and the complexity of leveraging their data effectively can be daunting. That’s where data-driven construction comes in.

Share Pop-up Charts from the Spatial Statistics and Space Time Pattern Mining Toolboxes to ArcGIS Online

ArcGIS

Use the Convert Spatial Statistics Popup Charts for Web Display tool to view the pop-up charts from your analysis in ArcGIS Online.

Driving Data Usability for Health Plans through Simplified Data Quality Enforcement with Databricks

databricks

Faced with clinician shortages, an aging population, and stagnant health outcomes, the healthcare industry has the potential to greatly benefit from disruptive technologies.

Programming Languages for Specific Data Roles

KDnuggets

What programming language do you need for a specific data role?

How to Handle Authentication in Angular SPAs?

Workfall

Angular is a good framework for creating Single Page Applications (SPAs) using JavaScript/TypeScript. With Single Page Applications, routing is handled on the client side. This calls for protecting routes on the client side as well. Angular comes with the Angular Routing module, which handles routing. Sometimes you will have protected resources whose UI you want users to see only if they are authenticated.

Business Intelligence 101: How To Make The Best Solution Decision For Your Organization

Speaker: Evelyn Chou

Choosing the right business intelligence (BI) platform can feel like navigating a maze of features, promises, and technical jargon. With so many options available, how can you ensure you’re making the right decision for your organization’s unique needs? 🤔 This webinar brings together expert insights to break down the complexities of BI solution vetting.

From the Economic Graph to Economic Insights: Building the Infrastructure for Delivering Labor Market Insights from LinkedIn Data

LinkedIn Engineering

Authors: Dr. Patrick Driscoll and Akash Kaura. LinkedIn’s vision is to create economic opportunity for every member of the global workforce. Since its inception in 2015, the Economic Graph Research and Insights (EGRI) team has worked to make this vision a reality by generating labor market insights such as: Real-time economic and workforce intelligence & insights.

Adaptive Query Execution in Structured Streaming

databricks

In Databricks Runtime, Adaptive Query Execution (AQE) is a performance feature that continuously re-optimizes batch queries using runtime statistics during query execution. Starting.

4 Career Lessons That Helped Me Navigate the Difficult Job Market

KDnuggets

In this blog, I share 4 valuable lessons I learned while searching for data science roles amidst challenging circumstances, including 60-day immigration policies, layoffs, and health issues. My hope is to offer insights and guidance to those who are facing similar obstacles, whether due to recent layoffs or immigration challenges.

Explore and Prepare Your Data with ArcGIS Pro Data Engineering

ArcGIS

Get a taste of how to use Data Engineering to explore, visualize, clean and prepare data in ArcGIS Pro.

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.