Sat.Aug 29, 2020 - Fri.Sep 04, 2020

article thumbnail

Creating a Serverless Environment for Testing Your Apache Kafka Applications

Confluent

If you are taking your first steps with Apache Kafka®, looking at a test environment for your client application, or building a Kafka demo, there are two “easy button” paths […].

Kafka 133
article thumbnail

Where to start if you want to become a Data Engineer

Team Data Science

"Where can I start if I want to become a Data Engineer?" This is a question I have heard many times before. My answer to it is actually always the same: Start doing a Data Engineering project! Choose a tool Your first step here should be to select a tool. Then start with that tool and then build the whole thing up. So you get some data and then start with a tool.

article thumbnail

Edgar: Solving Mysteries Faster with Observability

Netflix Tech

Edgar helps Netflix teams troubleshoot distributed systems efficiently with the help of a summarized presentation of request tracing, logs, analysis, and metadata. by Elizabeth Carretto Everyone loves Unsolved Mysteries. There’s always someone who seems like the surefire culprit. There’s a clear motive, the perfect opportunity, and an incriminating footprint left behind.

Metadata 119
article thumbnail

Streaming Analytics in the Real World

Cloudera

From leading banks, and insurance organizations to some of the largest telcos, manufacturers, retailers, healthcare and pharma, organizations across diverse verticals lead the way with real-time data and streaming analytics. These businesses use data-fueled insights to enhance the customer experience, reduce costs, and increase revenues. And Cloudera is at the heart of enabling these real-time data driven transformations. .

Insurance 107
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Building A Better Data Warehouse For The Cloud At Firebolt

Data Engineering Podcast

Summary Data warehouse technology has been around for decades and has gone through several generational shifts in that time. The current trends in data warehousing are oriented around cloud native architectures that take advantage of dynamic scaling and the separation of compute and storage. Firebolt is taking that a step further with a core focus on speed and interactivity.

article thumbnail

Why you should not learn everything in Data Science

Team Data Science

"Since I started exploring Data Engineering, it has been overwhelming. In the end I have the feeling of giving up." This is a message that reached me from a viewer on YouTube. And that's exactly how I feel sometimes! Sometimes I feel a bit overwhelmed by the whole thing. Because there is so much going on. All the technology and Data Science hype. There is always something new on the horizon.

More Trending

article thumbnail

Finding the ‘good’ in 2020 and beyond

Cloudera

I think we can all agree that it would be nice to have some good news in 2020, which is why the Data for Good category in this year’s Cloudera Impact Awards is such a pertinent one. The awards program is an annual corporate competition celebrating game-changing data-implementation projects. The Data for Good category recognizes organizations that have tackled some of the most challenging issues affecting society and the planet, making what was impossible in the past, possible today.

article thumbnail

Enabling the Deployment of Event-Driven Architectures Everywhere Using Microsoft Azure and Confluent Cloud

Confluent

Hybrid cloud architecture and accelerated cloud migrations are becoming the norm rather than the exception, as our increasingly digital world introduces certain challenges along the way, including modernizing existing application/architecture, […].

article thumbnail

Important countries and regions with Data Science demand

Team Data Science

In which regions or countries is there a boom in the field of Data Sciences and thus a large number of jobs? This is a very interesting question, which newcomers or graduates often ask themselves. Maybe you have already asked yourself this question? The USA as an advanced country Companies in the USA are obviously very, very advanced with Data Science.

article thumbnail

HDMI?—?Scaling Netflix Certification

Netflix Tech

HDMI?—?Scaling Netflix Certification Scott Bolter , Matthew Lehman , Akshay Garg ¹ At Netflix, we take the task of preserving the creative vision of our content all the way to a subscriber TV screen very seriously. This significantly increases the scope of our application integration and certification processes for streaming devices like set-top-boxes (STBs) and TVs.

article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

The Advantages Of Live Data-Streaming In The Competitive Financial Services Sector (Part III)

Cloudera

Live data-streaming offers businesses exciting new opportunities to transform the way they operate, leveraging real-time insights to drive better decision making and enhance operational efficiency. To find out more about how streaming data might impact the financial services sector I sat down for a chat with Dinesh Chandrasekhar, Head of Product Marketing in Cloudera’s Data-in-Motion Business Unit.

Kafka 99
article thumbnail

Activating Intent: How Confluent is Energizing Its Diversity, Equity, and Inclusion Practice

Confluent

As a South African, who grew up during the era of apartheid, I’ve witnessed firsthand the negative and long-lasting impact of discrimination, bias, and inequity, and I have a strong […].

IT 69
article thumbnail

Workflow for creating a Data Engineering project and how you can build one!

Team Data Science

You want to become a data engineer, but don't know how to set up a data engineering project? I will show you! Do not make this mistake! First of all you should not make the mistake that unfortunately many people make! Often people want to build the whole thing from the beginning. They say: "Okay I need to do a project. I need to make a big thing. I don't even know what data and what tools I want to use.

article thumbnail

Underscores are Overloaded in Scala!

Rock the JVM

Scala syntax can be confusing: discover almost all uses of underscores and why understanding their inconsistent philosophy is worthwhile

Scala 52
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

The Future Of The Telco Industry And Impact Of 5G & IoT – Part 3

Cloudera

Article 3. The Future Of The Telco Industry And Impact Of 5G & IoT – Part 3. In the final installment in the series, Vijay Raja, Director of Industry & Solutions Marketing at Cloudera shares his views on how the telecom sector is changing and where it goes next. Hi Vijay, thank you so much for joining us again. To continue where we left off, how are ML and IoT influencing the Telecom sector, and how is Cloudera supporting this industry evolution?

article thumbnail

How real-time stream processing works with ksqlDB, in 7 animations

Confluent

ksqlDB, the event streaming database, is becoming one of the most popular ways to work with Apache Kafka®.

Process 52
article thumbnail

Why You Need Data Engineers And Data Scientists To Be Successful!

Team Data Science

Data Science , Artificial Intelligence and Machine Learning. These topics are currently the hype in the field of Data Science. Everyone wants to become a Data Scientist. But isn't the work being done in the field of Data Engineereing the real MVP? Isn't it important to have Data Scientists AND Data Engineers on board to make a project successful? Yes, it is!

article thumbnail

Underscores are Overloaded in Scala!

Rock the JVM

Scala syntax can be confusing: discover almost all uses of underscores and why understanding their inconsistent philosophy is worthwhile

Scala 52
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

Discover and Explore Data Faster with the CDP DDE Template

Cloudera

From a-z in 10 minutes! It is hard to believe if you have had previous experience with setting up, sizing, and deploying a distributed search engine service that this is possible. Imagine how many times IT has lost valuable time spending hours trying to understand Apache Solr application requirements and map them into how to best size and deploy the Solr service.

article thumbnail

How ksqlDB works in 7 animations

Confluent

ksqlDB, the event streaming database, is becoming one of the most popular ways to work with Apache Kafka.

Kafka 52
article thumbnail

Back to school – CEOs need to learn a new language, fast!

Teradata

CEOs of banks know all about change. But the existential challenge posed by Big Tech requires a totally new set of skills. What do they need to learn to survive?

Banking 52
article thumbnail

Helping traditional organizations being more efficient

DareData

The world of data science and information technology is a constantly evolving landscape, where dozens of new tools and methodologies are created and updated daily, and many others quickly become obsolete. Every organization has their own ecosystem of applications, but even the most advanced organizations sometimes fall behind in certain areas when compared to the bleeding edge of technological advances.

article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

article thumbnail

CDP Private Cloud is a Game-changer for Partners

Cloudera

Recently, Cloudera announced the release of Cloudera CDP Private Cloud, delivering the final component of our hybrid cloud strategy. There’s nothing comparable to it in the industry. CDP Private Cloud offers benefits of a public cloud architecture—autoscaling, isolation, agile provisioning, etc.—in an on-premise environment. Additionally, lines of business (LOBs) are able to gain access to a shared data lake that is secured and governed by the use of Cloudera Shared Data Experience (SDX).

Cloud 70
article thumbnail

Preset & Superset User Documentation

Preset

Preset and Superset User Documentation is available for everyone that want to become a superset expert

52
article thumbnail

Teradata Dynamic Resource Optimization – Both On-Premises and in the Cloud

Teradata

With Teradata Vantage on Azure, customers have access to the same dynamic resource optimization tools that they have come to love with the added agility that Azure brings to the table.

Cloud 52
article thumbnail

Repartition vs Coalesce in Apache Spark

Rock the JVM

Clarifying the differences between two essential repartitioning operations in Apache Spark

52
article thumbnail

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

article thumbnail

Migration Supporting Real-Time Analytics for Customer Experience Management

Cloudera

Service Management Group ( SMG ) offers an easy-to-use experience management (XM) platform that combines end-to-end customer and employee experience management software with hands-on professional services to deliver actionable insights and help brands get smarter about their customers. The XM platform, smg360 , helps customers across verticals, including restaurants, retail, and healthcare, drive changes that boost loyalty and improve business outcomes. .

article thumbnail

Offload Real-Time Reporting and Analytics from MongoDB Using PostgreSQL

Rockset

MongoDB’s Advantages & Disadvantages MongoDB has comprehensive aggregation capabilities. You can run many analytic queries on MongoDB without exporting your data to a third-party tool. However, these aggregation queries are frequently CPU-intensive and can block or delay the execution of other queries. For example, Online Transactional Processing (OLTP) queries are usually short read operations that have direct impacts on the user experience.

MongoDB 40
article thumbnail

Larry H Miller Sports & Entertainment

Teradata

The Utah Jazz create winning customer experiences using Teradata Vantage on AWS with consumption pricing for flexible and elastic modern cloud analytics.

article thumbnail

Key Challenges with Quasi Experiments at Netflix

Netflix Tech

Kamer Toker-Yildiz , Colin McFarland , Julia Glick At Netflix, when we can’t run A/B experiments we run quasi experiments ! We run quasi experiments with various objectives such as non-member experiments focusing on acquisition, member experiments focusing on member engagement, or video streaming experiments focusing on content delivery. Consolidating on one methodology could be a challenge, as we may face different design or data constraints or optimization goals.

Media 75
article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.