December, 2017

Wallaroo with Sean T. Allen - Episode 12

Data Engineering Podcast

Summary: Data-oriented applications that need to operate on large, fast-moving streams of information can be difficult to build and scale due to the need to manage their state. In this episode Sean T. Allen, VP of engineering for Wallaroo Labs, explains how Wallaroo was designed and built to reduce the cognitive overhead of building this style of project.

Kafka 100

8 Key Facts You Should know if You are a HR Professional

U-Next

Two of the most common reasons people think they can be great HR professionals are that they are very organized and systematic or that they have good people skills. But these two qualities alone are not enough for anyone to make it big in a career in human resource management. The two attributes can land them jobs, but to move up the ladder they need qualities that set them apart from other employees.

Constant Gardening

Zalando Engineering

How effective management is a continuing story of growth. Producers’ Style: One of the things I struggled the most with in the past year was identifying the best way to lead my teams. I worked a lot on myself, observed my peers, and tried to learn from my leads, but in the end, I ran into the well-known dilemma: task-focused or people-focused management, which one is best?

Recap of Hadoop News for November 2017

ProjectPro

News on Hadoop - November 2017. IBM leads BigInsights for Hadoop out behind barn. Shots heard. theRegister.co.uk, November 8, 2017. IBM’s BigInsights for Hadoop sunset on December 6, 2017. IBM will not provide any further new instances for the basic plan of its data analytics platform. The existing instances will continue to be available on the Bluemix console as is from December 7, 2017 to November 7, 2018.

Hadoop 52

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

Apache Hadoop 3.0.0 is Generally Available!

Cloudera

The Apache Hadoop community recently released version 3.0.0 GA, the third major release in Hadoop’s 10-year history at the Apache Software Foundation. We covered earlier releases like 3.0.0-alpha1 and 3.0.0-alpha2 on the Cloudera Engineering blog, and 3.0.0 GA is bigger and better than ever. General availability (GA) marks a point of quality and stability for the release series that indicates it’s ready for broader use.

Hadoop 44

SiriDB: Scalable Open Source Timeseries Database with Jeroen van der Heijden - Episode 11

Data Engineering Podcast

Summary: Time series databases have long been the cornerstone of a robust metrics system, but the existing options are often difficult to manage in production. In this episode Jeroen van der Heijden explains his motivation for writing a new database, SiriDB, the challenges that he faced in doing so, and how it works under the hood. Preamble: Hello and welcome to the Data Engineering Podcast, the show about modern data infrastructure. When you’re ready to launch your next project you’ll

Database 100

More Trending

data.world with Bryon Jacob - Episode 9

Data Engineering Podcast

Summary: We have tools and platforms for collaborating on software projects and linking them together; wouldn’t it be nice to have the same capabilities for data? The team at data.world is building a platform to host and share data sets for public and private use that can be linked together to build a semantic web of information. The CTO, Bryon Jacob, discusses how the company got started, their mission, and how they have built and evolved their technical infrastructure.

5 Ways You Can Reduce Attrition Using HR Analytics

U-Next

One of the major misconceptions revolving around the human resource department is that it’s one of the most chilled out departments in the entire organization. Most people (I know you do too) think that the folks at HR have an easy life and that their primary responsibilities include organizing fun activities, team lunches, secret Santa games and more.

How Workforce Analysis will Change Your Life as an HR Manager

U-Next

Every plan and forecast needs a basis – a basis on which the plans will be estimated and discussed for efficiency and execution. You have to consider several factors and criteria to explore the possibilities of executing your plan and this requires a lot of data. By implementing data analysis techniques into workforce management and HR job processes, you can now analyze many aspects of your organization including staffing, retrenchment management, job satisfaction levels of employees and more.

Become a Competent HR Professional with HR Analytics

U-Next

This is the age of cutthroat competition. This is a time where resting is unthinkable. The moment you feel complacent is the time your competitors go up the corporate ladder and ultimately towards success. People are working harder than ever and are constantly on the lookout for newer ways to better themselves. They are in search of newer courses and technologies related to their fields to master and want to become better at what they do.

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Building a Self-Managed Shared Data Experience

Cloudera

Cloud promises many advantages as an environment for machine learning and analytics. Cloud makes it fast and easy to spin up resources for new applications. Cloud offers elasticity of those resources to efficiently support transient analytics workloads and data pipelines. Cloud offers self-service without waiting for IT infrastructure and operations teams.

“Comply, you must comply!” – How Nordea Bank deals with regulatory compliance

Cloudera

Regulatory compliance, like death and taxes, is something that is mandatory and the cost of doing business in the financial services industry. In fact, the only thing that’s guaranteed is that the rate of regulatory change will only continue to increase over time. How banks deal with regulatory compliance is actually changing for the better. With big data platforms, like ones based on open source software, banks can more effectively adapt to and quickly manage the wave of rules coming at them.

Banking 40

Plotting the data-driven journey

Cloudera

“Becoming data-driven is a multi-year journey, not a simple implementation.” It’s one of the first things we tell our customers. Acquiring and using data in a way that simply wasn’t possible until very recently requires a huge cultural shift. It’s far more than just technology. Last year our CTO and co-founder, Amr Awadallah, suggested in The Malaysian Digest that we may be in the early stages of a multi-decade revolution.

Hadoop 40

Manufacturing Insights that Transcend the Industry

Cloudera

Manufacturing as an industry has always been at the forefront of squeezing value from data. Instrumentation, highly connected systems, and automation have been part and parcel of manufacturing organisations for decades. Constrained by the state of technology more than cost, process optimisation was always achieved by making clever use of the data available and has given rise to completely new disciplines and applications.

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Three Ways Marketers Can Reach Data Nirvana

Cloudera

Data-driven marketing is the new black. As marketers uncover the magnitude of value they can derive from the data at their disposal, they will be faced with a new set of data management-related challenges that can either make or break their quest. IDC predicts that 163 zettabytes of data will be generated by 2025, uncovering a new world of consumer insights and business possibilities.

Pattern Recognition: ‘Fake news’ and other Machine Learning use cases

Cloudera

Here’s a puzzle for you: (1) 3, 1, 2, 0, 1, -1, ? (2) 240, 48, 12, 4, 2, ? (3) 2, 5, 11, 17, 23, 31, ? Find the pattern? You may end up scratching your head a bit, but many of you will figure it out, without even pulling out a calculator. We humans are pretty good at pattern detection. And, we often use some form of pattern detection in our daily lives to help us solve problems or predict our next-best course of action.

Cloudera’s Diversity & Inclusion Efforts Recognized by Watermark and the Human Rights Campaign Foundation

Cloudera

From left to right: Nirupama Kamat, Anne Smith, Aimee Schneider, Wendi Durnin, Britt Sellin, Saketa Chandra Chalamchala, Josh Blackburn, Mike Olson, Kathy Erickson, Hilary Mason, Lauren Tipton. Not pictured: Kathyrn Hirsch. Mike Olson, Amr Awadallah, and the rest of Cloudera’s early leaders laid the foundation for a culture of respect, openness, and inclusion from day one.

Strata Data Singapore 2017: Big Data, Safe Data, Cloud Data

Cloudera

If you’re going to Strata Data Singapore 2017 at the Suntec Singapore Convention & Exhibition Centre, here are four sessions to attend that cover various combinations of my favorite themes: big data, safe data, and cloud data. Hear our ideas, bring your questions, and enjoy the show. Pro-tip: I know these speakers would be very happy to meet with you in smaller groups too.

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

The Beginner’s Guide to HR Analytics and Data Powered Talent Management

U-Next

When Chandler said he was into data reconfiguration and stuff, he passed it off as dull and boring. But the fact is that data has always been crucial to mankind in business operations since then and even before. Today, data has become a key ingredient to the successful operations of many companies, businesses, and startups. As more organizations wake up to the inevitability of data, this is the time businesses also pressurize one of the core departments of any business – the HR – to implement da

AngularConnect 2017

Zalando Engineering

Highlights and Takeaways from Europe's Largest Angular Conference. Just a week after Angular Version 5 was released, I had the pleasure to attend AngularConnect 2017. AngularConnect is Europe's largest Angular conference. It is a multi-track conference so I could not attend all the sessions, but the talks I saw were amazing and offered great content.

Surviving Data Loss

Zalando Engineering

Backing up Apache Kafka and Zookeeper to S3. What is Apache Kafka? Apache Kafka is a distributed streaming platform used for building real-time data pipelines and streaming applications. It is horizontally scalable, fault-tolerant, and wicked fast. It runs in production in many companies. Backups are important with any kind of data. Apache Kafka lowers this risk of data loss with replication across brokers.

Kafka 40

Introducing: Helsinki’s 100th Employee

Zalando Engineering

In conversation with Full Stack Engineer, Maksim Ekimovskii. Yesterday, Finland celebrated the 100th anniversary of its independence. To join our Helsinki hub’s celebrations, we spoke to the 100th employee, Maksim Ekimovskii. A full stack engineer and passionate videographer, Maksim tells us about his journey with Zalando so far. Chillin' out Maksim, relaxin' all cool. Tell us a little about yourself.

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

A Recipe for Kafka Lag Monitoring

Zalando Engineering

A closer look at the ingredients needed for ultimate stability. This is part of a series of posts on Kafka. See Ranking Websites in Real-time with Apache Kafka’s Streams API for the first post in the series. Remora is a small application for monitoring Kafka. Because many teams deploy it to their production environments, open sourcing it made sense.

Kafka 40

Opening the Shrinking Global Space for Civil Society

Cloudera

First they came for the Socialists, and I did not speak out—Because I was not a Socialist. Then they came for the Trade Unionists, and I did not speak out—Because I was not a Trade Unionist. Then they came for the Jews, and I did not speak out—Because I was not a Jew. Then they came for me—and there was no one left to speak for me. Most of you will likely be familiar with some version of this poem.

Media 40