June, 2018

Package Management And Distribution For Your Data Using Quilt with Kevin Moore - Episode 37

Data Engineering Podcast

Summary Collaboration, distribution, and installation of software projects is largely a solved problem, but the same cannot be said of data. Every data team has a bespoke means of sharing data sets, versioning them, tracking related metadata and changes, and publishing them for use in the software systems that rely on them. The CEO and founder of Quilt Data, Kevin Moore, was sufficiently frustrated by this problem to create a platform that attempts to be the means by which data can be as collabo

JVM Profiler: An Open Source Tool for Tracing Distributed JVM Applications at Scale

Uber Engineering

Computing frameworks like Apache Spark have been widely adopted to build large-scale data applications. For Uber, data is at the heart of strategic decision-making and product development. To help us better leverage this data, we manage massive deployments of Spark …

Top AWS Certifications-Which one should I choose?

ProjectPro

AWS certifications are the most in-demand cloud computing certifications in the IT industry today, with an overwhelming growth in cloud computing. So, for those looking for a career in Amazon Web Services, this blog lists the best AWS certifications available today, including the cost, duration, and topics covered in each certification exam. With everyone from Netflix to American Airlines signing up to the cloud to keep things from crumbling into pieces, organizations are running into a signific

Programming Best Practices For Data Science

Dataquest

The data science life cycle generally comprises the following components: data retrieval, data cleaning, data exploration and visualization, and statistical or predictive modeling. While these components are helpful for understanding the different phases, they don’t help us think about our programming workflow. Often, the entire data science life cycle ends up as an arbitrary mess of notebook cells in either a Jupyter Notebook or a single messy script.

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

Turning petabytes of pharmaceutical data into actionable insights

Cloudera

Authors: Mai N. Nguyen, Accenture & Mitch Gomulinski, Cloudera. Imagine storing the DNA of the entire population of the US – and then cloning them, twice. That’s the equivalent of 1 petabyte (ComputerWeekly) – the amount of unstructured data available within our large pharmaceutical client’s business. Then imagine the insights that are locked in that massive amount of data.

The State of Open Source

Zalando Engineering

The evolution and future of open source at Zalando. Open source software has been the core of Zalando’s tech stack since the company’s humble beginnings, selling flip-flops from a basement 10 years ago; it’s part of our DNA as a tech company. For engineering teams at Zalando, open source is a natural part of how we solve problems: we consult and share the TechRadar for guidance on appropriate technologies to use, we contribute to projects such as Kubernetes, and work in the open on a very large

CockroachDB In Depth with Peter Mattis - Episode 35

Data Engineering Podcast

Summary With the increased ease of gaining access to servers in data centers across the world has come the need for supporting globally distributed data storage. With the first wave of cloud-era databases, the ability to replicate information geographically came at the expense of transactions and familiar query languages. To address these shortcomings, the engineers at Cockroach Labs have built a globally distributed SQL database with full ACID semantics in CockroachDB.

ArangoDB: Fast, Scalable, and Multi-Model Data Storage with Jan Steeman and Jan Stücke - Episode 34

Data Engineering Podcast

Summary Using a multi-model database in your applications can greatly reduce the amount of infrastructure and complexity required. ArangoDB is a storage engine that supports document, key/value, and graph data formats, as well as being fast and scalable. In this episode Jan Steeman and Jan Stücke explain where Arango fits in the crowded database market, how it works under the hood, and how you can start working with it today.

Recap of Hadoop News for May 2018

ProjectPro

News on Hadoop - May 2018. Data-Driven HR: How Big Data And Analytics Are Transforming Recruitment. Forbes.com, May 4, 2018. With platforms like LinkedIn and Glassdoor giving every employer access to valuable big data, the world of recruitment is transforming into intelligent recruitment. HR teams that make use of big data in the future are likely to be successful in recruiting the right talent in the coming years.

The cost of not embarking on a customer 360 strategy

Cloudera

Gartner’s recently released report, “Master Data Management Forms the Basis of a Trusted 360-Degree View of the Customer,” shares the results of an executive survey highlighting several key points, including that customer initiatives are among CEOs’ top five priorities in 2018. The report includes numerous strategic recommendations and outlines the impact of a Master Data Management (MDM) strategy.

Prepare Now: 2025's Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

Introducing Blended Learning From Cloudera University

Cloudera

Over the past decade, Cloudera University has taught more than 50,000 developers, administrators, analysts, and data scientists how to apply big data technologies. Developers are learning the APIs, so they can create new applications that were never before possible. Administrators learn to plan, install, monitor, and troubleshoot clusters. And analysts discover the power of SQL over large, diverse datasets.

The Intrapreneurship Journey at Zalando

Zalando Engineering

Sharing our innovation stories: successes, failures, and learnings. Franzi, Humberto, Neil, Lenia, Vivek. These are just some of the names of the people who are willing to put in the extra effort and go the extra mile to impact the organization in a way they haven’t done before. The stories of these Zalando intrapreneurs are the ones I summarized at the Innov8rs conference in Madrid.

All Aboard

Zalando Engineering

What new tech employees can expect from Zalando onboarding. So, you’ve applied for a technical role at Zalando and you’ve just accepted the offer! If you’re wondering what to expect, look no further. We are excited to share a peek behind the scenes, so you can see what awaits you in the first few weeks of this journey, regardless of whether you’re joining in Berlin, Dortmund, Dublin, Hamburg, Helsinki or Lisbon, to make sure you’re well-equipped to dive into life at Zalando.

Loading Time Matters

Zalando Engineering

How Zalando's overall site speed improved by more than 25% in five months. We all know that providing a fast user experience is key. Still, it was somewhat of a wake-up call for us last fall when we saw our aggregated loading time increasing, not because we had increased latency in our systems but simply because the share of mobile visits kept increasing.

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.