June, 2018

CockroachDB In Depth with Peter Mattis - Episode 35

Data Engineering Podcast

Summary: With the increased ease of gaining access to servers in data centers across the world has come the need for supporting globally distributed data storage. With the first wave of cloud-era databases, the ability to replicate information geographically came at the expense of transactions and familiar query languages. To address these shortcomings, the engineers at Cockroach Labs have built CockroachDB, a globally distributed SQL database with full ACID semantics.

JVM Profiler: An Open Source Tool for Tracing Distributed JVM Applications at Scale

Uber Engineering

Computing frameworks like Apache Spark have been widely adopted to build large-scale data applications. For Uber, data is at the heart of strategic decision-making and product development. To help us better leverage this data, we manage massive deployments of Spark …

Top AWS Certifications-Which one should I choose?

ProjectPro

AWS certifications are the most in-demand cloud computing certifications in the IT industry today, with an overwhelming growth in cloud computing. So, for those looking for a career in Amazon Web Services, this blog lists the best AWS certifications available today, including the cost, duration, and topics covered in each certification exam. With everyone from Netflix to American Airlines signing up to the cloud to keep things from crumbling into pieces, organizations are running into a signific

Introducing Blended Learning From Cloudera University

Cloudera

Over the past decade, Cloudera University has taught more than 50,000 developers, administrators, analysts, and data scientists how to apply big data technologies. Developers are learning the APIs, so they can create new applications that were never before possible. Administrators learn to plan, install, monitor, and troubleshoot clusters. And analysts discover the power of SQL over large, diverse datasets.

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.

The State of Open Source

Zalando Engineering

The evolution and future of open source at Zalando. Open source software has been the core of Zalando’s tech stack since the company’s humble beginnings, selling flip-flops from a basement 10 years ago; it’s part of our DNA as a tech company. For engineering teams at Zalando, open source is a natural part of how we solve problems: we consult and share the TechRadar for guidance on appropriate technologies to use, we contribute to projects such as Kubernetes, and work in the open on a very large

Programming Best Practices For Data Science

Dataquest

The data science life cycle is generally comprised of the following components: data retrieval, data cleaning, data exploration and visualization, and statistical or predictive modeling. While these components are helpful for understanding the different phases, they don’t help us think about our programming workflow. Often, the entire data science life cycle ends up as an arbitrary mess of notebook cells in either a Jupyter Notebook or a single messy script.

More Trending

Recap of Hadoop News for May 2018

ProjectPro

News on Hadoop - May 2018. Data-Driven HR: How Big Data And Analytics Are Transforming Recruitment. Forbes.com, May 4, 2018. With platforms like LinkedIn and Glassdoor giving every employer access to valuable big data, the world of recruitment is transforming into intelligent recruitment. HR teams that make use of big data in the future are likely to be successful in recruiting the right talent in the coming years.

All Aboard

Zalando Engineering

What new tech employees can expect from Zalando onboarding. So, you’ve applied for a technical role at Zalando and you’ve just accepted the offer! If you’re wondering what to expect, look no further. We are excited to share a peek behind the scenes, so you can see what awaits you in the first few weeks of this journey, regardless of whether you’re joining in Berlin, Dortmund, Dublin, Hamburg, Helsinki or Lisbon, to make sure you’re well-equipped to dive into life at Zalando.

Loading Time Matters

Zalando Engineering

How Zalando's overall site speed improved by more than 25% in five months. We all know that providing a fast user experience is key. Still, it was somewhat of a wake-up call for us last fall when we saw our aggregated loading time increasing; not because we had increased latency in our systems but simply because the share of mobile visits kept increasing.

Package Management And Distribution For Your Data Using Quilt with Kevin Moore - Episode 37

Data Engineering Podcast

Summary: Collaboration, distribution, and installation of software projects is largely a solved problem, but the same cannot be said of data. Every data team has a bespoke means of sharing data sets, versioning them, tracking related metadata and changes, and publishing them for use in the software systems that rely on them. The CEO and founder of Quilt Data, Kevin Moore, was sufficiently frustrated by this problem to create a platform that attempts to be the means by which data can be as collabo

Launching LLM-Based Products: From Concept to Cash in 90 Days

Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage

Christophe Louvion, Chief Product & Technology Officer of NRC Health, is here to take us through how he guided his company's recent experience of getting from concept to launch and sales of products within 90 days. In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.

User Analytics In Depth At Heap with Dan Robinson - Episode 36

Data Engineering Podcast

Summary: Web and mobile analytics are an important part of any business, and difficult to get right. The most frustrating part is when you realize that you haven’t been tracking a key interaction, having to write custom logic to add that event, and then waiting to collect data. Heap is a platform that automatically tracks every event so that you can retroactively decide which actions are important to your business and easily build reports with or without SQL.

The Intrapreneurship Journey at Zalando

Zalando Engineering

Sharing our innovation stories: successes, failures, and learnings. Franzi, Humberto, Neil, Lenia, Vivek. These are just some of the names of the people who are willing to put in the extra effort and run the additional mile to impact the organization in a way they haven’t done before. The stories of these Zalando intrapreneurs are the ones I summarized at the Innov8rs conference in Madrid.

Turning petabytes of pharmaceutical data into actionable insights

Cloudera

Authors: Mai N. Nguyen, Accenture & Mitch Gomulinski, Cloudera. Imagine storing the DNA of the entire population of the US – and then cloning them, twice. That’s the equivalent of 1 petabyte (ComputerWeekly) – the amount of unstructured data available within our large pharmaceutical client’s business. Then imagine the insights that are locked in that massive amount of data.

The cost of not embarking on a customer 360 strategy

Cloudera

Gartner’s recently released report “Master Data Management Forms the Basis of a Trusted 360-Degree View of the Customer” shares the results of an executive survey highlighting several key points, including that customer initiatives are among CEOs’ top five priorities in 2018. The report includes numerous strategic recommendations and outlines the impact of a Master Data Management (MDM) strategy.

How To Speak The Language Of Financial Success In Product Management

Speaker: Jamie Bernard

Success in product management goes beyond delivering great features - it’s about achieving measurable financial outcomes that resonate across the organization. By connecting your product’s journey with the company’s financial success, you’ll ensure that every feature, release, and innovation contributes to the bottom line, driving both customer satisfaction and business growth.