Sat.Jun 20, 2020 - Fri.Jun 26, 2020

AWS Account

Start Data Engineering

1. AWS account: Sign up for an AWS account at AWS Sign Up. First-time sign-ups are eligible for some free services (ref: AWS Free Tier). Get your access key by clicking on your name -> My Security Credentials in the top pane, then clicking Create New Access Key.
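Once an access key exists, it is commonly kept in an INI-style credentials file with a `[default]` profile. The sketch below is only an illustration of reading such a file with Python's standard library; the helper name `load_aws_credentials` and the key values are hypothetical placeholders, not real credentials or part of the original article.

```python
import configparser

# Placeholder credentials text in the INI layout used by AWS credentials files.
# The values are made up for illustration only.
SAMPLE = """
[default]
aws_access_key_id = AKIAEXAMPLEKEYID
aws_secret_access_key = exampleSecretKeyValue
"""


def load_aws_credentials(text, profile="default"):
    """Parse credentials-file text and return (access_key_id, secret_access_key)."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser[profile]  # raises KeyError if the profile is missing
    return section["aws_access_key_id"], section["aws_secret_access_key"]


key_id, secret = load_aws_credentials(SAMPLE)
print(key_id)
```

Keeping the key in a file outside the project directory, rather than in source code, makes it less likely to be committed by accident.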

AWS 130

Modernization Means Simplicity and Sophistication

Teradata

When it comes to being a modern data warehouse, your age really is just a number. It’s the underlying capabilities that actually count. Read more.

How Merging Companies Will Give Rise to Unified Data Streams

Confluent

Company mergers are becoming more common as businesses strive to improve performance and grow market share by saving costs and eliminating competition through acquisitions. But how do business mergers relate […].

Data 116

Bringing Business Analytics To End Users With GoodData

Data Engineering Podcast

Summary: The majority of analytics platforms are focused on internal use by an organization's business stakeholders. As the availability of data increases and overall literacy in how to interpret and act on it improves, there is a growing need to bring business intelligence use cases to a broader audience. GoodData is a platform focused on simplifying the work of bringing data to employees and end users.

AWS EMR

Start Data Engineering

EMR: AWS EMR is a managed service provided by AWS to run Spark, HDFS, Hive, and other select software.

AWS 130

How to Leverage Advanced Analytics in the Healthcare Domain

Teradata

Learn how Teradata Vantage's advanced analytics capabilities can analyze and predict useful diagnoses and insights in biomedicine and healthcare.

More Trending

PgBouncer on Kubernetes and how to achieve minimal latency

Zalando Engineering

Introduction: In the new Postgres Operator release 1.5 we have implemented a couple of interesting new features, including connection pooling support. Master Wq says there is "No greatest tool"; to run something successfully in production, one needs to understand its pros and cons. Let's dig into the topic and take a look at the performance aspect of connection pooler support, mostly from a scaling perspective.
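To make the idea of connection pooling concrete, here is a minimal in-process sketch: a fixed set of connections is opened once and handed out to callers, who block when all of them are busy. This is not how PgBouncer or the Postgres Operator is implemented; `ConnectionPool` and `fake_connect` are hypothetical names, and the "connections" are plain Python objects standing in for real database connections.

```python
import queue
from contextlib import contextmanager


class ConnectionPool:
    """Hand out a fixed number of pre-opened connections and reuse them."""

    def __init__(self, connect, size):
        self._idle = queue.Queue(maxsize=size)
        for _ in range(size):
            self._idle.put(connect())  # open every connection up front

    @contextmanager
    def connection(self, timeout=None):
        # Blocks until a connection is free; raises queue.Empty on timeout.
        conn = self._idle.get(timeout=timeout)
        try:
            yield conn
        finally:
            self._idle.put(conn)  # return the connection for reuse


opened = []


def fake_connect():
    """Stand-in for an expensive database connect call (assumption)."""
    opened.append(object())
    return opened[-1]


pool = ConnectionPool(fake_connect, size=2)
for _ in range(10):
    with pool.connection() as conn:
        pass  # a real caller would run a query here

print(len(opened))  # number of connections actually opened
```

Ten units of work share two connections here, which is the trade-off a pooler makes: fewer server-side connections in exchange for callers occasionally waiting for a free one.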

Getting Started - Time Series Charts

Preset

In this blog post we take a closer look at what time series are and provide some examples of time series visualizations in Superset.
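A time series is a sequence of values indexed by time, and a chart of one usually plots a metric aggregated per time bucket. As a rough illustration (not Superset's implementation), the sketch below sums values per calendar day; `aggregate_by_day` and the sample events are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def aggregate_by_day(events):
    """Sum (timestamp, value) pairs per calendar day, sorted by date."""
    totals = defaultdict(float)
    for timestamp, value in events:
        totals[timestamp.date()] += value
    return sorted(totals.items())


# Made-up sample data: three events across two days.
events = [
    (datetime(2020, 6, 20, 9, 0), 1.0),
    (datetime(2020, 6, 20, 17, 30), 2.0),
    (datetime(2020, 6, 21, 8, 15), 4.0),
]

for day, total in aggregate_by_day(events):
    print(day.isoformat(), total)
```

Each (day, total) pair corresponds to one point on a daily time series chart; a coarser or finer bucket only changes the key used for grouping.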

40

Data is the Prize and the Strategy

Teradata

Big Tech wants your data. It will monetize it & deliver great services to your customers. If banks can’t find a way to do the same, they should give up now.

Banking 80

Announcing the Snowflake Sink Connector for Apache Kafka in Confluent Cloud

Confluent

We are excited to announce the preview release of the fully managed Snowflake sink connector in Confluent Cloud, our fully managed event streaming service based on Apache Kafka®. Our managed […].

Kafka 105

Learnings from Distributed XGBoost on Amazon SageMaker

Zalando Engineering

Overview: XGBoost is a popular Python library for gradient boosted decision trees. The implementation allows practitioners to distribute training across multiple compute instances (or workers), which is especially useful for large training sets. One tool used at Zalando for deploying production machine learning models is SageMaker, the managed service from Amazon.

Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset

Rockset

When building data-driven applications, it’s been a common practice for years to move analytics away from the source database into either a slave, data warehouse or something similar. The main reason for this is that analytical queries, such as aggregations and joins, tend to require a lot more resources. When running, the detrimental impact on database performance could reverberate back to front-end users and have a negative impact on their experience.

MongoDB 40

Big Tech is Poised to Pounce on Banking

Teradata

COVID-19 has changed banking, possibly for ever. But as banks wrestle with the pandemic & its after-effects, they must also focus on a bigger, imminent threat to their existence – & it’s not from FinTechs.

Banking 80

Announcing ksqlDB 0.10.0

Confluent

We’re excited to announce the release of ksqlDB 0.10.0, available now in the standalone distribution and on Confluent Cloud! This version includes a first-class Java client, improved Apache Kafka® key […].

Java 92

Reducing the Total Cost of Operations for Self-Managed Apache Kafka

Confluent

We kicked off Project Metamorphosis last month by announcing a set of features that make Apache Kafka® more elastic, one of the most important traits of cloud-native data systems. This […].

Kafka 71