Top Data Engineering Digest Kafka Data Collection Content for Week of Feb 29

Sat.Feb 29, 2020 - Fri.Mar 06, 2020

Easier Stream Processing On Kafka With ksqlDB

Data Engineering Podcast

MARCH 2, 2020

Summary Building applications on top of unbounded event streams is a complex endeavor, requiring careful integration of multiple disparate systems that were engineered in isolation. The ksqlDB project was created to address this state of affairs by building a unified layer on top of the Kafka ecosystem for stream processing. Developers can work with the SQL constructs that they are familiar with while automatically getting the durability and reliability that Kafka offers.

Kafka

Kafka Process PostgreSQL MySQL

Kafka Connect Elasticsearch Connector in Action

Confluent

MARCH 4, 2020

The Elasticsearch sink connector helps you integrate Apache Kafka® and Elasticsearch with minimum effort. You can take data you’ve stored in Kafka and stream it into Elasticsearch to then be […].

Kafka

Kafka IT Data

Join 16,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

MORE WEBINARS

Trending Sources

Analyzing GDPR Fines – who are largest violators?

KDnuggets

MARCH 6, 2020

Fines from the GDPR have been rolling in since its inception in 2018. This article investigates who are the largest penalty recipients by country, the amounts, and private individuals.

Webinars

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

MORE WEBINARS

Introducing Dispatch

Netflix Tech

MARCH 5, 2020

By Kevin Glisson, Marc Vilanova, Forest Monsen Netflix is pleased to announce the open-source release of our crisis management orchestration framework: Dispatch! Okay, but what is Dispatch? Put simply, Dispatch is: All of the ad-hoc things you’re doing to manage incidents today, done for you, and a bunch of other things you should’ve been doing, but have not had the time!

Metadata

Metadata AWS Management Architecture

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

Database

How to Connect Teradata Vantage to Azure Blob Storage to Query JSON Files

Teradata

MARCH 4, 2020

Many Teradata customers are interested in integrating Vantage with Microsoft Azure First Party Services. Check out this guide to help you get started.

Mock APIs vs. Real Backends – Getting the Best of Both Worlds

Confluent

MARCH 3, 2020

When building API-driven web applications, there is one key metric that engineering teams should minimize: the blocked factor. The blocked factor measures how much time developers spend in the following […].

Engineering

Engineering Building

More Trending

Open-Sourcing riskquant, a library for quantifying risk

Netflix Tech

MARCH 5, 2020

Netflix has a program in our Information Security department for quantifying the risk of deliberate (attacker-driven) and accidental… Continue reading on Netflix TechBlog ».

Programming

How to Repurpose Successful Database Techniques inside Teradata Vantage

Teradata

MARCH 2, 2020

Learn how Teradata's hashing algorithm is used to enhance the performance and ease-of-use of the Advanced SQL Engine.

Database

Database Algorithm SQL Engineering

Best Practices for Analyzing Kafka Event Streams

Rockset

MARCH 5, 2020

Apache Kafka has seen broad adoption as the streaming platform of choice for building applications that react to streams of data in real time. In many organizations, Kafka is the foundational platform for real-time event analytics, acting as a central location for collecting event data and making it available in real time. While Kafka has become the standard for event streaming, we often need to analyze and build useful applications on Kafka data to unlock the most value from event streams.

Kafka

Kafka Data Warehouse Data Lake Relational Database

On Spark, Hive, and Small Files: An In-Depth Look at Spark Partitioning Strategies

Airbnb Tech

MARCH 3, 2020

One of the most common ways to store results from a Spark job is by writing the results to a Hive table stored on HDFS. While in theory, managing the output file count from your jobs should be simple, in reality, it can be one of the more complex parts of your pipeline. Author : Zachary Ennenga Airbnb’s new office building, 650 Townsend Background At Airbnb, our offline data processing ecosystem contains many mission-critical, time-sensitive jobs — it is essential for us to maximize the stabilit

Datasets

Datasets Bytes Scala Data Engineering

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

Certification

How Netflix uses Druid for Real-time Insights to Ensure a High-Quality Experience

Netflix Tech

MARCH 3, 2020

By Ben Sykes Continue reading on Netflix TechBlog ».

Kafka

Data Engineering Digest

Sat.Feb 29, 2020 - Fri.Mar 06, 2020

Easier Stream Processing On Kafka With ksqlDB

Kafka Connect Elasticsearch Connector in Action

Webinars

Trending Sources

Analyzing GDPR Fines – who are largest violators?

Webinars

Introducing Dispatch

Get Better Network Graphs & Save Analysts Time

How to Connect Teradata Vantage to Azure Blob Storage to Query JSON Files

Mock APIs vs. Real Backends – Getting the Best of Both Worlds

Top February Stories: The Death of Data Scientists – will AutoML replace them?

More Trending

Top February Stories: The Death of Data Scientists – will AutoML replace them?

Open-Sourcing riskquant, a library for quantifying risk

How to Repurpose Successful Database Techniques inside Teradata Vantage

Best Practices for Analyzing Kafka Event Streams

On Spark, Hive, and Small Files: An In-Depth Look at Spark Partitioning Strategies

Understanding User Needs and Satisfying Them

How Netflix uses Druid for Real-time Insights to Ensure a High-Quality Experience

Stay Connected

Sat.Feb 29, 2020 - Fri.Mar 06, 2020

Easier Stream Processing On Kafka With ksqlDB

Kafka Connect Elasticsearch Connector in Action

Webinars

Trending Sources

Analyzing GDPR Fines – who are largest violators?

Webinars

Introducing Dispatch

Get Better Network Graphs & Save Analysts Time

How to Connect Teradata Vantage to Azure Blob Storage to Query JSON Files

Mock APIs vs. Real Backends – Getting the Best of Both Worlds

Top February Stories: The Death of Data Scientists – will AutoML replace them?

Sign up to get articles personalized to your interests!

More Trending

Top February Stories: The Death of Data Scientists – will AutoML replace them?

Open-Sourcing riskquant, a library for quantifying risk

How to Repurpose Successful Database Techniques inside Teradata Vantage

Best Practices for Analyzing Kafka Event Streams

On Spark, Hive, and Small Files: An In-Depth Look at Spark Partitioning Strategies

Understanding User Needs and Satisfying Them

How Netflix uses Druid for Real-time Insights to Ensure a High-Quality Experience

Stay Connected