Remove co-partitioning-in-kafka-streams
article thumbnail

Druid Deprecation and ClickHouse Adoption at Lyft

Lyft Engineering

In this particular blog post, we explain how Druid has been used at Lyft and what led us to adopt ClickHouse for our sub-second analytic system. Real-time Ingestion Events from our real-time analytics pipeline were configured to be sent into our internal Flink application, streamed to Kafka, and written into Druid.

Kafka 104
article thumbnail

Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data

Rockset

We’re introducing a new Rockset Integration for Apache Kafka that offers native support for Confluent Cloud and Apache Kafka, making it simpler and faster to ingest streaming data for real-time analytics. With the Kafka Integration, users no longer need to build, deploy or operate any infrastructure component on the Kafka side.

Kafka 52
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Why Mutability Is Essential for Real-Time Data Analytics

Rockset

We'll be publishing more posts in the series in the near future, so subscribe to our blog so you don't miss them! A platform such as Apache Kafka/Confluent , Spark or Amazon Kinesis for publishing that stream of event data. Event streaming/stream processing has been around for almost a decade. It’s well understood.

article thumbnail

A Reference Architecture for the Cloudera Private Cloud Base Data Platform

Cloudera

This blog post provides an overview of best practice for the design and deployment of clusters incorporating hardware and operating system configuration, along with guidance for networking and security as well as integration with existing enterprise infrastructure. Introduction and Rationale. Private Cloud Base Overview. Role allocation.

article thumbnail

Running Kafka Streams applications in AWS

Zalando Engineering

Second in our series about the use of Apache Kafka’s Streams API by Zalando This is the second in a series about the use of Apache Kafka’s Streams API by Zalando, Europe’s leading online fashion platform. See Ranking Websites in Real-time with Apache Kafka’s Streams API for the first post in the series.

Kafka 40
article thumbnail

Optimizing Kafka Streams Applications

Confluent

With the release of Apache Kafka ® 2.1.0, Kafka Streams introduced the processor topology optimization framework at the Kafka Streams DSL layer. This framework opens the door for various optimization techniques from the existing data stream management system (DSMS) and data stream processing literature.

Kafka 90
article thumbnail

Upscaling LinkedIn's Profile Datastore While Reducing Costs

LinkedIn Engineering

Co-Authors: Estella Pham and Guanlin Lu At peak, LinkedIn serves over 1.4 In this blog post we’ll discuss our decision to leverage Couchbase, the challenges that arose, and how we addressed each challenge in our final solution. million member profiles per second. The number of requests to our storage infrastructure doubles every year.

Database 133