Remove fully-managed-apache-kafka-service
article thumbnail

Running Unified PubSub Client in Production at Pinterest

Pinterest Engineering

A central component of data ingestion infrastructure at Pinterest is our PubSub stack, and the Logging Platform team currently runs deployments of Apache Kafka and MemQ. years since our previous blog post, PSC has been battle-tested at large scale in Pinterest with notably positive feedback and results.

Kafka 98
article thumbnail

Revolutionizing Real-Time Streaming Processing: 4 Trillion Events Daily at LinkedIn

LinkedIn Engineering

Authors: Bingfeng Xia and Xinyu Liu Background At LinkedIn, Apache Beam plays a pivotal role in stream processing infrastructures that process over 4 trillion events daily through more than 3,000 pipelines across multiple production data centers.

Process 119
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

A Closer Look at The Next Phase of Cloudera’s Hybrid Data Lakehouse

Cloudera

Faced with unique challenges around distributed data infrastructures, governance, and an evolving security landscape, enterprises need the right support to fully tap into AI quickly. Cloudera is now the only provider to offer an open data lakehouse with Apache Iceberg for cloud and on-premises.

article thumbnail

Getting Started With Cloudera Open Data Lakehouse on Private Cloud

Cloudera

Cloudera recently released a fully featured Open Data Lakehouse , powered by Apache Iceberg in the private cloud, in addition to what’s already been available for the Open Data Lakehouse in the public cloud since last year. Cloudera Flow Management 2.1.6 and Cloudera Flow Management 2.1.6.

Cloud 76
article thumbnail

Simplify Metrics on Apache Druid With Rill Data and Cloudera

Cloudera

Cloudera has partnered with Rill Data, an expert in metrics at any scale, as Cloudera’s preferred ISV partner to provide technical expertise and support services for Apache Druid customers. We want Cloudera customers that rely on Apache Druid to know that their clusters are secure and supported by the Cloudera partner ecosystem.

BI 84
article thumbnail

Streams Replication Manager Prefixless Replication

Cloudera

Streams Replication Manager (SRM) is an enterprise-grade replication solution that enables fault tolerant, scalable, and robust cross-cluster Kafka topic replication. Introduction Kafka as an event streaming component can be applied to a wide variety of use cases. This makes it difficult to manage multiple clusters.

article thumbnail

What’s New in CDP Private Cloud Base 7.1.7?

Cloudera

Cloudera Manager enhancements for greater efficiency and hardened platform security. Apache Ozone enhancements deliver full High Availability providing customers with enterprise-grade object storage and compatibility with Hadoop Compatible File System and S3 API. . We expand on this feature later in this blog. x, and 6.3.x,

Cloud 97