Data Reprocessing Pipeline in Asset Management Platform @Netflix

Netflix Tech

This platform has evolved from supporting studio applications to also supporting data science and machine-learning applications that discover asset metadata and build various data facts. During this evolution, we often receive requests to update existing asset metadata or to add new metadata for newly added features.

The Importance of Distributed Tracing for Apache-Kafka-Based Applications

Confluent

Apache Kafka®-based applications stand out for their ability to decouple producers and consumers using an event log as an intermediate layer. This enables choreographed service collaborations, where many components can subscribe to events stored in the event log and react to them asynchronously.

Monitoring Data Replication in Multi-Datacenter Apache Kafka Deployments

Confluent

Instead of having many point-to-point connections between sites, the Confluent Platform provides an integrated event streaming architecture with frictionless data replication between sites. Useful to know in case of a disaster event. How far behind is data replication?

Rockset Enhances Kafka Integration to Simplify Real-Time Analytics on Streaming Data

Rockset

We’re introducing a new Rockset Integration for Apache Kafka that offers native support for Confluent Cloud and Apache Kafka, making it simpler and faster to ingest streaming data for real-time analytics. With the Kafka Integration, users no longer need to build, deploy or operate any infrastructure component on the Kafka side.

Sysmon Security Event Processing in Real Time with KSQL and HELK

Confluent

During a recent talk titled Hunters ATT&CKing with the Right Data, which I presented with my brother Jose Luis Rodriguez at ATT&CKcon, we talked about the importance of documenting and modeling security event logs before developing any data analytics while preparing for a threat hunting engagement.

Building Real-time Machine Learning Foundations at Lyft

Lyft Engineering

The Event Driven Decisions capability in particular turned out to be general enough to apply to a wide range of use cases. The interface was designed so that only a minimal amount of metadata is needed to construct a pipeline object that performs a given capability. One key component is the Analytics Event Abstraction layer.

Solving Data Lineage Tracking And Data Discovery At WeWork

Data Engineering Podcast

The solution to discoverability and tracking of data lineage is to incorporate a metadata repository into your data platform. The metadata repository serves as a data catalog and a means of reporting on the health and status of your datasets when it is properly integrated into the rest of your tools.
