Data Engineering Weekly #165

Data Engineering Weekly

Intuit: How Intuit data analysts write SQL 2x faster with the internal GenAI tool. The productivity increase with GenAI is undeniable, and several startups are trying to solve the Text2SQL generation problem. The blog further emphasizes its increased investment in Data Mesh and clean data.

Ensuring the Successful Launch of Ads on Netflix

Netflix Tech

We used this simulation to help us surface problems of scale and validate our Ads algorithms. Replaying real traffic and making it appear as Basic with ads traffic was a better solution than artificially simulating Netflix traffic. We used this information to simulate a subscriber population through our AB testing platform.

Data Engineering Weekly #151

Data Engineering Weekly

GitHub writes an excellent blog to capture the current state of the LLM integration architecture. Netflix: Incremental Processing using Netflix Maestro and Apache Iceberg. Netflix writes about its incremental processing design with its orchestration engine Maestro on top of Iceberg.

An Engineering Guide to Data Quality - A Data Contract Perspective - Part 2

Data Engineering Weekly

I won’t bore you with the importance of data quality in the blog. Let’s dive into both architectural patterns and see how to adopt them in real-time and batch data processing. Since Kafka is almost synonymous with real-time data processing, we often call this a “Fronting Kafka” pattern.

Auto-Diagnosis and Remediation in Netflix Data Platform

Netflix Tech

By Vikram Srivastava and Marcelo Mayworm. Netflix has one of the most complex data platforms in the cloud, on which our data scientists and engineers run batch and streaming workloads. As our subscribers grow worldwide and Netflix enters the world of gaming, the number of batch workflows and real-time data pipelines increases rapidly.

Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset

Rockset

It powers stream processing at many companies including Uber, Netflix, and LinkedIn. Rockset customers using Flink often share how challenging it is to self-manage Flink for streaming transformations. Today, Confluent announced the general availability of its serverless Apache Flink service. What is RAG?

1. Streamlining Membership Data Engineering at Netflix with Psyberg

Netflix Tech

By Abhinaya Shetty, Bharath Mummadisetty. At Netflix, our Membership and Finance Data Engineering team harnesses diverse data related to plans, pricing, membership life cycle, and revenue to fuel analytics, power various dashboards, and make data-informed decisions. How does late-arriving data impact us? Let’s dive in!