Remove Coding Remove Definition Remove Metadata Remove Pipeline-centric
article thumbnail

Rebuilding Netflix Video Processing Pipeline with Microservices

Netflix Tech

The Netflix video processing pipeline went live with the launch of our streaming service in 2007. By integrating with studio content systems, we enabled the pipeline to leverage rich metadata from the creative side and create more engaging member experiences like interactive storytelling.

Process 91
article thumbnail

Toward a Data Mesh (part 2) : Architecture & Technologies

François Nguyen

You are starting to be an operation or technology centric data team. This is really for us the definition of a self serve platform. ” Code : all the code necessary to build a data product (data pipelines, API, policies). To get out of this, you have to move to another stage : the serverless stage.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Experts Share the 5 Pillars Transforming Data & AI in 2024

Monte Carlo

Gen AI can whip up serviceable code in moments — making it much faster to build and test data pipelines. Just like at first everyone had to code in a language, then everyone had to know how to incorporate packages from those languages — now we’re moving into, ‘ How do you incorporate AI that will write the code for you?’”

article thumbnail

97 things every data engineer should know

Grouparoo

This provided a nice overview of the breadth of topics that are relevant to data engineering including data warehouses/lakes, pipelines, metadata, security, compliance, quality, and working with other teams. For example, grouping the ones about metadata, discoverability, and column naming might have made a lot of sense.

article thumbnail

The Rise of the Data Engineer

Maxime Beauchemin

Like data scientists, data engineers write code. There’s a multitude of reasons why complex pieces of software are not developed using drag and drop tools: it’s that ultimately code is the best abstraction there is for software. They’re highly analytical, and are interested in data visualization.

article thumbnail

How Airbnb Standardized Metric Computation at Scale

Airbnb Tech

Specifically, we will showcase how we standardize dataset definitions through declarative configurations, explain how data versioning enables us to ensure cross-dataset consistency, and illustrate how we backfill data efficiently with zero downtime. Anyone can look up definitions without confusion.

article thumbnail

Using DataOps to Drive Agility and Business Value

DataKitchen

Whether it’s streaming, batch, virtualized or not, using active metadata, or just plain old regular coding, it provides a good way for the data and analytics team to add continuous value to the organization.”. Their definition of DataOps was that we do some automation and check a record count. Be business-centric.