Data Engineering Weekly #161

Data Engineering Weekly

Here is the agenda: 1) Data Application Lifecycle Management - Harish Kumar (PayPal). Hear from the team at PayPal on how they build data product lifecycle management (DPLM) systems. 4) Building Data Products and why should you? Nvidia: What Is Sovereign AI?

Running Unified PubSub Client in Production at Pinterest

Pinterest Engineering

For these reasons, and others detailed in our original PubSub Client blog post, our team has decided to invest in building, productionalizing, and most recently open-sourcing PubSub Client (PSC). years since our previous blog post, PSC has been battle-tested at large scale at Pinterest, with notably positive feedback and results.

Data Engineering Weekly #166

Data Engineering Weekly

Poor data quality and unclear data ownership remain the top challenges for data teams. “With Cube, we’ve been able to speed up time to release a new data model to production by 5x and decrease analytics downtime by 90%.” I strongly believe the concept of Data Product will play a bigger role in data engineering.

DoorDash identifies Five big areas for using Generative AI

DoorDash Engineering

In this blog post, we’ll examine how DoorDash hopes to leverage Generative AI and revolutionize the delivery experience. Enhancement of employee productivity: Generative AI can be used to accelerate DoorDash employees’ productivity by automating tasks such as SQL writing and document drafting.

Data Engineering Weekly #162

Data Engineering Weekly

Big thanks to our insightful speakers: Hareshkumar Selvakumar, who talked about his work on Data Products for PayPal. Major ML dataset repositories and frameworks support Croissant, simplifying the discovery, preparation, and utilization of datasets for machine learning practitioners by standardizing and organizing them.

Streams Replication Manager Prefixless Replication

Cloudera

Replication is a crucial capability in distributed systems to address challenges related to fault tolerance, high availability, load balancing, scalability, data locality, network efficiency, and data durability. release) Remote topic discovery: SRM needs to know which topics are replicas and what their respective source topics are.

A Closer Look at The Next Phase of Cloudera’s Hybrid Data Lakehouse

Cloudera

Faced with unique challenges around distributed data infrastructures, governance, and an evolving security landscape, enterprises need the right support to fully tap into AI quickly. This marks a significant milestone for the platform: according to IDC, today about half of the world’s enterprise production data under management is on-prem.