The fancy data stack—batch version

Christophe Blefari

This is the first article of the Data News Summer Edition: how to build a data platform. A lot of logos and products will be mentioned. As a disclaimer, this may not quite make sense in a corporate context, but since this is my blog, I'll do what I want. This is not a paid article.

Data News — Week 23.08

Christophe Blefari

This is something I struggle with: I really like writing, I really like this newsletter, and I really like the blog, but it takes me one day per week to get it done. A bit of infrastructure: this week I've seen a lot of articles that I can put under the infrastructure category, so here we are. I'm open to all honest feedback.

Use Consistent And Up To Date Customer Profiles To Power Your Business With Segment Unify

Data Engineering Podcast

Summary: Every business has customers, and a critical element of success is understanding who they are and how they are using the company's products or services. Segment created the Unify product to reduce the burden of building a comprehensive view of customers and synchronizing it to all of the systems that need it.

Building DoorDash’s Product Knowledge Graph with Large Language Models

DoorDash Engineering

DoorDash’s retail catalog is a centralized dataset of essential product information for all products sold by new verticals merchants – merchants operating a business other than a restaurant, such as a grocery, a convenience store, or a liquor store. Better personalization.

An ML based approach to proactive advertiser churn prevention

Pinterest Engineering

Erika Sun, ML Engineer | Advertiser Growth Modeling Team; Ogheneovo Dibie, Engineering Manager | Advertiser Growth Modeling Team. Summary: In this blog post, we describe a Machine Learning (ML) powered proactive churn prevention solution that was prototyped with our small & medium business (SMB) advertisers.

Data Engineering in Retrospect: Key Trends and Patterns of 2023

Data Engineering Weekly

The data industry clearly understands the power of blob storage, and using S3 as a database is not a new concept either. The companies backing Apache Hudi and Iceberg write articles comparing ACID support in both platforms. Databricks is already building a lot of these move-up-the-stack products.

Viral spam content detection at LinkedIn

LinkedIn Engineering

On the LinkedIn platform, members from around the world share their knowledge and perspectives and discuss topics important to them. There are rare occasions when content uploaded on our platform goes undetected by our current defense mechanisms, which may result in it being shared across the platform.