Low Code And High Quality Data Engineering For The Whole Organization With Prophecy

Data Engineering Podcast

Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. You listen to this show to learn about all of the latest tools, patterns, and practices that power data engineering projects across every domain.

Troubleshooting Kafka In Production

Data Engineering Podcast

Summary: Kafka has become a ubiquitous technology, offering a simple method for coordinating events and data across different systems. Data lakes are notoriously complex. What motivated you to write a book about how to manage Kafka in production? There are many options now for persistent data queues.

Ethics Sheet for AI-assisted Comic Book Art Generation

Cloudera

This blog is intended to serve as an ethics sheet for the task of AI-assisted comic book art generation, inspired by “Ethics Sheets for AI Tasks.” AI-assisted comic book art generation is a task I proposed in a blog post I authored on behalf of my employer, Cloudera. Introduction. Scope, motivation, and benefits.

Designing A Non-Relational Database Engine

Data Engineering Podcast

Data lakes are notoriously complex. For data engineers who battle to build and scale high quality data workflows on the data lake, Starburst powers petabyte-scale SQL analytics fast, at a fraction of the cost of traditional methods, so that you can meet all your data needs ranging from AI to data applications to complete analytics.

Six Books that Have Shaped My Data Career

Towards Data Science

If you’re interested in those early days, how I grew my career, and advice for newcomers to data, take a look at my earlier article. In this article, I want to focus on my on-again, off-again relationship with books and reading. Even if you haven’t read any of the books below, you’ve probably at least heard of some of them.

[O’Reilly Book] Chapter 1: Why Data Quality Deserves Attention Now

Monte Carlo

These conversations inspired us to write this book to convey some of the best practices we’ve learned and developed related to managing data quality at each stage of the data pipeline, from ingestion to analytics, and share how data teams in similar situations may be able to prevent their own data downtime.

Unlocking Your dbt Projects With Practical Advice For Practitioners

Data Engineering Podcast

Data lakes are notoriously complex. For data engineers who battle to build and scale high quality data workflows on the data lake, Starburst powers petabyte-scale SQL analytics fast, at a fraction of the cost of traditional methods, so that you can meet all your data needs ranging from AI to data applications to complete analytics.