article thumbnail

Ethics Sheet for AI-assisted Comic Book Art Generation

Cloudera

This blog is intended to serve as an ethics sheet for the task of AI-assisted comic book art generation, inspired by “ Ethics Sheets for AI Tasks.” AI-assisted comic book art generation is a task I proposed in a blog post I authored on behalf of my employer, Cloudera. Introduction. Scope, motivation, and benefits.

article thumbnail

[O’Reilly Book] Chapter 1: Why Data Quality Deserves Attention Now

Monte Carlo

Your downstream data consumers including product analysts, marketing leaders, and sales teams rely on data-driven tools like CRMs, CXPs, CMSs, and any other acronym under the sun to do their jobs quickly and effectively. But what happens when the data is wrong?

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The Significance of O’Reilly’s Data Quality Fundamentals

Monte Carlo

Now, roughly two years and 300 pages later, I’m thrilled to announce Data Quality Fundamentals is now available both online and in print. It’s our hope this book will prepare the next generation of data teams as they drive data product development and analytics strategy forward.

article thumbnail

5 Skills Data Engineers Should Master to Keep Pace with GenAI

Monte Carlo

Organizations need to connect LLMs with their proprietary data and business context to actually create value for their customers and employees. They need robust data pipelines, high-quality data, well-guarded privacy, and cost-effective scalability. Data engineers. Who can deliver?

article thumbnail

Understanding Generative AI: A Comprehensive Guide

Edureka

GANs, or generative adversarial networks GANs, first developed by Ian Goodfellow in 2014, comprise a Discriminator network that assesses the data and a Generator network that generates it. The generator produces high-quality data because the two networks are trained together in a game-like setting.

article thumbnail

How to Use DBT to Get Actionable Insights from Data?

Workfall

DBT’s superpowers include seamlessly connecting with databases and data warehouses, performing amazing transformations, and effortlessly managing dependencies to ensure high-quality data. Each successful deployment enriches its data ecosystem, empowering decision-makers with valuable, up-to-date insights.

article thumbnail

Troubleshooting Kafka In Production

Data Engineering Podcast

Summary Kafka has become a ubiquitous technology, offering a simple method for coordinating events and data across different systems. Data lakes are notoriously complex. What motivated to write a book about how to manage Kafka in production? There are many options now for persistent data queues.

Kafka 245