article thumbnail

Metadata Management And Integration At LinkedIn With DataHub

Data Engineering Podcast

The key to those solutions is a robust and flexible metadata management system. LinkedIn has gone through several iterations on the most maintainable and scalable approach to metadata, leading them to their current work on DataHub. What were you using at LinkedIn for metadata management prior to the introduction of DataHub?

Metadata 100
article thumbnail

Why Column-Aware Metadata Is Key to Automating Data Transformations

Snowflake

IoT devices in every industry; geolocation information on our phones, watches, cars, and every other mobile device; every website or app we access—all are collecting data. A single organization may have access to millions of attributes. For the future, our automation tools must collect and manage metadata at the column level.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Build Data Products Without A Data Team Using AgileData

Data Engineering Podcast

Summary Building data products is an undertaking that has historically required substantial investments of time and talent. Shane Gibson co-founded AgileData to make analytics accessible to companies of all sizes. Atlan is the metadata hub for your data ecosystem. Can you describe what AgileData is and the story behind it?

Building 130
article thumbnail

Build AI-powered Recommendations with Confluent Cloud for Apache Flink® and Rockset

Rockset

Building a real-time, contextual and trustworthy knowledge base for AI applications revolves around RAG pipelines. What are the challenges building RAG pipelines? When you are building applications for consistent, real-time performance at scale you will want to use a streaming-first architecture.

Cloud 64
article thumbnail

Building a Data Platform in 2024

Towards Data Science

How to build a modern, scalable data platform to power your analytics and data science projects (updated) Table of Contents: What’s changed? Orchestration I mentioned modularity as a core concept of building a modern data platform in my 2021 article, but I failed to emphasize the importance of data orchestration.

article thumbnail

Building A Data Governance Bridge Between Cloud And Datacenters For The Enterprise At Privacera

Data Engineering Podcast

In this episode Balaji Ganesan shares how his experiences building and maintaining Ranger in previous roles helped him understand the needs of organizations and engineers as they define and evolve their data governance policies and practices. Acryl]([link] The modern data stack needs a reimagined metadata management platform.

article thumbnail

Build and deploy ML with ease Using Snowpark ML, Snowflake Notebooks, and Snowflake Feature Store

Snowflake

And because the Notebook is natively integrated into Snowflake’s role-based access controls (RBAC), it’s easy to securely share and collaborate on your code and results without compromising on any enterprise data. What’s Next? Check out the Snowpark ML demo from Snowday to see the latest launches in action.

Building 103