Remove 2022 Remove Blog Remove Data Warehouse Remove Metadata
article thumbnail

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Data Science Blog: Data Engineering

Enter AnalyticsCreator AnalyticsCreator, a powerful tool for data management, brings a new level of efficiency and reliability to the CI/CD process. It offers full BI-Stack Automation, from source to data warehouse through to frontend. It supports a holistic data model, allowing for rapid prototyping of various models.

article thumbnail

The Future of the Data Lakehouse – Open

Cloudera

These lakes power mission critical large scale data analytics, business intelligence (BI), and machine learning use cases, including enterprise data warehouses. In recent years, the term “data lakehouse” was coined to describe this architectural pattern of tabular analytics over data in the data lake.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Setting up and Getting Started with Cloudera’s New SQL AI Assistant

Cloudera

As described in our recent blog post , an SQL AI Assistant has been integrated into Hue with the capability to leverage the power of large language models (LLMs) for a number of SQL tasks. This is a real game-changer for data analysts on all levels and will make SQL development faster, easier, and less error-prone.

SQL 71
article thumbnail

AI at Scale isn’t Magic, it’s Data – Hybrid Data

Cloudera

A recent VentureBeat article , “4 AI trends: It’s all about scale in 2022 (so far),” highlighted the importance of scalability. But it isn’t just aggregating data for models. Data needs to be prepared and analyzed. Different data types need different types of analytics – real-time, streaming, operational, data warehouses.

article thumbnail

Data Engineering Weekly #105

Data Engineering Weekly

The author narrates the data contract, why it is the foundation for building data mesh, and how it glue data catalogs, glossary and data quality systems. link] Tomasz Tunguz: 9 Predictions for data in 2023 It is almost the end of 2022, and it's time for 2023 predictions.

article thumbnail

Surrogate keys in dbt: Integers or hashes?

dbt Developer Hub

Those who have been building data warehouses for a long time have undoubtedly encountered the challenge of building surrogate keys on their data models. This was because storage was expensive — it was more efficient to store data as few times as possible, and rely on joins to connect data tog ether when a report required it.

article thumbnail

Demystifying Modern Data Platforms

Cloudera

July brings summer vacations, holiday gatherings, and for the first time in two years, the return of the Massachusetts Institute of Technology (MIT) Chief Data Officer symposium as an in-person event. A key area of focus for the symposium this year was the design and deployment of modern data platforms. What is a data fabric?