Remove tags
article thumbnail

How to get started with dbt

Christophe Blefari

dbt Core is an open-source framework that helps you organise data warehouse SQL transformation. dbt was born out of the analysis that more and more companies were switching from on-premise Hadoop data infrastructure to cloud data warehouses. This switch has been lead by modern data stack vision.

article thumbnail

A new era of SQL-development, fueled by a modern data warehouse

Cloudera

These trends and demands lead to stress for existing data warehouse solutions – scale, efficiency, security integrations, IT budgets, ease of access. Cloudera recently launched Cloudera Data Warehouse, a modern data warehousing solution. It has been estimated to shorten the discovery phase by hours.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

A New Horizon for Data Reliability With Monte Carlo and Snowflake

Monte Carlo

Snowflake object tagging enables data teams to easily organize and classify their data objects with helpful context like ownership and priority level. And with Monte Carlo’s deep Snowflake integration, those object tags can also be automatically imported and applied to your Monte Carlo Access View as well. The other half?

article thumbnail

From Big Data to Better Data: Ensuring Data Quality with Verity

Lyft Engineering

In this post we will define data quality at a high-level and explore our motivation to achieve better data quality. We will then introduce our in-house product, Verity, and showcase how it serves as a central platform for ensuring data quality in our Hive Data Warehouse. What and Where is Data Quality?

article thumbnail

Data Lake Explained: A Comprehensive Guide to Its Architecture and Use Cases

AltexSoft

Instead of relying on traditional hierarchical structures and predefined schemas, as in the case of data warehouses, a data lake utilizes a flat architecture. This structure is made efficient by data engineering practices that include object storage. Data warehouse vs. data lake in a nutshell.

article thumbnail

Extreme data center pressure? Burst to the cloud with CDP!

Cloudera

Your sunk costs are minimal and if a workload or project you are supporting becomes irrelevant, you can quickly spin down your cloud data warehouses and not be “stuck” with unused infrastructure. Cloud deployments for suitable workloads gives you the agility to keep pace with rapidly changing business and data needs.

Cloud 101
article thumbnail

Upgrade your Modern Data Stack

Christophe Blefari

When you put the things like this it opens the doors and does not limit the modern data stack to 4 vendors. The central storage can be cloud storage, a warehouse, a real-time system, while the SQL engine can be a data warehouse or a dedicated processing engine. For this you need to do recurrent data cleaning.