Data News — Week 24.02

Christophe Blefari

Back to the usual Data News—with a little delay, I'm sorry. It's a subject close to my heart and I was very happy to share it with you, because I never thought that Data News would become such a big part of my life. I actually cover data engineering and how to put data stuff into production.

Type-safe data processing pipelines

Tweag

Computing is all about transforming data. A wide variety of domains, such as multimedia, securities trading or compilers, allow decomposing the corresponding transformations into a sequence of well-defined steps. How can we express these transformations to avoid missing any necessary steps? … (a → b) → [b] becomes ∀a. ∀b. …
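To make the idea of typed, composable steps concrete, here is a minimal sketch in Python. It is not the approach from the Tweag article, only an analogue under stated assumptions: the `Step` class, its `then` method, and the three example steps are all hypothetical names invented for illustration, and the type safety comes from a static checker such as mypy rather than from the runtime.

```python
from dataclasses import dataclass
from typing import Callable, Generic, List, TypeVar

A = TypeVar("A")
B = TypeVar("B")
C = TypeVar("C")


@dataclass
class Step(Generic[A, B]):
    """A named transformation from a value of type A to a value of type B."""

    name: str
    run: Callable[[A], B]

    def then(self, nxt: "Step[B, C]") -> "Step[A, C]":
        # A static type checker accepts this composition only when the
        # output type of this step matches the input type of the next one.
        return Step(f"{self.name} >> {nxt.name}", lambda x: nxt.run(self.run(x)))


# Hypothetical steps, for illustration only.
parse: Step[str, List[str]] = Step("parse", lambda line: line.split(","))
to_ints: Step[List[str], List[int]] = Step("to_ints", lambda xs: [int(x) for x in xs])
total: Step[List[int], int] = Step("total", sum)

# Steps chain in order; skipping `to_ints` would be flagged by the type
# checker, because `total` expects List[int] and `parse` yields List[str].
pipeline = parse.then(to_ints).then(total)

if __name__ == "__main__":
    print(pipeline.name)
    print(pipeline.run("1,2,3"))
```

Writing `parse.then(total)` would still run until `sum` fails on strings, but a type checker reports the mismatch before execution, which is the kind of "missing step" error the excerpt asks about.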

Just Launched: Data Products

Monte Carlo

At what level does it make sense to deploy your data monitoring coverage? For most traditional data quality tools, the answer has been by table. Data teams are onboarding datasets quickly, and with the power of modern platforms that process has only been getting faster and volumes only getting bigger.

How to get started with dbt

Christophe Blefari

dbt Core is an open-source framework that helps you organise data warehouse SQL transformations. dbt was born out of the analysis that more and more companies were switching from on-premise Hadoop data infrastructure to cloud data warehouses. This switch has been led by the modern data stack vision. Enter the ELT.

Transforming Delimited String Columns into Rows with Snowflake

RandomTrees

In the realm of data manipulation and analytics, one common challenge is dealing with data stored in delimited string format within a single column. This format makes analysis and querying difficult because the individual values are not readily accessible or organized. Let’s see how data is loaded to the table.
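The transformation described above, one delimited string becoming one row per value, can be sketched outside the warehouse as well. The following is a plain Python illustration, not the Snowflake SQL used in the article (Snowflake offers table functions such as `SPLIT_TO_TABLE` and `FLATTEN` for this purpose). The function name `explode_delimited` and the sample `orders` data are assumptions made up for the example.

```python
from typing import Any, Dict, Iterable, Iterator


def explode_delimited(
    rows: Iterable[Dict[str, Any]], column: str, sep: str = ","
) -> Iterator[Dict[str, Any]]:
    """Yield one output row per delimited value found in `column`."""
    for row in rows:
        for part in row[column].split(sep):
            value = part.strip()
            # Skip empty fragments produced by trailing or doubled separators.
            if value:
                # Copy the other columns and replace the delimited one.
                yield {**row, column: value}


# Hypothetical sample data, for illustration only.
orders = [
    {"order_id": 1, "items": "apple, banana,cherry"},
    {"order_id": 2, "items": "kiwi"},
]

exploded = list(explode_delimited(orders, "items"))

if __name__ == "__main__":
    for row in exploded:
        print(row)
```

Each output row keeps the original `order_id`, so the exploded rows can still be joined back to, or grouped by, the source record.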

Introducing Vector Search on Rockset: How to run semantic search with OpenAI and Rockset

Rockset

Organizations have continued to accumulate large quantities of unstructured data, ranging from text documents to multimedia content to machine and sensor data. Comprehending and understanding how to leverage unstructured data has remained challenging and costly, requiring technical depth and domain expertise.

Data Trends 2024: Strategies for an AI-Ready Data Foundation

Snowflake

A company’s data strategy is always in motion. Some emerging approaches may be seen in our newly released Snowflake Data Trends 2024, looking at how users in the Data Cloud are working with their data. For the report, we focused on two broad aspects of data strategy. The most marked finding was around governance.