Remove Data Lake Remove MongoDB Remove MySQL Remove PostgreSQL
article thumbnail

Data News — Week 24.08

Christophe Blefari

Enabling near real-time data analytics on the data lake — Grab showcasing what they did with Flink and Hudi to enable real-time use-cases. Data Economy 💰 MariaDB takeover at $37m. Neurelo raises $5m seed to provide HTTP APIs on top of databases (PostgreSQL, MongoDB and MySQL).

Data Lake 130
article thumbnail

Optimize Your Machine Learning Development And Serving With The Open Source Vector Database Milvus

Data Engineering Podcast

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode. Sifflet also offers a 2-week free trial.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin

Data Engineering Podcast

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management When you’re ready to build your next pipeline, or want to test out the projects you hear about on the show, you’ll need somewhere to deploy it, so check out our friends at Linode.

IT 100
article thumbnail

Python for Data Engineering

Ascend.io

Use Case: Transforming monthly sales data to weekly averages import dask.dataframe as dd data = dd.read_csv('large_dataset.csv') mean_values = data.groupby('category').mean().compute() compute() Data Storage Python extends its mastery to data storage, boasting smooth integrations with both SQL and NoSQL databases.

article thumbnail

How Rockset Enables SQL-Based Rollups for Streaming Data

Rockset

Apache Kafka has made acquiring real-time data more mainstream, but only a small sliver are turning batch analytics, run nightly, into real-time analytical dashboards with alerts and automatic anomaly detection. The majority are still draining streaming data into a data lake or a warehouse and are doing batch analytics.

SQL 52
article thumbnail

Real-Time Data Transformations with dbt + Rockset

Rockset

This can be helpful when you want to reduce the size of large scale data streams, deduplicate data, or partition your data. Collections can also be created from other data sources including data lakes (e.g. DynamoDB or MongoDB), and relational databases (e.g. PostgreSQL or MySQL).

SQL 52
article thumbnail

The Top 25 Data Engineering Influencers and Content Creators on LinkedIn

Databand.ai

Bob also hosts The Engineering Side of Data podcast , which is dedicated to discussions around data engineering and features a variety of guests from the data engineering space. His specialties include Microsoft SQL Server, Azure Databricks, Azure Data Factory, SQL Server Integration Services (SSIS), and Azure Data Lake.