TimescaleDB: Fast And Scalable Timeseries with Ajay Kulkarni and Mike Freedman - Episode 18

Data Engineering Podcast

What impact has the 10.0 release of PostgreSQL had on the design of the project? Is Timescale compatible with systems such as Amazon RDS or Google Cloud SQL?

15+ Must Have Data Engineer Skills in 2023

Knowledge Hut

Kafka: Kafka is one of the most in-demand open-source messaging and streaming systems; it lets you publish, distribute, and consume data streams. Written in Scala and Java, Kafka helps you scale performance in today’s data-driven enterprises.

Python for Data Engineering

Ascend.io

Python for Data Engineering Versus SQL, Java, and Scala: When diving into the domain of data engineering, understanding the strengths and weaknesses of your chosen programming language is essential. Statically typed, requiring type definition upfront.

Why Mutability Is Essential for Real-Time Data Analytics

Rockset

A platform such as Apache Kafka/Confluent, Spark, or Amazon Kinesis for publishing that stream of event data. Traditionally, this information would be stored in transactional databases (Oracle Database, MySQL, PostgreSQL, etc.) because they allow for mutability: any field stored in these transactional databases is updatable.
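As a minimal sketch of what "any field is updatable" means in practice, the snippet below updates a single field of an existing row in place. It uses Python's built-in SQLite module purely as a stand-in for the transactional databases named above; the `orders` table and its columns are hypothetical and do not come from the article.

```python
import sqlite3

# In-memory SQLite database, used here only as a stand-in for a
# transactional database such as Oracle Database, MySQL, or PostgreSQL.
conn = sqlite3.connect(":memory:")

# Hypothetical table for illustration.
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT)")
conn.execute("INSERT INTO orders (id, status) VALUES (1, 'placed')")

# Mutability: the existing row is changed in place rather than
# appended as a new, separate record.
conn.execute("UPDATE orders SET status = 'shipped' WHERE id = 1")
conn.commit()

status = conn.execute("SELECT status FROM orders WHERE id = 1").fetchone()[0]
row_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
print(status, row_count)
```

After the update there is still exactly one row for the order, now carrying the new status.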

Real-Time Data Transformations with dbt + Rockset

Rockset

Using the adapter, you could now load data into Rockset and create collections by writing SQL SELECT statements in dbt. For instance, let’s say you have streaming data coming in from Kafka or Kinesis. ... (e.g. S3 or GCS), NoSQL databases (e.g. DynamoDB or MongoDB), and relational databases (e.g. PostgreSQL or MySQL).

Analytics on DynamoDB: Comparing Elasticsearch, Athena and Spark

Rockset

There is limited support for SQL analytics with some of these options. At Rockset, we recently added support for creating collections that pull data from Amazon DynamoDB - which basically means you can run fast SQL on DynamoDB tables without any ETL. DynamoDB, being a NoSQL store, imposes no fixed schema on the documents stored.
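To illustrate what "no fixed schema" implies for analytics, here is a small sketch in plain Python. It does not use the DynamoDB or Rockset APIs; the documents, field names, and the `count_by` helper are all hypothetical. The point is only that an aggregation over schemaless documents has to tolerate fields that are absent from some records.

```python
# Hypothetical documents, as a schemaless store might hold them:
# each record carries its own set of fields.
documents = [
    {"user_id": "u1", "country": "US", "plan": "pro"},
    {"user_id": "u2", "country": "DE"},
    {"user_id": "u3", "plan": "free", "referrer": "ad"},
]


def count_by(docs, field):
    """Count documents per value of `field`; a missing field counts as None."""
    counts = {}
    for doc in docs:
        key = doc.get(field)  # None when the document lacks the field
        counts[key] = counts.get(key, 0) + 1
    return counts


plan_counts = count_by(documents, "plan")
print(plan_counts)
```

A SQL layer over such data faces the same question: it typically has to surface missing fields as nulls rather than reject the record.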

Data Engineering Glossary

Silectis

Kafka: Apache Kafka is the Apache Software Foundation’s open-source software platform for streaming.
NoSQL: A non-relational database.
Open Source: Software that is available to freely use and modify.
Parquet: A column-oriented data storage format that’s part of the Hadoop ecosystem.
HDFS: Stands for Hadoop Distributed File System.