Data Engineering Annotated Monthly – September 2021

Big Data Tools

Kafka 3.0.0 – The Apache Software Foundation needed less than one month to go from Kafka version 3.0.0-rc0 …

PostgreSQL 14 – Sometimes I forget, but traditional relational databases play a big role in the lives of data engineers. And of course, PostgreSQL is one of the most popular databases.

15+ Must Have Data Engineer Skills in 2023

Knowledge Hut

Kafka: Kafka is one of the most sought-after open-source messaging and streaming systems. It lets you publish, distribute, and consume data streams. Written in Scala and Java, Kafka helps you scale performance in today’s data-driven and disruptive enterprises.

Data Engineering Glossary

Silectis

Kafka: Apache Kafka is the Apache Software Foundation’s open-source software platform for streaming.

MySQL: An open-source relational database management system with a client-server model.

PostgreSQL: A free, open-source relational database management system, also known as Postgres.

Top 20+ Big Data Certifications and Courses in 2023

Knowledge Hut

Big Data Frameworks: Familiarity with popular Big Data frameworks used for data processing, such as Hadoop, Apache Spark, Apache Flink, or Kafka.

Database Management: Knowing how to work with databases, both relational (like Postgres) and non-relational, is important for efficient storage and retrieval of data.

Real-Time Data Transformations with dbt + Rockset

Rockset

For instance, let’s say you have streaming data coming in from Kafka or Kinesis. … S3 or GCS), NoSQL databases (e.g. DynamoDB or MongoDB), and relational databases (e.g. PostgreSQL or MySQL). For high-velocity data, most commonly coming from data streams, you can roll it up at write-time.
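To make the idea of rolling up data at write-time concrete, here is a minimal sketch in plain Python. It is not Rockset's or dbt's API. The function name `roll_up`, the event fields (`key`, `ts`, `value`), and the 60-second bucket size are all assumptions chosen for illustration. The sketch collapses raw events into one aggregate row per key and time bucket, so only the aggregates need to be stored.

```python
from collections import defaultdict


def roll_up(events, bucket_seconds=60):
    """Aggregate raw events into a count and sum per (key, time bucket).

    Hypothetical helper: each event is assumed to be a dict with
    "key" (str), "ts" (epoch seconds), and "value" (number).
    """
    buckets = defaultdict(lambda: {"count": 0, "total": 0.0})
    for event in events:
        ts = int(event["ts"])
        # Align the timestamp to the start of its bucket.
        bucket_start = ts - ts % bucket_seconds
        agg = buckets[(event["key"], bucket_start)]
        agg["count"] += 1
        agg["total"] += event["value"]
    return dict(buckets)


# Illustrative input: four raw events collapse into three aggregate rows.
events = [
    {"key": "sensor-a", "ts": 0, "value": 1.0},
    {"key": "sensor-a", "ts": 30, "value": 2.0},
    {"key": "sensor-a", "ts": 61, "value": 4.0},
    {"key": "sensor-b", "ts": 10, "value": 5.0},
]
rolled = roll_up(events)
```

The trade-off is that individual events are no longer available after the roll-up. Only the per-bucket count and total survive, which is usually acceptable for high-velocity metrics.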

Analytics on DynamoDB: Comparing Elasticsearch, Athena and Spark

Rockset

DynamoDB has been one of the most popular NoSQL databases in the cloud since its introduction in 2012. As opposed to a traditional RDBMS like PostgreSQL, DynamoDB scales horizontally, obviating the need for careful capacity planning, resharding, and database maintenance.