Data Engineering Annotated Monthly – April 2022

Big Data Tools

Apache Hudi 0.11.0 – This release of the well-known data lake has added many interesting changes. There’s at least one interesting twist that goes like this: “A data pipeline has five stages grouped into three heads.” Corrections in data lakehouse table format comparisons – Quasi-mutable (a.k.a.

Top 20 Azure Data Engineering Projects in 2023 [Source Code]

Knowledge Hut

Azure Data Ingestion Pipeline: Create an Azure Data Factory data ingestion pipeline to extract data from a source (e.g., Azure SQL Database, Azure Data Lake Storage). Data Aggregation: Working with a sample of big data allows you to investigate real-time data processing, big data project design, and data flow.

How to Become an Azure Data Engineer? 2023 Roadmap

Knowledge Hut

To provide end users with a variety of ready-made models, Azure data engineers work with Azure AI services built on top of Azure Cognitive Services APIs. Data engineers must therefore have a thorough understanding of programming languages such as Python, Java, or Scala.

Data Engineering Annotated Monthly – September 2022

Big Data Tools

Here are some great articles and posts that can help inspire us all to learn from the experience of other people, teams, and companies who work in data engineering. That wraps up September’s Data Engineering Annotated. Follow JetBrains Big Data Tools on Twitter and subscribe to our blog for more news!

20 Latest AWS Glue Interview Questions and Answers for 2023

ProjectPro

You can use AWS Glue to discover, transform, and prepare your data for analytics. In addition to databases running on AWS, Glue can automatically find structured and semi-structured data stored in your data lake on Amazon S3, your data warehouse on Amazon Redshift, and other storage locations.
