Data Engineering Annotated Monthly – April 2022

Big Data Tools

It’s true that there is a scheduler for data engineering for k8s – YuniKorn – but some would prefer to run Flink ad hoc, and that requires these tools to implement the k8s operator. That wraps up April’s Data Engineering Annotated. Follow JetBrains Big Data Tools on Twitter and subscribe to our blog for more news!

Data Engineering Annotated Monthly – October 2021

Big Data Tools

The Future of the Data Engineer – As the author describes it: “Is the data engineer still the ‘worst seat at the table’? Thoughts on the past, present, and future of tooling, processes, and culture in our industry.” That wraps up October’s Data Engineering Annotated. What else can I even add?

Data Engineering Annotated Monthly – November 2021

Big Data Tools

Here’s what’s happening in the world of data engineering right now. Apache Arrow 6.0.1 – Apache Arrow presents itself as a cross-language development platform for in-memory analytics. Of course, you probably already know that if you’re doing data engineering in Python or, for example, Go – because the 6.0

Top 20 Azure Data Engineering Projects in 2023 [Source Code]

Knowledge Hut

Data Aggregation: Working with a sample of big data allows you to investigate real-time data processing, big data project design, and data flow. Learn how to aggregate real-time data using several big data tools like Kafka, ZooKeeper, Spark, HBase, and Hadoop.