Seamless Data Analytics Workflow: From Dockerized JupyterLab and MinIO to Insights with Spark SQL
Towards Data Science
DECEMBER 23, 2023
Photo by Ian Taylor on Unsplash This tutorial guides you through an analytics use case, analyzing semi-structured data with Spark SQL. We’ll start with the data engineering process, pulling data from an API and finally loading the transformed data into a data lake (represented by MinIO ).
Let's personalize your content