Creating a Data Pipeline with Spark, Google Cloud Storage and Big Query
Towards Data Science
MARCH 6, 2023
Many open-source data tools have been developed in the last decade, such as Spark, Hadoop, and Kafka, not to mention all the tooling available in Python libraries. Google Cloud Storage (GCS) is Google’s blob storage service. Of course, you’ll need to create a Google Cloud Platform account to use it.