Sat.Apr 15, 2023

article thumbnail

Spark SQL checkpoints

Waitingforcode

In my long - but not long enough! - journey with Apache Spark I've met the "checkpointing" world in the context of Structured Streaming mostly. But this term also applies to other modules including Apache Spark SQL, so batch processing!

SQL 130
article thumbnail

The Dog Days of PySpark

Confessions of a Data Guy

PySpark. One of those things to hate and love, well … kinda hard not to love. PySpark is the abstraction that lets a bazillion Data Engineers forget about that blight Scala and cuddle their wonderfully soft and ever-kind Python code, while choking down gobs of data like some Harkonnen glutton. But, that comes with […] The post The Dog Days of PySpark appeared first on Confessions of a Data Guy.

Scala 130