article thumbnail

Useful classes for data engineers - Scala & Java

Waitingforcode

In this blog post I'll share with you a list of Java and Scala classes I use almost every time in data engineering projects. The part for Python will follow next week! We all have our habits and as programmers, libraries and frameworks are definitely a part of the group.

Scala 130
article thumbnail

Unity Catalog Lakeguard: Industry-first and only data governance for multi-user Apache™ Spark clusters

databricks

Run SQL, Python & Scala workloads with full data governance & cost-efficient multi-user compute. Unlock the power of Apache Spark™ with Unity Catalog Lakeguard on Databricks Data Intelligence Platform.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Python for Data Engineering

Ascend.io

As the demand to efficiently collect, process, and store data increases, data engineers have started to rely on Python to meet this escalating demand. In this article, our primary focus will be to unpack the reasons behind Python’s prominence in the data engineering domain. Why Python for Data Engineering?

article thumbnail

Making applyInPandasWithState less painful

Waitingforcode

However, due to Python duck typing, some operations are more difficult and more risky to express in the code than in the strongly typed Scala API. Do not get the title wrong! Having applyInPandasWithState in the PySpark API is huge!

Scala 147
article thumbnail

A Backtracking Sudoku Solver in Scala

Rock the JVM

This article is for Scala beginners. After you learn the language, the next big thing you need to master is how to write essential “algorithms” in Scala. This tends to make algorithms in Scala quite difficult. This article works identically for Scala 2 and Scala 3. All you need is recursion. map ( row => row.

Scala 40
article thumbnail

Scala For Big Data Engineering – Why should you care?

Advancing Analytics: Data Engineering

The thought of learning Scala fills many with fear, its very name often causes feelings of terror. The truth is Scala can be used for many things; from a simple web application to complex ML (Machine Learning). The name Scala stands for “scalable language.” So what companies are actually using Scala?

Scala 52
article thumbnail

The Dog Days of PySpark

Confessions of a Data Guy

PySpark is the abstraction that lets a bazillion Data Engineers forget about that blight Scala and cuddle their wonderfully soft and ever-kind Python code, while choking down gobs of data like some Harkonnen glutton. One of those things to hate and love, well … kinda hard not to love.

Scala 130