Remove AWS Remove PostgreSQL Remove Scala Remove SQL
article thumbnail

Data News — Week 24.08

Christophe Blefari

JVM vs. SQL data engineer — There's a big discussion in the community about what real data engineering is. Is it Java/Scala or Python? Is it DataFrames or SQL? Still, I prefer SQL/Python data engineering, as you know me. They provide tooling to do without writing awful SQL queries.

Data Lake 130
article thumbnail

Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin

Data Engineering Podcast

Ascend users love its declarative pipelines, powerful SDK, elegant UI, and extensible plug-in architecture, as well as its support for Python, SQL, Scala, and Java. Ascend automates workloads on Snowflake, Databricks, BigQuery, and open source Spark, and can be deployed in AWS, Azure, or GCP. How do you account for assets (e.g.

IT 100
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Optimize Your Machine Learning Development And Serving With The Open Source Vector Database Milvus

Data Engineering Podcast

Ascend users love its declarative pipelines, powerful SDK, elegant UI, and extensible plug-in architecture, as well as its support for Python, SQL, Scala, and Java. Ascend automates workloads on Snowflake, Databricks, BigQuery, and open source Spark, and can be deployed in AWS, Azure, or GCP.

article thumbnail

Keep Your Data And Query It Too Using Chaos Search with Thomas Hazel and Pete Cheslock - Episode 47

Data Engineering Podcast

What are the benefits of implementing the Elasticsearch API on top of your data in S3 as opposed to using systems such as Presto or Drill to interact with the same information via SQL? What mechanisms do you use to allow for such drastic space savings of indexed data in S3 versus in an Elasticsearch cluster?

IT 100
article thumbnail

15+ Must Have Data Engineer Skills in 2023

Knowledge Hut

Java Big Data requires you to be proficient in multiple programming languages, and besides Python and Scala, Java is another popular language that you should be proficient in. Kafka, which is written in Scala and Java, helps you scale your performance in today’s data-driven and disruptive enterprises.

article thumbnail

Top 20+ Big Data Certifications and Courses in 2023

Knowledge Hut

Programming Languages : Good command on programming languages like Python, Java, or Scala is important as it enables you to handle data and derive insights from it. Cloud Computing : Knowledge of cloud platforms like AWS, Azure, or Google Cloud is essential as these are used by many organizations to deploy their big data solutions.

article thumbnail

Integrated Commerce and our Merchant Center rebuild

Zalando Engineering

We’ll also be utilising AWS and RESTful architectural styles that incorporate both backend and frontend technologies. AWS simplifies the deployment of new versions of our services, on top of ensuring scalability and easy maintainability.

Scala 52