article thumbnail

The fancy data stack—batch version

Christophe Blefari

FAQ and remarks Why do you use Google Cloud? My opinion on the matter is this: all clouds are born equal, you just have to find the one you're most comfortable with, or suffer your company's choices. this list can become infinite) Conclusion After this design exercice I have mix feeling.

article thumbnail

The Alooma Data Pipeline With CTO Yair Weinberger - Episode 33

Data Engineering Podcast

Links Alooma Convert Media Data Integration ESB (Enterprise Service Bus) Tibco Mulesoft ETL (Extract, Transform, Load) Informatica Microsoft SSIS OLAP Cube S3 Azure Cloud Storage Snowflake DB Redshift BigQuery Salesforce Hubspot Zendesk Spark The Log: What every software engineer should know about real-time data’s unifying abstraction by Jay (..)

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

15+ Best Data Engineering Tools to Explore in 2023

Knowledge Hut

These tools include both open-source and commercial options, as well as offerings from major cloud providers like AWS, Azure, and Google Cloud. MongoDB MongoDB is a NoSQL document-oriented database that is widely used by data engineers for building scalable and flexible data-driven applications.

article thumbnail

Unstructured Data: Examples, Tools, Techniques, and Best Practices

AltexSoft

Semi-structured data is typically stored in NoSQL databases, such as MongoDB, Cassandra, and Couchbase, following hierarchical or graph data models. There are several widely used unstructured data storage solutions such as data lakes (e.g., Amazon S3, Google Cloud Storage, Microsoft Azure Blob Storage), NoSQL databases (e.g.,

article thumbnail

Jobprofil des Data Engineers

Data Science Blog: Data Engineering

Dazu gesellen sich Datenbanken wie der PostgreSQL, Maria DB oder Microsoft SQL Server sowie CosmosDB oder einfachere Cloud-Speicher wie der Microsoft Blobstorage, Amazon S3 oder Google Cloud Storage. Beispiele für verbreitete NoSQL-Datenbanken sind MongoDB, CouchDB, Cassandra oder Neo4J.

article thumbnail

The Good and the Bad of Hadoop Big Data Framework

AltexSoft

It lets you run MapReduce and Spark jobs on data kept in Google Cloud Storage (instead of HDFS); or. Oracle Big Data Service , offering customers a fully-managed Hadoop environment in the cloud. MongoDB: an NoSQL database with additional features. Google Cloud Platform: a relative of Apache Hadoop.

Hadoop 59
article thumbnail

50 Cloud Computing Interview Questions and Answers for 2023

ProjectPro

50 Cloud Computing Interview Questions and Answers f0r 2023 Knowing how to answer the most commonly asked cloud computing questions can increase your chances of landing your dream cloud computing job roles. What are some popular use cases for cloud computing? These instances use their local storage to store data.