Data Engineering Glossary

Silectis

Hadoop / HDFS: Apache’s open-source software framework for processing big data.
JSON: JavaScript Object Notation – a data-interchange format for storing and transporting data.
MySQL: An open-source relational database management system with a client-server model.

100+ Data Engineer Interview Questions and Answers for 2023

ProjectPro

Below are some big data interview questions for data engineers based on the fundamental concepts of big data, such as data modeling, data analysis, data migration, data processing architecture, data storage, and big data analytics. What is meant by Aggregate Functions in SQL?

Data Collection for Machine Learning: Steps, Methods, and Best Practices

AltexSoft

Data ingestion means taking data from several sources and moving it to a target system without any transformation. So it can be a part of data integration or a separate process aiming at transporting information in its initial form. Find sources of relevant data. Choose data collection methods and tools.
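As a rough illustration of ingestion "without any transformation", the sketch below copies rows from two sources into one target exactly as they arrive. The `ingest` function, the source names `crm` and `billing`, and the in-memory `landing_zone` list are all hypothetical stand-ins chosen for this example, not part of any tool named in the article.

```python
import csv
import io


def ingest(sources, target):
    """Copy every row from each source into target, leaving values untouched."""
    for name, handle in sources.items():
        for row in csv.reader(handle):
            # Record which source the row came from; the row itself is not
            # parsed, cleaned, or retyped (amounts stay as strings).
            target.append((name, row))
    return target


# Hypothetical sources: in-memory CSV text standing in for real systems.
sources = {
    "crm": io.StringIO("id,name\n1,Ada\n"),
    "billing": io.StringIO("id,amount\n1,9.99\n"),
}

landing_zone = ingest(sources, [])
```

Header rows and string-typed numbers land in the target unchanged; cleaning, typing, and joining would belong to a later integration or transformation step.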