Remove 5-airflow-alternatives-for-data-orchestration
article thumbnail

5 Airflow Alternatives for Data Orchestration

KDnuggets

Top list of open-source tools for building and managing workflows.

Data 122
article thumbnail

Data News — Week 23.14

Christophe Blefari

Data News entering in town ( credits ) Hey you, if I wasn't late in my newsletter writing it wouldn't be me. But here is your usual Data News. This Tuesday we hosted the second part of the Airflow alternatives meetup with Prefect and Dagster. Data modeling Dear readers, I have to confess something.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 13.14

Christophe Blefari

Data News entering in town ( credits ) Hey you, if I wasn't late in my newsletter writing it wouldn't be me. But here is your usual Data News. This Tuesday we hosted the second part of the Airflow alternatives meetup with Prefect and Dagster. Data modeling Dear readers, I have to confess something.

article thumbnail

Supercharge your Airflow Pipelines with the Cloudera Provider Package

Cloudera

Many customers looking at modernizing their pipeline orchestration have turned to Apache Airflow, a flexible and scalable workflow manager for data engineers. Apache Airflow providers are a set of packages allowing services to define operators in their Directed Acyclic Graphs (DAGs) to access external systems.

Python 99
article thumbnail

Building a maintainable and modular LLM application stack with Hamilton

Towards Data Science

Specifically, we’ll cover pulling data from the web, creating text embeddings (vectors) and pushing them to a vector store. The application will receive a small data input (e.g., This data will move through different services (LLM, vector database, document store, etc.) Disclaimer: I’m one of the authors of the Hamilton package.

article thumbnail

Simplify Airflow DAG Creation and Maintenance with Hamilton in 8 minutes

Towards Data Science

How Hamilton can help you write more maintainable Airflow DAGs An abstract representation of how Airflow & Hamilton relate. Airflow helps bring it all together, while Hamilton has make the innards manageable. Just to recap, Airflow is the industry standard to orchestrate data pipelines.

Python 66
article thumbnail

Top Hadoop Projects and Spark Projects for Beginners 2021

ProjectPro

Big data has taken over many aspects of our lives and as it continues to grow and expand, big data is creating the need for better and faster data storage and analysis. These Apache Hadoop projects are mostly into migration, integration, scalability, data analytics, and streaming analysis. Data Migration 2.

Hadoop 52