Remove 2005 Remove Data Preparation Remove Data Process Remove Java
article thumbnail

20 Best Open Source Big Data Projects to Contribute on GitHub

ProjectPro

To execute pipelines, beam supports numerous distributed processing back-ends, including Apache Flink, Apache Spark , Apache Samza, Hazelcast Jet, Google Cloud Dataflow, etc. In addition to analytics and data science, RAPIDS focuses on everyday data preparation tasks. Apache CouchDB Source: idroot.us