Remove 2005 Remove Hadoop Remove Java Remove NoSQL
article thumbnail

20 Best Open Source Big Data Projects to Contribute on GitHub

ProjectPro

It even allows you to build a program that defines the data pipeline using open-source Beam SDKs (Software Development Kits) in any three programming languages: Java, Python, and Go. Apache Spark is also quite versatile, and it can run on a standalone cluster mode or Hadoop YARN , EC2, Mesos, Kubernetes, etc.