Brief History of Data Engineering
Jesse Anderson
DECEMBER 12, 2022
Doug Cutting took those papers and created Apache Hadoop in 2005. Hadoop was hard to program, and Apache Hive came along in 2010 to add SQL. With an immutable file system like HDFS, we needed scalable databases to read and write data randomly. At various times it’s been Java, Scala, and Python.
Let's personalize your content