Brief History of Data Engineering

Jesse Anderson

Doug Cutting took those papers and created Apache Hadoop in 2005. They were the first companies to commercialize open source big data technologies and pushed the marketing and commercialization of Hadoop. Hadoop was hard to program, and Apache Hive came along in 2010 to add SQL. We lacked a scalable pub/sub system.

Evolution of the Cloud Data Platform: From Google to Ascend

Ascend.io

Back in 2004, I got to work with MapReduce at Google years before Apache Hadoop was even released, using it on a nearly daily basis to analyze user activity on web search and analyze the efficacy of user experiments. So in this piece, I’ll give my take on the evolution of the cloud data platform, starting way back from my days at Google.

Cloud 52

Apache Hadoop turns 10: The Rise and Glory of Hadoop

ProjectPro

It is difficult to believe that the first Hadoop cluster was put into production at Yahoo 10 years ago, on January 28th, 2006. Ten years ago, nobody was aware that an open source technology like Apache Hadoop would spark a revolution in the world of big data. Happy Birthday Hadoop With more than 1.7

Hadoop 40

Big Data Timeline- Series of Big Data Evolution

ProjectPro

"Big data is at the foundation of all of the megatrends that are happening today, from social to mobile to the cloud to gaming." - Atul Butte, Stanford. With the big data hype all around, it is the fuel of the 21st century that is driving all that we do. [...]" - said Chris Lynch, the ex-CEO of Vertica.

Telecom Network Analytics: Transformation, Innovation, Automation

Cloudera

One of the most substantial big data workloads over the past fifteen years has been in the domain of telecom network analytics. The Dawn of Telco Big Data: 2007-2012. Suddenly, it was possible to build a data model of the network and create both a historical and predictive view of its behaviour.

Top 8 Data Engineering Books [Beginners to Advanced]

Knowledge Hut

Acquire first-hand experience in learning Python packages for data processing and analysis. Big Data: Principles and Best Practices of Scalable Realtime Data Systems is an excellent resource for anyone who wants to learn the fundamentals of working with big data.