Sat.Dec 30, 2017 - Fri.Jan 05, 2018

article thumbnail

Recap of Hadoop News for December 2017

ProjectPro

News on Hadoop - December 2017 Apache Impala gets top-level status as open source Hadoop tool.TechTarget.com, December 1, 2017. The massively parallel processing engine born at Cloudera acquired the status of a top-level project within the Apache Foundation. The main objective of Impala is to provide SQL-like interactivity to big data analytics just like other big data tools - Hive, Spark SQL, Drill, HAWQ , Presto and others.

Hadoop 52
article thumbnail

Rock Solid Kafka and ZooKeeper Ops on AWS

Zalando Engineering

Reducing ops effort while maintaining Kafka and Zookeeper This post is targeted to those looking for ways to reduce ops effort while maintaining Kafka and Zookeeper deployments on AWS and also improving their availability and stability. In a nutshell, we are going to explain how using Elastic Network Interfaces can improve over a straight out of the box setup.

Kafka 40
article thumbnail

Staffing your big data team

Cloudera

Building the right team is as important as assembling the right IT infrastructure – and the needs differ just as dramatically. A traditional BI and analytics organization consists of three main groups: Analysts that develop reports often using sample data. The data management team – modelers that take requests, find data, and develop models to answer the questions.