performing Hive queries Implemented Partitioning, Dynamic Partitioning and Bucketing in HIVE. Developed Hive queries to process the data and generate the data cubes for visualizing. Environment: Hadoop Yarn architecture, MapReduce, HDFS, Hive, Pig, Java, SQL, Cloudera Manager, Sqoop, Oozie, Java (jdk 1.7), Eclipse. Hadoop Developer Comcast - Philadelphia, PA - January 2014 to October 2014 Purpose of the project is to create Enterprise Data Hub so that various business units and use the data from Hadoop…
Words 1560 - Pages 7