Clientele ➞

Advanced Hive and PiG Training


Duration: 2 Days


Gartner predicts that 4.4 Million Jobs will be created globally to support BigData. BigData is a popular term used to describe the exponential growth, availability and use of information, both structured and unstructured. It is imperative that organizations and IT leaders focus on the ever-increasing volume, variety and velocity of information that forms BigData. Hadoop is the core platform for structuring BigData, and solves the problem of making it useful for Analytics.This course is recommended as an advancement to Hadoop developers who are familiar with Hive and Pig. The participants will deep dive into high level concepts of Hive and Pig to enhance their applications. Hive and Pig are major key components of Hadoop. Both are focused to mine and analyze huge amounts of data and are helpful for the developers who are not well-versed with the MapReduce framework for writing data queries. Hive is a Data Warehousing package constructed on top of Hadoop for analyzing huge amounts of Data. It is mainly developed for user who are comfortable in using SQL and it conceptualizes the complexity of Hadoop because the users need mot write MapReduce programs. Pig is a high level data flow system that renders you a simple language platform that can be used for manipulating data and queries. It follows a multi query approach to cut down the number of times the data is scanned.

Why learn about Processing BigData with Hadoop?

  • Businesses are now aware of the large volumes of data that they generate in their day to day transactions. They have also realized that this BigData can provide very valuable insights once analyzed
  • The massive volume of BigData and its unstructured format make it difficult to analyze BigData. Hadoop brings the ability to cheaply process large amounts of data, regardless of structure.
  • If you are an IT professional who wants to stay up to date with the current buzzword then this is the course for you.
  • Knowledge about processing BigData with Hadoop will also prove to be a huge Resume builder for Students who will be trying for Placements soon.
  • If you are a developer who is uncertain about how Hadoop works, this course will clear things up and save you lot of time and effort
  • If you are business that is planning to shift to Hadoop, then this is the right course for your employees to get trained.
  • Processing BigData with Hadoop will prove to be an answer to many questions at once.
  • The session will be handled by very experienced trainers who not only have immense knowledge but are also loaded with valuable experience


  • This training is directed towards Hadoop developers who are familiar with the notion of Pig and Hive and want to develop advanced skill set on the same.

Who should attend

  • Hadoop developers who wants to advance their skill-set on Hive and Pig


  • Cloudthat Hadoop Developer Course

Course Outline

Apache Hive

Revisiting Hive Concepts

  • HIVE Architecture
  • The Warehouse
  • HQL – Querying Hive
  • Views
  • Indexes
  • Data Types
  • Table Partitioning

Hive – Next Level

  • Bucketing
  • Joins
  • Distributed Cache
  • UDF
  • Streaming and Transformations
  • Analytics Function
  • Ranking Function

Additional Hive Topics

  • Hive Thrift Service
  • Unit Testing in Hive
  • HiveStorageHandler
  • Hive Security
  • HCatalog Introduction

Apache Pig

Revisiting Pig Concepts

  • Grunt Shell
  • Pig Data Model
  • Pag Latin Basics

Advanced Pig

  • UDF
  • Streaming
  • Writing Pig Scripts
  • Testing with PigUnit

About The Trainer

Dr. Yash Mody

Hadoop, Big Data Solution Specialist,Adobe AEM Architect

Dr. Yasyash-modyh Mody, PhD, has developed and architected several enterprise applications using platforms like Hadoop, Oracle ADF, SalesForce, IBM Websphere, Quartz, SAP, Adobe AEM, Adobe LiveCycle, Apache Flex, TIBCO etc. At CloudThat, he works with our customers like PwC, Fidelity, Western Union, GE, HP, Oracle, Mahindra Bristlecone, Flipkart, Aditi, Sonata etc. to help them understand various big data technologies and design solutions for modern usecases like social media analytics, web analytics etc.

Over the years, Yash has trained over 1500 developers and architects from over 30 organisations.

View LinkedIn Profile

Other Details



For latest batch dates, fees, location and general inquiries, contact our sales team at: +91 8880002200 or email at

For purely technical queries about the course please contact Bhavesh at

Upcoming Batches


Fill out my online form.
Recently Viewed Courses.
  • Advanced Hive and PiG Training

  • Favorite Courses
    No Favourites added yet.

    Our Partners