Apache Pig Interview Questions and Answers

BY
Udemy

Mode

Online

Fees

₹ 1999

Quick Facts

particular details
Medium of instructions English
Mode of learning Self study
Mode of Delivery Video and Text Based

Course and certificate fees

Fees information
₹ 1,999
certificate availability

Yes

certificate providing authority

Udemy

The syllabus

Introduction

  • Introduction

Section 2

  • Scenario Based Question (File modification based)
  • How to remove single quotes from data using Pig?
  • How to compute sum of a field in all the rows from an alias?
  • What is the difference between GROUP and COGROUP in PIG?
  • Is there a way to do this? eg, pass the name of the file to be processed, etc?
  • Is there a way to export the results from Pig directly to a database like mysql?
  • How do you deal with empty or missing input files in Apache Pig?
  • Is there any way to store out to a single CSV file?
  • How can I cast the value, without having to do a FOREACH on all the records?
  • Scenario Based Question

Section 3

  • Scenario Based Question (Date)
  • How to optimize a group by statement in PIG latin?
  • How to handle spill memory in pig? How to fix it?
  • How to do Transpose in corresponding few columns in pig?
  • Find if a string is present inside another string in Pig?
  • Scenario Based Question (Programming)
  • Is there any way to store out to a single CSV file in Apache Pig?
  • Removing duplicates using PigLatin?
  • How to include external jar file using PIG?
  • How to reference columns in a FOREACH after a JOIN?

Section 4

  • How to Load multiple files from a date range (part of the directory structure)?
  • What are the data types of Pig Latin?
  • What are the different ways of executing Pig script?
  • What are the components of Pig Execution Environment?
  • How Pig programming gets converted into MapReduce jobs?
  • What is the difference between logical and physical plans?
  • How can I pass a parameter with space to a pig script?
  • How can I calculate a percentage (partial aggregate / total aggregate)?
  • How do I find out where the data comes from?
  • Is there a way to check if a map is empty?

Section 5

  • Does Pig allow grouping on expressions?
  • Is there a way for me to figure out how many rows exist in a dataset from alias?
  • Is there any difference between `==` and `eq` for numeric comparisons?
  • How do I prevent failure if some records don't have the needed number of columns
  • Does Pig support regular expressions?
  • Can I do a numerical comparison while filtering?
  • How do I make my Pig jobs run on a specified number of reducers?
  • What is the difference between Store and dump commands?
  • How to debug a pig script?
  • What is BloomMapFile used for?

Section 6

  • What are the limitations of the Pig?
  • What is the difference between GROUP and COGROUP operators in Pig?
  • Give some list of relational operators used in Pig?
  • Can we process vast amount of data in local mode? Why?
  • Explain about the different complex data types in Pig?
  • How do I control the number of mappers?
  • How can I load data using Unicode control characters as delimiters?
  • Scenario Based Question (Jars)
  • What is Pig?
  • Differentiate between the logical and physical plan of an Apache Pig script?

Section 7

  • What do you understand by an inner bag and outer bag in Pig?
  • Explain the difference between COUNT_STAR and COUNT functions in Apache Pig?
  • Explain about the scalar datatypes in Apache Pig.
  • Is it possible to join multiple fields in pig scripts?
  • What are the different String functions available in pig?
  • While writing evaluate UDF, which method has to be overridden?
  • Write a word count program in pig?
  • What is a skewed join?
  • How can I pass a specific hadoop configuration parameter to Pig?
  • What is the difference between Pig Latin and HiveQL ?

Section 8

  • Does Pig support multi-line commands?
  • What is the function of UNION and SPLIT operators? Give examples.
  • How to load files with different delimiter each time in piglatin?
  • How to count a number of rows in alias
  • Is it possible to pivot a table in one pass in Apache Pig?
  • Bonus Lecture

Trending Courses

Popular Courses

Popular Platforms

Learn more about the Courses

Download the Careers360 App on your Android phone

Regular exam updates, QnA, Predictors, College Applications & E-books now on your Mobile

Careers360 App
150M+ Students
30,000+ Colleges
500+ Exams
1500+ E-books