CSE MCQs :: Hadoop MCQs :: Hadoop Pig

  1. Which of the following functions is used to read data in Pig?
     A. WRITE
     B. READ
     C. LOAD
     D. None of the mentioned
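
     For context, a minimal Pig Latin sketch of reading data with LOAD; the file name, delimiter, and schema below are illustrative assumptions, not part of the question:

        -- load a comma-delimited file into a relation with an explicit schema
        -- ('students.txt' and its fields are hypothetical examples)
        A = LOAD 'students.txt' USING PigStorage(',') AS (name:chararray, age:int, gpa:float);
        DUMP A;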

  2. You can run Pig in interactive mode using the ______ shell.
     A. Grunt
     B. FS
     C. HDFS
     D. None of the mentioned
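
     As a sketch, the interactive shell is started with the pig command and accepts Pig Latin statements one at a time; the local-mode flag and the statements below are illustrative:

        $ pig -x local
        grunt> A = LOAD 'students.txt' USING PigStorage(',') AS (name:chararray, age:int);
        grunt> DUMP A;
        grunt> quit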

  3. __________ is a framework for collecting and storing script-level statistics for Pig Latin.
     A. Pig Stats
     B. PStatistics
     C. Pig Statistics
     D. None of the mentioned

  4. The ________ class mimics the behavior of the Main class but gives users a statistics object back.
     A. PigRun
     B. PigRunner
     C. RunnerPig
     D. None of the mentioned

  5. ___________ returns a list of HDFS files to ship to the distributed cache.
     A. relativeToAbsolutePath()
     B. setUdfContextSignature()
     C. getCacheFiles()
     D. getShipFiles()

  6. The loader should use the ______ method to communicate the load information to the underlying InputFormat.
     A. relativeToAbsolutePath()
     B. setUdfContextSignature()
     C. getCacheFiles()
     D. setLocation()

  7. Which of the following commands can be used for debugging?
     A. exec
     B. execute
     C. error
     D. throw
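
     For reference, a sketch of debugging from the Grunt shell; the script name debug_script.pig is a placeholder:

        grunt> exec debug_script.pig
        grunt> run debug_script.pig

     exec runs the script in a separate batch context, so aliases defined inside it stay isolated, while run executes it in the current session; describe, explain, illustrate, and dump are the usual companions for inspecting intermediate relations.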

  8. Which of the following files contains user-defined functions (UDFs)?
     A. script2-local.pig
     B. pig.jar
     C. tutorial.jar
     D. excite.log.bz2
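
     As a hedged sketch of how UDFs packaged in a jar are used from Pig Latin; the paths and the fully qualified class name follow the layout of the Pig tutorial and should be treated as assumptions:

        -- register the jar so its UDFs are visible, then invoke one by its full class name
        REGISTER ./tutorial.jar;
        raw   = LOAD 'excite-small.log' USING PigStorage('\t') AS (user:chararray, time:chararray, query:chararray);
        clean = FOREACH raw GENERATE user, time, org.apache.pig.tutorial.ToLower(query) AS query;
        DUMP clean;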

  9. Which of the following scripts is used to check for scripts that have failed jobs?
     A.
        a = load '/mapred/history/done' using HadoopJobHistoryLoader() as (j:map[], m:map[], r:map[]);
        b = foreach a generate (Chararray) j#'STATUS' as status, j#'PIG_SCRIPT_ID' as id, j#'USER' as user, j#'JOBNAME' as script_name, j#'JOBID' as job;
        c = filter b by status != 'SUCCESS';
        dump c;
     B.
        a = load '/mapred/history/done' using HadoopJobHistoryLoader() as (j:map[], m:map[], r:map[]);
        b = foreach a generate j#'PIG_SCRIPT_ID' as id, j#'USER' as user, j#'JOBNAME' as script_name, (Long) r#'NUMBER_REDUCES' as reduces;
        c = group b by (id, user, script_name) parallel 10;
        d = foreach c generate group.user, group.script_name, MAX(b.reduces) as max_reduces;
        e = filter d by max_reduces == 1;
        dump e;
     C.
        a = load '/mapred/history/done' using HadoopJobHistoryLoader() as (j:map[], m:map[], r:map[]);
        b = foreach a generate j#'PIG_SCRIPT_ID' as id, j#'USER' as user, j#'QUEUE_NAME' as queue;
        c = group b by (id, user, queue) parallel 10;
        d = foreach c generate group.user, group.queue, COUNT(b);
        dump d;
     D. None of the mentioned

  10. Which of the following code samples is used to find scripts that use only the default parallelism?
     A.
        a = load '/mapred/history/done' using HadoopJobHistoryLoader() as (j:map[], m:map[], r:map[]);
        b = foreach a generate (Chararray) j#'STATUS' as status, j#'PIG_SCRIPT_ID' as id, j#'USER' as user, j#'JOBNAME' as script_name, j#'JOBID' as job;
        c = filter b by status != 'SUCCESS';
        dump c;
     B.
        a = load '/mapred/history/done' using HadoopJobHistoryLoader() as (j:map[], m:map[], r:map[]);
        b = foreach a generate j#'PIG_SCRIPT_ID' as id, j#'USER' as user, j#'JOBNAME' as script_name, (Long) r#'NUMBER_REDUCES' as reduces;
        c = group b by (id, user, script_name) parallel 10;
        d = foreach c generate group.user, group.script_name, MAX(b.reduces) as max_reduces;
        e = filter d by max_reduces == 1;
        dump e;
     C.
        a = load '/mapred/history/done' using HadoopJobHistoryLoader() as (j:map[], m:map[], r:map[]);
        b = foreach a generate j#'PIG_SCRIPT_ID' as id, j#'USER' as user, j#'QUEUE_NAME' as queue;
        c = group b by (id, user, queue) parallel 10;
        d = foreach c generate group.user, group.queue, COUNT(b);
        dump d;
     D. None of the mentioned