Trending questions in Big Data Hadoop

0 votes
1 answer

Namenode not getting started

You need to delete the contents of the ...READ MORE

Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
2,241 views
0 votes
1 answer

Visualization Tool in Cloudera CDH

Cloudera CDH comes with a Hadoop ecosystem ...READ MORE

May 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,244 views
0 votes
1 answer

Can we use Apache Mahout without Hadoop dependency?

Yes. Not all of Mahout depends ...READ MORE

May 17, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,151 views
0 votes
1 answer

What is the difference between an Oozie workflow, coordinator and bundle?

Workflow does not have time specifications to ...READ MORE

Mar 29, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
3,299 views
0 votes
1 answer

What is the difference between a zero reducer and an identity reducer in Hadoop MapReduce?

A zero reducer, as the name suggests ...READ MORE

Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
2,736 views
0 votes
1 answer

Differences between Hadoop-common, Hadoop-core and Hadoop-client?

To help provide some additional details regarding ...READ MORE

Mar 29, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
3,286 views
0 votes
1 answer

Hadoop query regarding setJarByClass method of Job class

setJarByClass method basically tells the nodes where ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,478 views
0 votes
1 answer

Sqoop vs Oracle Hadoop Connectors

Most of the connectors would have the ...READ MORE

May 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,027 views
0 votes
1 answer

Can we use HDFS to store git repositories?

Yes, it is possible to use HDFS ...READ MORE

May 7, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,480 views
0 votes
1 answer

What is new in Hadoop 3?

Here are a few changes in Hadoop 3: 1. ...READ MORE

May 28, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
557 views
0 votes
1 answer

Setting the Number of Map tasks and Reduce Tasks not Working

The command that you are running is ...READ MORE

May 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,621 views
0 votes
1 answer

GUI for using Hadoop

Hue is open source and works well ...READ MORE

Apr 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
2,043 views
0 votes
1 answer

Could not find the Sqoop directory in HDP 2.2

You can find the Sqoop lib directory ...READ MORE

May 4, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,572 views
0 votes
1 answer

HDFS Service Systemd - startup

I guess the issue you are facing ...READ MORE

Mar 29, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,115 views
0 votes
1 answer

How to practice programming with Hadoop?

Well there are multiple ways to solve ...READ MORE

Mar 30, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
3,103 views
0 votes
1 answer

Which data structure is used in the case of MapReduce?

In case of Hadoop, HDFS is used ...READ MORE

May 4, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,535 views
0 votes
1 answer

Cannot connect to Hive from MicroStrategy BI tool

The problem is with your Hive authentication. ...READ MORE

May 8, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,347 views
0 votes
1 answer

Setting Hive/Hadoop property using Hive Query

You can set Hadoop & Hive conf ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,156 views
0 votes
1 answer

ApplicationMaster in YARN Architecture

Let me help you with your confusion: YARN ...READ MORE

May 8, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,329 views
0 votes
1 answer

What does the count command in Hadoop tell?

The output of the count command is: ...READ MORE

Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,941 views
0 votes
1 answer

How do Sqoop and the Teradata Connector for Hadoop differ from each other?

In order to make use of TD ...READ MORE

May 2, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,535 views
0 votes
1 answer

How to set the number of Map & Reduce tasks?

The map tasks created for a job ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,086 views
0 votes
1 answer

How do I connect my Spark based HDInsight cluster to my blob storage?

Go through this blog: https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-use-blob-storage#access-blobs I went through this ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,172 views
0 votes
1 answer

How to transfer data from Netezza to HDFS using Apache Sqoop?

Remove the --direct option. It gives issue ...READ MORE

Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,836 views
0 votes
1 answer

How to create a FileSystem object that can be used for reading from and writing to HDFS?

Read operation on HDFS: In order to read ...READ MORE

Mar 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points

edited Mar 22, 2018 by nitinrawat895
3,245 views
0 votes
1 answer

Getting error in MapReduce job.setInputFormatClass

In the old Hadoop API (i.e. below Hadoop 0.20.0), ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,144 views
0 votes
1 answer

How to install Hadoop in Ubuntu?

Follow the steps as mentioned in the ...READ MORE

May 17, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
733 views
0 votes
1 answer

Sqoop version compatible with Hadoop 0.20?

Apache Sqoop has 2 major projects i.e. ...READ MORE

May 9, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,072 views
0 votes
1 answer

How to tune Spark jobs & optimize the performance?

You need to know the cluster properly ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,979 views
0 votes
1 answer

What is the $HIVE_HOME in the Hortonworks sandbox?

You can find the hive-site.xml file at /etc/hive/conf.install/hive-site.xml ...READ MORE

Apr 23, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,744 views
0 votes
1 answer

Joining Multiple Spark Dataframes

You can run the below code to ...READ MORE

Mar 26, 2018 in Big Data Hadoop by Bharani
• 4,660 points
2,927 views
0 votes
1 answer

Can we use Apache Sqoop and Hive both together? What is the difference between the two?

Yes, you can. In fact many people ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,922 views
0 votes
1 answer

Hadoop combiners with aggregator functions

Both combiner & aggregator fulfill partly the ...READ MORE

May 16, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
676 views
0 votes
1 answer

Cassandra for datawarehouse

Actually, Cassandra being a NoSQL database with ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,754 views
0 votes
1 answer

Relationship between Spark, Hadoop and Cassandra?

Spark is a distributed in memory processing ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,832 views
0 votes
1 answer

Not able to start Job History Server in Hadoop 2.8.1

You have to start JobHistoryServer process specifically ...READ MORE

Mar 30, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,693 views
0 votes
1 answer

How to upload a file to HDFS in Ubuntu?

You can use hadoop fs -copyFromLocal "/home/ritwi ...READ MORE

Apr 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,810 views
0 votes
1 answer

How to import data to HBase from SQL server?

You can easily import the data from ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,711 views
0 votes
1 answer

How to get started with Hadoop and do some development using Eclipse IDE?

Alright, there are a couple of things that ...READ MORE

Apr 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,334 views
0 votes
1 answer

What are the advantages & disadvantages of Hadoop Dockerization?

As you already have a Hadoop ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,679 views
0 votes
1 answer

When is an identity mapper/reducer used?

1. One of the simplest examples of Iterative ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,321 views
0 votes
1 answer

IDE for writing and running Hadoop jobs?

You can use the Eclipse IDE. It's ...READ MORE

Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
2,052 views
0 votes
1 answer

Can we use Apache Mahout without Hadoop dependency?

There are a number of algorithm implementations ...READ MORE

Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,318 views
0 votes
1 answer

How can I use my host machine’s web browser to check my HDFS services running in the VM?

The sole purpose of the virtual machine ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,661 views
+1 vote
1 answer

Why is a jar file required to execute MR code instead of a class file?

We use the hadoop keyword to invoke the ...READ MORE

Apr 24, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,345 views
0 votes
1 answer

What is Distributed Cache actually used for in Hadoop?

Basically distributed cache allows you to cache ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,325 views
0 votes
1 answer

How to find the Hadoop distribution and version?

Just use the command hadoop version ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points

edited Apr 6, 2018 by kurt_cobain
2,119 views
0 votes
1 answer

What is the function of the getLocalCacheArchives method?

We use distributed cache to share those ...READ MORE

Apr 29, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,064 views
0 votes
1 answer

How to work with distributed cache in Hadoop?

The problem with your code is that ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,446 views
0 votes
1 answer

How do I parse a PDF file using MapReduce?

If you have your own custom InputFormat (WholeFileInputFormat). ...READ MORE

Apr 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,838 views