Trending questions in Big Data Hadoop

0 votes
1 answer

Cassandra for datawarehouse

Actually, Cassandra being a NoSQL database with ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,738 views
0 votes
1 answer

Not able to start Job History Server in Hadoop 2.8.1

You have to start JobHistoryServer process specifically ...READ MORE

Mar 30, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,674 views
0 votes
1 answer

Relationship between Spark, Hadoop and Cassandra?

Spark is a distributed in memory processing ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,788 views
0 votes
1 answer

How to upload file to HDFS in Ubuntu

you can use  hadoop fs -copyFromLocal  "/home/ritwi ...READ MORE

Apr 19, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,785 views
0 votes
1 answer

How to import data to HBase from SQL server?

You can easily import the data from ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,691 views
0 votes
1 answer

How to get started with Hadoop and do some development using Eclipse IDE?

Alright, there are couple of things that ...READ MORE

Apr 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,305 views
0 votes
1 answer

What are the advantages & disadvantages of Hadoop Dockerization?

As you are already having a Hadoop ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,664 views
0 votes
1 answer

When is an identity mapper/reducer used?

1.One of the simplest example of Iterative ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,306 views
0 votes
1 answer

IDE for writing and running hadoop jobs?

You can use the Eclipse IDE. It's ...READ MORE

Apr 10, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
2,019 views
0 votes
1 answer

Can we use apache Mahout without Hadoop dependency?

There is a number of algorithm implementations ...READ MORE

Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,287 views
+1 vote
1 answer

Why is jar file required to execute a MR code instead of class file?

We use hadoop keyword to invoke the ...READ MORE

Apr 24, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,328 views
0 votes
1 answer

How can I use my host machine’s web browser to check my HDFS services running in the VM?

The sole purpose of the virtual machine ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,618 views
0 votes
1 answer

What Distributed Cache is actually used for in Hadoop?

Basically distributed cache allows you to cache ...READ MORE

Apr 3, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,293 views
0 votes
1 answer

How to find hadoop distribution and version?

Just Use the command Hadoop version ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points

edited Apr 6, 2018 by kurt_cobain 2,105 views
0 votes
1 answer

How to work with distributed cache in Hadoop?

The problem with your code is that ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,433 views
0 votes
1 answer

What is the function of getLocalCacheArchives method?

We use distributed cache to share those ...READ MORE

Apr 29, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,029 views
0 votes
1 answer

How do I parse pdf file using MapReduce?

If you have your own custom InputFormat(WholeFileInputFormat). ...READ MORE

Apr 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,813 views
0 votes
1 answer

Why ResourceManager crashes after sometime or while accessing HDFS in Hadoop 2.8.1 and Ubuntu 16.04?

I was facing the same problem and ...READ MORE

Apr 15, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,565 views
0 votes
1 answer

How to configure secondary namenode in Hadoop 2.x ?

bin/hadoop-daemon.sh start [namenode | secondarynamenode | datanode ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,953 views
0 votes
1 answer

How do I print hadoop properties in command line?

You can use the following command to get ...READ MORE

Apr 6, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,950 views
0 votes
1 answer

Different ports in a Hadoop cluster environment?

Below image will help you in understanding ...READ MORE

Apr 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,949 views
0 votes
1 answer

In which directory Hadoop should be installed?

You can install Hadoop anywhere. You just ...READ MORE

Apr 17, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,498 views
+1 vote
1 answer

I installed Spark but while executing command, I am getting ‘hadoop’ command not found error?

For accessing Hadoop commands & HDFS, you ...READ MORE

Mar 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,573 views
0 votes
1 answer

Getting error in Hadoop Streaming: Type mismatch in Key from Map

In Hadoop streaming you have to customize ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,378 views
0 votes
1 answer

How to install and configure a multi-node Hadoop cluster?

I would recommend you to install Cent ...READ MORE

Mar 22, 2018 in Big Data Hadoop by Shubham
• 13,490 points
2,584 views
0 votes
1 answer

Error in Hadoop Mapreduce

The file that you are referring here ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,300 views
0 votes
1 answer

Replication Factor in Hadoop?

If you have provided a replication factor ...READ MORE

Apr 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,331 views
0 votes
1 answer

HortonWorks Hadoop encryption tools and data security

There are many tools available for encrypting ...READ MORE

Apr 20, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,234 views
0 votes
1 answer

What happend while formatting hadoop namenode?

Basically, NameNode is the master daemon that ...READ MORE

May 4, 2018 in Big Data Hadoop by Ashish
• 2,650 points
665 views
0 votes
1 answer

Hadoop for data migration

You can use Hadoop for these kinds ...READ MORE

Apr 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,315 views
0 votes
1 answer

Where Big Data tools like Hadoop and Spark comes into picture when we talk about ETL?

When we talk about ETL, ETL means ...READ MORE

May 3, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
651 views
0 votes
1 answer

How to checkout Hadoop 2.6.0 from git

Clone the following Git repository: git clone git ...READ MORE

Apr 23, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,085 views
0 votes
1 answer

How to implement data locality in Hadoop MapReduce?

You can use this getFileBlockLocations method of ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,183 views
0 votes
1 answer

Map and Reduce task memory settings in Hadoop YARN

It's preferable and generally, it is recommended ...READ MORE

Apr 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,267 views
0 votes
1 answer

Which is better Neo4j or HBASE ?

For your requirement, I would recommend you ...READ MORE

Apr 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
882 views
0 votes
1 answer

How can we send data from MongoDB to Hadoop?

The MongoDB Connector for Hadoop reads data ...READ MORE

Mar 27, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,190 views
0 votes
1 answer

How Impala is fast compared to Hive in terms of query response?

Impala provides faster response as it uses MPP(massively ...READ MORE

Mar 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,379 views
0 votes
1 answer

What does Context mean in Hadoop MapR terms ?

The new API makes extensive use of ...READ MORE

Apr 6, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,675 views
0 votes
1 answer

How can we transfer a PDF file to HDFS?

You can easily upload any file to ...READ MORE

Apr 13, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,369 views
0 votes
1 answer

HDFS Location in Hadoop?

Try using the same path where you ...READ MORE

Apr 19, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,120 views
0 votes
1 answer

How to perform Near-real time analytics using Hadoop?

To perform Near-real time analytics, you have ...READ MORE

Apr 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
885 views
0 votes
1 answer

Why we are configuring mapred.job.tracker in YARN?

I really dont know the reason behind ...READ MORE

Mar 29, 2018 in Big Data Hadoop by Ashish
• 2,650 points
1,998 views
0 votes
0 answers

Are some Pig real time use cases available?

Can anyone provide me with some real ...READ MORE

Apr 21, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,077 views
0 votes
1 answer

What are the different ways to load data from Hadoop to Azure Data Lake?

I would recommend you to go through ...READ MORE

Apr 18, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,121 views
+2 votes
1 answer

Is Kafka and Zookeeper are required in a Big Data Cluster?

Apache Kafka is one of the components ...READ MORE

Mar 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,190 views
0 votes
1 answer

Why am I not able to see the Hadoop daemons that are running?

I guess you are starting the services ...READ MORE

Apr 18, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,108 views
0 votes
1 answer

Why Java Code in Hadoop uses own Data Types instead of basic Data types?

Hadoop provides us Writable interface based data ...READ MORE

Apr 15, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,228 views
0 votes
1 answer

Hadoop: TaskTracker and JobTracker don't start with start-dfs.sh

You must run the start-dfs..sh too. So when ...READ MORE

Apr 4, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,736 views
0 votes
1 answer

How to migrate/ change Hadoop 1.x maven project to Hadoop 2.x?

In Hadoop 2.2. most of the dependencies ...READ MORE

Apr 15, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,215 views
0 votes
1 answer

Hadoop vs Data Lake

A data lake is a storage repository that holds ...READ MORE

Mar 26, 2018 in Big Data Hadoop by Ashish
• 2,650 points
2,075 views