Recent questions tagged developer

0 votes
1 answer

Sqoop Metastore ?

Jul 19, 2018 in Big Data Hadoop by shams
• 3,670 points
1,318 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How Namenode handles data node failures?

Jul 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,296 views
0 votes
1 answer

Kafka topic not being deleted

Jul 9, 2018 in Apache Kafka by Shubham
• 13,490 points
3,059 views
+1 vote
8 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,490 points
61,796 views
0 votes
2 answers

How to use RDD filter with other function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,490 points
9,674 views
0 votes
1 answer

How to add third party java jars for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,490 points
8,693 views
0 votes
1 answer
0 votes
1 answer
+1 vote
1 answer

map vs mapValues in Spark

Jun 29, 2018 in Apache Spark by Shubham
• 13,490 points
16,035 views
+1 vote
3 answers

Which cluster type should I choose for Spark?

Jun 27, 2018 in Apache Spark by Shubham
• 13,490 points
1,568 views
0 votes
1 answer

Which is better in term of speed, Shark or Spark?

Jun 26, 2018 in Apache Spark by Shubham
• 13,490 points
930 views
0 votes
1 answer

Spark Driver roles

Jun 21, 2018 in Apache Spark by shams
• 3,670 points
1,046 views
0 votes
1 answer

Spark standalone client mode

Jun 20, 2018 in Apache Spark by shams
• 3,670 points
1,000 views
0 votes
1 answer

Ways to create RDD in Apache Spark

Jun 19, 2018 in Apache Spark by Shubham
• 13,490 points
4,059 views
0 votes
3 answers

Lineage Graph in Spark

Jun 19, 2018 in Apache Spark by Data_Nerd
• 2,390 points
11,730 views
0 votes
1 answer
0 votes
1 answer

How RDD persist the data in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,399 views
0 votes
1 answer

What do we mean by an RDD in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
4,046 views
0 votes
1 answer

Different Hadoop Modes

Jun 13, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
13,064 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

InputSplit vs HDFS Block

Jun 1, 2018 in Big Data Hadoop by shams
• 3,670 points
4,403 views
0 votes
1 answer

How does partitioning work in Spark?

May 31, 2018 in Apache Spark by coldcode
• 2,090 points
1,215 views
0 votes
1 answer

Is there any way to uncache RDD?

May 30, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,709 views
0 votes
1 answer

Sqoop vs distCP

May 30, 2018 in Big Data Hadoop by shams
• 3,670 points
1,327 views
0 votes
1 answer

NameNode without any data

May 29, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,293 views
0 votes
1 answer
0 votes
1 answer

How to find max value in pair RDD?

May 26, 2018 in Apache Spark by kurt_cobain
• 9,350 points
7,986 views
0 votes
1 answer
0 votes
1 answer

out of Memory Error in Hadoop

May 22, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,823 views
0 votes
1 answer
0 votes
1 answer

Is a HDFS block sequential ?

May 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,607 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How to install Hadoop in Ubuntu?

May 17, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
651 views
0 votes
1 answer
0 votes
1 answer

Visualization Tool in Cloudera CDH

May 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,188 views
0 votes
10 answers