Recent questions tagged developer

0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

Sqoop Metastore ?

Jul 19, 2018 in Big Data Hadoop by shams
• 3,670 points
1,414 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How Namenode handles data node failures?

Jul 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,571 views
0 votes
1 answer

Kafka topic not being deleted

Jul 9, 2018 in Apache Kafka by Shubham
• 13,490 points
3,195 views
+1 vote
8 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,490 points
62,316 views
0 votes
2 answers

How to use RDD filter with other function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,490 points
10,051 views
0 votes
1 answer

How to add third party java jars for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,490 points
8,905 views
0 votes
1 answer
0 votes
1 answer
+1 vote
1 answer

map vs mapValues in Spark

Jun 29, 2018 in Apache Spark by Shubham
• 13,490 points
16,355 views
+1 vote
3 answers

Which cluster type should I choose for Spark?

Jun 27, 2018 in Apache Spark by Shubham
• 13,490 points
1,814 views
0 votes
1 answer

Which is better in term of speed, Shark or Spark?

Jun 26, 2018 in Apache Spark by Shubham
• 13,490 points
1,032 views
0 votes
1 answer

Spark Driver roles

Jun 21, 2018 in Apache Spark by shams
• 3,670 points
1,183 views
0 votes
1 answer

Spark standalone client mode

Jun 20, 2018 in Apache Spark by shams
• 3,670 points
1,230 views
0 votes
1 answer

Ways to create RDD in Apache Spark

Jun 19, 2018 in Apache Spark by Shubham
• 13,490 points
4,214 views
0 votes
3 answers

Lineage Graph in Spark

Jun 19, 2018 in Apache Spark by Data_Nerd
• 2,390 points
12,173 views
0 votes
1 answer
0 votes
1 answer

How RDD persist the data in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,522 views
0 votes
1 answer

What do we mean by an RDD in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
4,165 views
0 votes
1 answer

Different Hadoop Modes

Jun 13, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
13,236 views
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

InputSplit vs HDFS Block

Jun 1, 2018 in Big Data Hadoop by shams
• 3,670 points
4,557 views
0 votes
1 answer

How does partitioning work in Spark?

May 31, 2018 in Apache Spark by coldcode
• 2,090 points
1,349 views
0 votes
1 answer

Is there any way to uncache RDD?

May 30, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,826 views
0 votes
1 answer

Sqoop vs distCP

May 30, 2018 in Big Data Hadoop by shams
• 3,670 points
1,432 views
0 votes
1 answer

NameNode without any data

May 29, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,412 views
0 votes
1 answer
0 votes
1 answer

How to find max value in pair RDD?

May 26, 2018 in Apache Spark by kurt_cobain
• 9,350 points
8,154 views
0 votes
1 answer
0 votes
1 answer

out of Memory Error in Hadoop

May 22, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,917 views
0 votes
1 answer
0 votes
1 answer

Is a HDFS block sequential ?

May 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,782 views
0 votes
1 answer
0 votes
1 answer

Sqoop vs Oracle Hadoop Connectors

May 18, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,056 views
0 votes
1 answer
0 votes
1 answer

How to install Hadoop in Ubuntu?

May 17, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
762 views