Recent questions tagged spark

0 votes
1 answer

Spark - load CSV file as DataFrame?

Sep 25, 2018 in Big Data Hadoop by digger
• 26,740 points
7,370 views
0 votes
1 answer

What happens to RDD when one of the nodes goes down?

Sep 3, 2018 in Apache Spark by Shubham
• 13,490 points
2,421 views
0 votes
1 answer

Does Spark provide the storage layer too?

Sep 3, 2018 in Apache Spark by Shubham
• 13,490 points
2,040 views
0 votes
1 answer

Functions of Spark SQL?

Sep 3, 2018 in Apache Spark by Meci Matt
• 9,460 points
2,145 views
0 votes
1 answer

Languages supported by Apache Spark?

Sep 3, 2018 in Apache Spark by Meci Matt
• 9,460 points
8,011 views
0 votes
1 answer

How to connect to Amazon Redshift from Apache Spark?

Aug 22, 2018 in AWS by datageek
• 2,540 points
8,562 views
0 votes
2 answers

Which cluster type should I choose for Spark?

Aug 21, 2018 in Apache Spark by Shubham
• 13,490 points
2,754 views
0 votes
2 answers

Which of these will vanish: Flink vs Spark?

Aug 10, 2018 in Big Data Hadoop by Omkar
• 69,180 points
1,966 views
0 votes
1 answer

What makes Spark faster than MapReduce?

Jul 27, 2018 in Apache Spark by Neha
• 6,300 points
2,266 views
0 votes
1 answer

PySpark Config?

Jul 26, 2018 in Apache Spark by shams
• 3,670 points
1,422 views
0 votes
1 answer

A strange Spark error on AWS EMR

Jul 13, 2018 in AWS by Luke cage
• 360 points
2,273 views
+1 vote
8 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,490 points
65,061 views
0 votes
2 answers

How to use RDD filter with another function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,490 points
10,606 views
0 votes
1 answer

How to add third-party Java JARs for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,490 points
9,468 views
+1 vote
1 answer

map vs mapValues in Spark

Jun 29, 2018 in Apache Spark by Shubham
• 13,490 points
17,189 views
+1 vote
3 answers

Which cluster type should I choose for Spark?

Jun 27, 2018 in Apache Spark by Shubham
• 13,490 points
2,514 views
0 votes
1 answer

Which is better in terms of speed, Shark or Spark?

Jun 26, 2018 in Apache Spark by Shubham
• 13,490 points
1,491 views
0 votes
1 answer

Spark Driver roles

Jun 21, 2018 in Apache Spark by shams
• 3,670 points
1,540 views
0 votes
2 answers

Parquet Files Advantages

Jun 21, 2018 in Apache Spark by Data_Nerd
• 2,390 points
2,850 views
0 votes
2 answers

map() and flatMap()

Jun 20, 2018 in Apache Spark by Ashish
• 2,650 points
1,813 views
0 votes
1 answer

Spark standalone client mode

Jun 20, 2018 in Apache Spark by shams
• 3,670 points
1,760 views
0 votes
1 answer

Ways to create RDD in Apache Spark

Jun 19, 2018 in Apache Spark by Shubham
• 13,490 points
4,750 views
0 votes
3 answers

Lineage Graph in Spark

Jun 19, 2018 in Apache Spark by Data_Nerd
• 2,390 points
13,206 views
0 votes
1 answer

How does an RDD persist data in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
2,006 views
0 votes
1 answer

What do we mean by an RDD in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
4,714 views
0 votes
1 answer

Persistence Levels in Spark

Jun 8, 2018 in Apache Spark by Data_Nerd
• 2,390 points
6,801 views
0 votes
1 answer

What is Shark?

Jun 8, 2018 in Apache Spark by shams
• 3,670 points
1,613 views
+1 vote
1 answer

Kafka Feature

Jun 7, 2018 in Apache Spark by shams
• 3,670 points
2,548 views
0 votes
1 answer

SQLInterpreter in Spark

Jun 7, 2018 in Apache Spark by shams
• 3,670 points
1,292 views
0 votes
1 answer

Parquet File

Jun 4, 2018 in Apache Spark by shams
• 3,670 points
1,586 views