By default, how many partitions are created in an RDD in Apache Spark?

Question

Can anyone explain how many partitions are created by default in an RDD in Apache Spark?

Gitika · Answer 1 · Aug 2, 2019

Well, it depends on how the file is split into blocks in HDFS. If you are using Spark's default settings, sc.textFile creates roughly one partition for every block of the file. But you can also ask for more partitions explicitly by passing a minimum partition count as the second argument to sc.textFile.
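To make the "one partition per block" rule a bit more concrete, here is a rough sketch of the arithmetic. It assumes a single uncompressed, splittable file and follows the split-size logic of Hadoop's old-API FileInputFormat, which sc.textFile relies on. The object name, the function name and the default of 2 for minPartitions (the usual value of sc.defaultMinPartitions) are illustrative assumptions, not Spark API.

```scala
object SplitEstimate {
  // Hadoop lets the last split be up to 10% larger than the split size.
  private val SplitSlop = 1.1

  /** Rough estimate of the number of input splits (and so RDD partitions) for one file. */
  def estimateSplits(fileSize: Long, blockSize: Long, minPartitions: Int = 2): Int = {
    require(fileSize > 0 && blockSize > 0, "sizes must be positive")
    // Target size per split if the file were cut into minPartitions pieces.
    val goalSize = fileSize / math.max(minPartitions, 1)
    // A split is never larger than one block and never smaller than one byte.
    val splitSize = math.max(1L, math.min(goalSize, blockSize))
    var remaining = fileSize
    var splits = 0
    while (remaining.toDouble / splitSize > SplitSlop) {
      splits += 1
      remaining -= splitSize
    }
    if (remaining > 0) splits += 1 // the final, possibly smaller, split
    splits
  }

  def main(args: Array[String]): Unit = {
    val mb = 1024L * 1024L
    println(estimateSplits(300 * mb, 128 * mb))     // 3: one per 128 MB block
    println(estimateSplits(1000L, 128 * mb))        // 2: a tiny file is still cut in two
    println(estimateSplits(300 * mb, 128 * mb, 10)) // 10: minPartitions wins over block size
  }
}
```

So the block size sets the upper bound on split size, while a larger minPartitions value makes the splits smaller and therefore more numerous.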

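Here is an example:

val rdd1 = sc.textFile("/home/hdadmin/wc-data.txt")      // default: about one partition per block
val rdd2 = sc.textFile("/home/hdadmin/wc-data.txt", 10)  // request a minimum of 10 partitions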
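If you want to check the partition count yourself, something like the following should work as a minimal sketch. It assumes Spark is on the classpath and runs in local mode. The object name, the temporary sample file and the partition counts passed in are made up for illustration. The exact numbers printed for the two reads depend on your file size and configuration, so none are claimed here.

```scala
import java.nio.file.Files
import org.apache.spark.{SparkConf, SparkContext}

object PartitionCountDemo {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("partition-count-demo").setMaster("local[2]")
    val sc = new SparkContext(conf)
    try {
      // Hypothetical sample input: a small temporary text file.
      val path = Files.createTempFile("wc-data", ".txt")
      Files.write(path, "hello spark\nhello rdd\n".getBytes("UTF-8"))

      // Default read: partitioning is derived from the input splits.
      val defaultRdd = sc.textFile(path.toString)
      // Second argument is minPartitions, a lower-bound hint rather than an exact count.
      val explicitRdd = sc.textFile(path.toString, 10)

      println(s"defaultMinPartitions      = ${sc.defaultMinPartitions}")
      println(s"default read              = ${defaultRdd.getNumPartitions}")
      println(s"read with minPartitions   = ${explicitRdd.getNumPartitions}")
      // repartition gives an exact count, at the cost of a shuffle.
      println(s"after repartition(4)      = ${explicitRdd.repartition(4).getNumPartitions}")
    } finally {
      sc.stop()
    }
  }
}
```

In the Spark shell, where sc already exists, calling rdd1.getNumPartitions on the RDD from the example is enough.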

