Apache Spark Questions | Edureka Community

Code: srcDF.write.mode(tblmode).saveAsTable(s"${dbName}.${tgtHiveTableName}") error: 21/06/04 22:11:45 ERROR pa.TrxNbrx: org.apache.spark.SparkException: ...READ MORE

Jun 5, 2021 in Apache Spark by Rajesh

edited Mar 4 • 192 views

0 votes

0 answers

OI JANA TESTE LIVE

OI JANA TESTE LIVE READ MORE

Jun 5, 2021 in Apache Spark by Eufrasia

edited Mar 4 • 175 views

0 votes

0 answers

what parameters are required for a "windowed" operation such as reduceByKeyAndWindow?

a) Window length b) sliding interval c) Window Length ...READ MORE

Jun 4, 2021 in Apache Spark by anonymous

edited Mar 4 • 208 views

0 votes

1 answer

How to create dataframe for the comma delimited file?

.option("sep", delimeter) READ MORE

Oct 28, 2022 in Apache Spark by anonymous

edited Mar 5 • 3,997 views

+1 vote

3 answers

What is the difference between rdd and dataframes in Apache Spark ?

Comparison between Spark RDD vs DataFrame 1. Release ...READ MORE

Aug 28, 2018 in Apache Spark by shams
• 3,670 points • 44,175 views

0 votes

1 answer

What are some of the things you can monitor in the Spark Web UI?

The stages which are running slow READ MORE

Apr 29, 2021 in Apache Spark by anonymous

edited Mar 5 • 4,407 views

0 votes

1 answer

ImportError: No module named 'pyspark'

Hi@akhtar, By default pyspark in not present in ...READ MORE

May 6, 2020 in Apache Spark by MD
• 95,460 points • 16,217 views

0 votes

1 answer

How to select all columns with group by?

Try df.select(df("*")).groupby("id").agg(sum("salary")) READ MORE

Sep 17, 2021 in Apache Spark by Parimi Pavan

edited Mar 5 • 14,971 views

0 votes

0 answers

It is a schema for a pyspark nested dataframe ..what I want is to change the name of the ambiguous terms configdata to configdata0, configdata1 and so on . So how can it be achieved using rdd?

root |-- fields: struct (nullable = true) | ...READ MORE

Apr 27, 2021 in Apache Spark by sushant

edited Mar 4 • 222 views

+1 vote

1 answer

is not a Parquet file. expected magic number at tail [80, 65, 82, 49] but found [51, 53, 10, 10]

Hi@akhtar, Here you are trying to read a ...READ MORE

Feb 3, 2020 in Apache Spark by MD
• 95,460 points • 19,305 views

0 votes

0 answers

Real time Project challenges in Spark Data pipeline

Can anybody highlights some challenges they have ...READ MORE

Apr 6, 2021 in Apache Spark by anonymous

edited Mar 4 • 226 views

+1 vote

1 answer

Reading a text file through spark data frame

Try this: val df = sc.textFile("HDFS://nameservice1/user/edureka_168049/Structure_IT/samplefile.txt") df.collect() val df = ...READ MORE

Jul 24, 2019 in Apache Spark by Suri
• 26,953 views

0 votes

1 answer

Why Partitions are immutable in Spark?

Partitions use HDFS API. READ MORE

Aug 25, 2022 in Apache Spark by anonymous

edited Mar 5 • 2,639 views

0 votes

0 answers

The number of stages in a job is equal to the number of RDDs in DAG. however, under one of the cgiven conditions, the scheduler can truncate the lineage. identify it. [closed]

14)The number of stages in a job ...READ MORE

Nov 25, 2020 in Apache Spark by Edureka
• 200 points
closed Nov 25, 2020 by MD • 5,466 views

0 votes

2 answers

5)Using which one of the given choices will you create an RDD with specific partitioning?

Hi, @Ritu, option b for you, as Hash Partitioning ...READ MORE

Nov 23, 2020 in Apache Spark by Gitika
• 65,730 points • 5,054 views

0 votes

1 answer

The number of stages in a job is equal to the number of RDDs in DAG. however, under one of the cgiven conditions, the scheduler can truncate the lineage. identify it.

Hi@Edureka, Spark's internal scheduler may truncate the lineage of the RDD graph ...READ MORE

Nov 26, 2020 in Apache Spark by MD
• 95,460 points • 4,624 views

Page:

« prev
1
2
3
4
5
6
7
8
...
12
next »

All categories
Generative AI (1,454)
Power BI (1,316)
DevOps & Agile (4,138)
Data Science (100)
ChatGPT (30)
Cyber Security & Ethical Hacking (1,057)
Data Analytics (1,266)
Cloud Computing (4,053)
Machine Learning (337)
PMP (1,069)
Python (3,489)
SalesForce (201)
Selenium (1,624)
Software Testing (58)
Tableau (608)
Web Development (3,972)
UI UX Design (24)
Java (1,358)
Azure (157)
Database (858)
Big Data Hadoop (1,907)
Blockchain (1,673)
Digital Marketing (121)
C# (141)
C++ (272)
IoT (Internet of Things) (390)
Kotlin (8)
Linux Administration (389)
MicroStrategy (7)
Mobile Development (395)
Others (2,387)
RPA (653)
Talend (73)
TypeSript (124)
Apache Kafka (84)
Apache Spark (596)
Career Counselling (1,091)
Events & Trending Topics (28)
Ask us Anything! (71)

Trending questions in Apache Spark

The number of stages in a job is equal to the number of RDDs in DAG. however, under one of the cgiven conditions, the scheduler can truncate the lineage. identify it. [closed]

Most popular tags

Subscribe to our Newsletter, and get personalized recommendations.

CATEGORIES

TRENDING BLOG ARTICLES