questions/apache-spark
You aren't actually overwriting anything with this ...READ MORE
Use the function as following: var notFollowingList=List(9.8,7,6,3,1) df.filter(col("uid").isin(notFollowingList:_*)) You can ...READ MORE
val coder: (Int => String) = v ...READ MORE
ReduceByKey is the best for production. READ MORE
Hi, In Spark, fill() function of DataFrameNaFunctions class is used to replace ...READ MORE
Save it to a text file: line.saveAsTextFile("alicia.txt") Print contains ...READ MORE
its late but this how you can ...READ MORE
Recently, there are two new data abstractions ...READ MORE
Spark map function expresses a one-to-one transformation. ...READ MORE
When you concatenate any string with a ...READ MORE
The string Productivity has to be enclosed between single ...READ MORE
Using findspark is expected to solve the ...READ MORE
Hi@akhtar, To import this module in your program, ...READ MORE
You can use the function expr val data ...READ MORE
Comparison between Spark RDD vs DataFrame 1. Release ...READ MORE
Hi@akhtar, By default pyspark in not present in ...READ MORE
Hi, @Ritu, When creating a pair RDD from ...READ MORE
Hi@akhtar, Here you are trying to read a ...READ MORE
Try this: val df = sc.textFile("HDFS://nameservice1/user/edureka_168049/Structure_IT/samplefile.txt") df.collect() val df = ...READ MORE
14)The number of stages in a job ...READ MORE
Hi@ritu, You can create a data frame from ...READ MORE
Hi, @Ritu, option b for you, as Hash Partitioning ...READ MORE
Hi@Edureka, Spark's internal scheduler may truncate the lineage of the RDD graph ...READ MORE
Option c) Mapr Jobs that are submitted READ MORE
Hi@ritu, I think the problem can be solved ...READ MORE
Hi@ritu, You need to learn the Architecture of ...READ MORE
Hi@dani, You can find the euclidean distance using ...READ MORE
Hey, @Ritu, According to the question, the answer ...READ MORE
What allows spark to periodically persist data ...READ MORE
Hi@ritu, Spark DStream (Discretized Stream) is the basic ...READ MORE
Hi@ritu, Spark's internal scheduler may truncate the lineage of the RDD graph if ...READ MORE
Hi@Prasant, If Spark Streaming is not supporting tuple, ...READ MORE
Hi@ritu, Fault tolerance is the property that enables ...READ MORE
Option D) runtime error READ MORE
Hi@akhtar, Since Avro library is external to Spark, ...READ MORE
Hi @Ritu If you want to see the ...READ MORE
A Dataframe can be created from an ...READ MORE
Hi, I am able to understand your requirement. ...READ MORE
Hi@Ruben, I think you can add an escape ...READ MORE
error: expected class or object definition sc.parallelize (Array(1L, ...READ MORE
Hi@Edureka, Checkpointing is a process of truncating RDD ...READ MORE
Hi, @Ritu, According to the official documentation of Spark 1.2, ...READ MORE
17)from the given choices, identify the value ...READ MORE
Option a) List(5,100,10) The take method returns the first n elements in an ...READ MORE
Hi@ritu, AWS has lots of services. For spark ...READ MORE
Hi@ritu, To start your python spark shell, you ...READ MORE
Option d) Run time error. READ MORE
Hi@ritu, The most appropriate step according to me ...READ MORE
Hi@akhtar, You can write the spark dataframe in ...READ MORE
What does the below code print? val AgeDs ...READ MORE
OR
At least 1 upper-case and 1 lower-case letter
Minimum 8 characters and Maximum 50 characters
Already have an account? Sign in.