Validation in Spark
Let's say you have created a partition for Hyderabad but loaded Chennai data into it. What validation can prevent this?
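You can prevent this by using dynamic partitioning instead of static partitioning. With a static partition you name the target partition yourself, so nothing stops you from pointing Chennai rows at the Hyderabad partition. With dynamic partitioning, the partition is derived from the value of the partition column in each row, so a row cannot be written to a partition that disagrees with its own data.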
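To make the difference concrete, here is a minimal plain-Python sketch of the idea. It is not Spark code: the function names `check_static_load` and `route_dynamically`, the `city` column, and the sample rows are all hypothetical and only illustrate the logic. In Spark itself, the dynamic routing is roughly what `df.write.partitionBy("city")` does for you, and the static check is the kind of comparison you could run as a filter before loading into a fixed partition.

```python
from collections import defaultdict


def check_static_load(rows, column, partition_value):
    """Return the rows whose `column` value disagrees with the target partition.

    A static load should only go ahead if this list is empty.
    """
    return [row for row in rows if row.get(column) != partition_value]


def route_dynamically(rows, column):
    """Group rows by their own `column` value, the way dynamic partitioning does.

    Each row lands under a key such as "city=Chennai", so it can never end up
    in a partition that contradicts its data.
    """
    partitions = defaultdict(list)
    for row in rows:
        partitions[f"{column}={row[column]}"].append(row)
    return dict(partitions)


# Hypothetical sample data: mostly Chennai rows about to be loaded into city=Hyderabad.
rows = [
    {"id": 1, "city": "Chennai"},
    {"id": 2, "city": "Chennai"},
    {"id": 3, "city": "Hyderabad"},
]

# Static load: validate first and refuse to load if anything is mismatched.
bad = check_static_load(rows, "city", "Hyderabad")
print(len(bad))  # 2

# Dynamic load: the partition comes from the data, so no mismatch is possible.
routed = route_dynamically(rows, "city")
print(sorted(routed))  # ['city=Chennai', 'city=Hyderabad']
```

If you have to keep static partitions, a check like the first function is the validation to add before the load. Otherwise, letting the partition column decide the destination removes the problem altogether.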