Spark workers are not accepting any job Kubernetes-Docker-Spark

0 votes

I'm trying to create a distributed Spark cluster on Kubernetes. For this, I've created a Kubernetes cluster, and on top of it I'm trying to create a Spark cluster. My Dockerfile is:

# Copyright (c) Jupyter Development Team.
# Distributed under the terms of the Modified BSD License
ARG BASE_CONTAINER=jupyter/scipy-notebook
FROM $BASE_CONTAINER

LABEL maintainer="Jupyter Project <jupyter@googlegroups.com>"

USER root

# Spark dependencies
ENV SPARK_VERSION 2.3.2
ENV SPARK_HADOOP_PROFILE 2.7
ENV SPARK_SRC_URL https://www.apache.org/dist/spark/spark-$SPARK_VERSION/spark-${SPARK_VERSION}-bin-hadoop${SPARK_HADOOP_PROFILE}.tgz
ENV SPARK_HOME=/opt/spark
ENV PATH $PATH:$SPARK_HOME/bin

RUN apt-get update && \
     apt-get install -y openjdk-8-jdk-headless \
     postgresql && \
    rm -rf /var/lib/apt/lists/*
ENV JAVA_HOME /usr/lib/jvm/java-8-openjdk-amd64/

ENV PATH $PATH:$JAVA_HOME/bin

RUN wget ${SPARK_SRC_URL}

RUN tar -xzf spark-${SPARK_VERSION}-bin-hadoop${SPARK_HADOOP_PROFILE}.tgz

RUN mv spark-${SPARK_VERSION}-bin-hadoop${SPARK_HADOOP_PROFILE} /opt/spark

RUN rm -f spark-${SPARK_VERSION}-bin-hadoop${SPARK_HADOOP_PROFILE}.tgz

USER $NB_UID
ENV POST_URL https://jdbc.postgresql.org/download/postgresql-42.2.5.jar
RUN wget ${POST_URL}
RUN mv postgresql-42.2.5.jar $SPARK_HOME/jars
# Install pyarrow
RUN conda install --quiet -y 'pyarrow' && \
    conda install pyspark==2.3.2 && \
    conda clean -tipsy && \
    fix-permissions $CONDA_DIR && \
    fix-permissions /home/$NB_USER


USER root

ADD log4j.properties /opt/spark/conf/log4j.properties
ADD start-common.sh start-worker.sh start-master.sh /
ADD loop.sh $SPARK_HOME/bin/
ADD core-site.xml /opt/spark/conf/core-site.xml
ADD spark-defaults.conf /opt/spark/conf/spark-defaults.conf
RUN chmod +x $SPARK_HOME/bin/loop.sh
RUN chmod +x /start-master.sh
RUN chmod +x /start-common.sh
RUN chmod +x /start-worker.sh
ENV PATH $PATH:/opt/spark/bin/loop.sh

RUN apt-get update
RUN apt-get install curl -y

WORKDIR /

My master and worker YAML files are:

kind: ReplicationController
apiVersion: v1
metadata:
  name: spark-master-controller
spec:
  replicas: 1
  selector:
    component: spark-master
  template:
    metadata:
      labels:
        component: spark-master
    spec:
      hostname: spark-master
      containers:
        - name: spark-master
          image: hrafiq/dockerhub:spark-jovyan-local
          command: ["sh", "/start-master.sh", "run"]
          imagePullPolicy: Always
          ports:
            - containerPort: 7077
              hostPort: 7077
            - containerPort: 8080
              hostPort: 8080
            - containerPort: 6066
              hostPort: 6066
            - containerPort: 7001
              hostPort: 7001
            - containerPort: 7002
              hostPort: 7002
            - containerPort: 7003
              hostPort: 7003
            - containerPort: 7004
              hostPort: 7004
            - containerPort: 7005
              hostPort: 7005
            - containerPort: 4040
              hostPort: 4040
          env:
            - name: SPARK_PUBLIC_DNS
              value: 192.168.1.254
            - name: SPARK_MASTER_IP
              value: 192.168.1.254

And the worker file:

kind: ReplicationController
apiVersion: v1
metadata:
  name: spark-worker-controller
spec:
  replicas: 2
  selector:
    component: spark-worker
  template:
    metadata:
      labels:
        component: spark-worker
    spec:
      containers:
        - name: spark-worker
          image: hrafiq/dockerhub:spark-jovyan-local
          command: ["sh", "/start-worker.sh","run"]
          imagePullPolicy: Always
          ports:
            - containerPort: 8081
            - containerPort: 7012
            - containerPort: 7013
            - containerPort: 7014
            - containerPort: 5001
            - containerPort: 5003
            - containerPort: 8881

The workers get registered with the master, but they are still unable to execute any tasks: no cores are assigned to the executors and no job runs. This error is displayed:

"Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources"
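Before digging into the cluster config, it can help to confirm what state the pods and the Spark daemons are actually in. A minimal diagnostic sketch, assuming a setup like the one above (the pod name suffixes below are placeholders, not real names from this cluster):

```shell
# Are the master and both workers actually Running, and on which nodes/IPs?
kubectl get pods -o wide

# Did the workers register, and are there connection errors?
# (replace the xxxxx suffixes with your real pod names)
kubectl logs spark-master-controller-xxxxx
kubectl logs spark-worker-controller-xxxxx

# Then open the master UI on port 8080 and compare the cores/memory the
# workers advertise against what the submitted job is requesting.
```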

This is the Spark UI:

Feb 27, 2019 in Apache Spark by Hamza
Hey @Hamza, are your Spark slaves running?
You usually get this error when there aren't enough resources. Check the master UI for the registered workers and their resources, compare them with your spark-submit settings, and see what's missing.

Are you able to dynamically assign the cores?

Refer to the code below. In your application:

val sc = new SparkContext(new SparkConf())

Then pass the core count at submit time:

./bin/spark-submit <all your existing options> --conf spark.driver.cores=1
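To rule out a resource mismatch, the executor demands can also be capped explicitly so the request fits what the workers advertise. A hedged sketch for standalone mode; the master URL and application path are placeholders for this setup:

```shell
# Ask for no more than the workers offer (values here are illustrative).
./bin/spark-submit \
  --master spark://spark-master:7077 \
  --total-executor-cores 2 \
  --executor-memory 512m \
  --conf spark.driver.cores=1 \
  /path/to/your-app.py
```

If the job still shows zero assigned cores with requests this small, the problem is usually network reachability between driver, master, and workers rather than resources.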

1 answer to this question.

+1 vote

When Kubernetes picks a 10.*.*.*/16 network as its pod network, the jobs execute successfully. When it picks a 192.168.*.*/16 subnet as its pod network, jobs do not execute. The 192.168.*.* range is likely conflicting with the existing LAN.
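A quick way to check which pod CIDR the cluster ended up with, and to pin a non-conflicting one on a kubeadm-based cluster (a sketch; assumes kubeadm and that 10.244.0.0/16 does not overlap your LAN):

```shell
# Inspect the pod CIDR the controller manager is using.
kubectl cluster-info dump | grep -m1 -- --cluster-cidr

# When (re)creating the cluster, pick a range that does not overlap the
# 192.168.x.x LAN, e.g. the 10.244.0.0/16 default used by Flannel:
sudo kubeadm init --pod-network-cidr=10.244.0.0/16
```

The CNI plugin's own config (e.g. Flannel's or Calico's manifest) must be set to the same range.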

answered Mar 1, 2019 by Hamza
Yeah, that's a possibility.

The image you've attached is very unclear; could you add another, clearer one?
