15

Hello, I was working with PySpark, implementing a sentiment analysis project using the ML package for the first time. The code was working fine, but suddenly it started showing the error below:

   ERROR:py4j.java_gateway:An error occurred while trying to connect to the Java server (127.0.0.1:50532)
Traceback (most recent call last):
  File "C:\opt\spark\spark-2.3.0-bin-hadoop2.7\python\lib\py4j-0.10.6-src.zip\py4j\java_gateway.py", line 852, in _get_connection
    connection = self.deque.pop()
IndexError: pop from an empty deque

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\opt\spark\spark-2.3.0-bin-hadoop2.7\python\lib\py4j-0.10.6-src.zip\py4j\java_gateway.py", line 990, in start
    self.socket.connect((self.address, self.port))
ConnectionRefusedError: [WinError 10061] No connection could be made because the target machine actively refused it

Can someone help, please? The full error description is above.

2
  • I get this error when trying to initialize SparkContext from the shell. SparkContext is created automatically in the shell. Commented Aug 7, 2018 at 12:27
  • In my case I am working in a Jupyter notebook, so I have to initialize the SparkContext manually (see the sketch below). Commented Aug 10, 2018 at 14:06
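For reference, a minimal sketch of that manual initialization in Spark 2.x, where building a SparkSession also gives you the SparkContext (the app name is just a placeholder):

from pyspark.sql import SparkSession

# In the pyspark shell, `spark` and `sc` are created for you;
# in Jupyter you build them yourself.
spark = SparkSession.builder \
    .appName('notebook_session') \
    .master('local[*]') \
    .getOrCreate()
sc = spark.sparkContext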

5 Answers

13

Just restart your notebook if you are using Jupyter. If not, restart PySpark; that should solve the problem. It usually happens because of too many collect() calls or some other memory-related issue.
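If restarting the kernel is inconvenient, a hedged alternative is to tear down and rebuild the session from within the notebook; this sketch assumes a SparkSession named spark already exists:

from pyspark.sql import SparkSession

# Stop the broken session; this may itself fail if the JVM is already gone.
try:
    spark.stop()
except Exception:
    pass

# Build a fresh session and continue working.
spark = SparkSession.builder \
    .appName('sentiment_analysis') \
    .master('local[*]') \
    .getOrCreate()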



9

Add more resources to Spark. For example, if you're working in local mode, a configuration like the following should be sufficient:

from pyspark.sql import SparkSession

spark = SparkSession.builder \
    .appName('app_name') \
    .master('local[*]') \
    .config('spark.sql.execution.arrow.pyspark.enabled', True) \
    .config('spark.sql.session.timeZone', 'UTC') \
    .config('spark.driver.memory', '32G') \
    .config('spark.ui.showConsoleProgress', True) \
    .config('spark.sql.repl.eagerEval.enabled', True) \
    .getOrCreate()
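One caveat, hedged: spark.driver.memory is only applied when the JVM starts, so setting it in the builder has no effect if a context already exists. In a notebook it can instead be passed through the PYSPARK_SUBMIT_ARGS environment variable before the first session is created (the 8G value is only an example; size it to your machine):

import os

# Must run before any SparkSession/SparkContext is created; the trailing
# 'pyspark-shell' token is required by PySpark's launcher.
os.environ['PYSPARK_SUBMIT_ARGS'] = '--driver-memory 8G pyspark-shell'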


7

I encountered this error while trying to use PySpark within a Docker container. In my case, it came from assigning more resources to Spark than Docker had access to.
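As a rough sketch of that fix, assuming the container is capped at around 4 GB (both figures below are illustrative, not recommendations):

from pyspark.sql import SparkSession

# Size the driver below the container's memory limit, leaving headroom
# for the Python process itself and off-heap JVM allocations.
spark = SparkSession.builder \
    .appName('containerized_app') \
    .master('local[*]') \
    .config('spark.driver.memory', '2g') \
    .getOrCreate()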

2 Comments

How do I solve this issue if I'm running it in production? Do I need to flush the RAM and restart the application, or something else?
If you have been able to run the application successfully once, restarting it may help. In my case, I just couldn't get it to run even once; I eventually ended up reducing the Spark driver memory to what could safely fit within the container.
0

I encountered the same problem while working on Colab. I terminated the current session and reconnected, and it worked for me!


0

Maybe the Spark UI port is already occupied, or maybe there are other errors before this one.

Maybe this can help you: https://stackoverflow.com/questions/32820087/spark-multiple-spark-submit-in-parallel

spark-submit --conf spark.ui.port=5051
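From a notebook, the same setting can be passed through the builder; the port number below just mirrors the spark-submit example, and any free port works:

from pyspark.sql import SparkSession

# Bind the Spark UI to an explicit free port instead of the default 4040.
spark = SparkSession.builder \
    .config('spark.ui.port', '5051') \
    .getOrCreate()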

