[SPARK-52897][PYTHON] Update pandas to 2.3.1 #51579

Closed
wants to merge 4 commits into from

Conversation

bjornjorgensen
Contributor

What changes were proposed in this pull request?

Upgrading from pandas==2.3.0 to pandas==2.3.1

Why are the changes needed?

Release Github

Release notes

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Pass GA

Was this patch authored or co-authored using generative AI tooling?

No.

@bjornjorgensen bjornjorgensen changed the title [SPARK-XXXX][PYTHON] Update pandas to 2.3.1 [SPARK-52897[PYTHON] Update pandas to 2.3.1 Jul 20, 2025
@HyukjinKwon HyukjinKwon left a comment

@bjornjorgensen
Contributor Author

@HyukjinKwon is there another place that we need to update?

I get this now:


*****************************
* Building Python API docs. *
*****************************
Running Sphinx v4.5.0
/__w/spark/spark/python/pyspark/pandas/__init__.py:43: UserWarning: 'PYARROW_IGNORE_TIMEZONE' environment variable was not set. It is required to set this environment variable to '1' in both driver and executor sides if you use pyarrow>=2.0.0. pandas-on-Spark will set it for you but it does not work if there is a Spark context already launched.
  warnings.warn(
/__w/spark/spark/python/pyspark/pandas/supported_api_gen.py:117: UserWarning: Warning: pandas 2.3.1 is required; your version is 2.3.0
  warnings.warn(msg, UserWarning)

Configuration error:
There is a programmable error in your configuration file:

Traceback (most recent call last):
  File "/usr/local/lib/python3.11/dist-packages/sphinx/config.py", line 332, in eval_config_file
    exec(code, namespace)
  File "/__w/spark/spark/python/docs/source/conf.py", line 33, in <module>
    generate_supported_api(output_rst_file_path)
  File "/__w/spark/spark/python/pyspark/pandas/supported_api_gen.py", line 102, in generate_supported_api
    _check_pandas_version()
  File "/__w/spark/spark/python/pyspark/pandas/supported_api_gen.py", line 118, in _check_pandas_version
    raise ImportError(msg)
ImportError: Warning: pandas 2.3.1 is required; your version is 2.3.0
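
For readers following the traceback, here is a minimal sketch of the kind of guard that produces this warning followed by an `ImportError`. It is not the actual code in `supported_api_gen.py`: the constant `REQUIRED_PANDAS_VERSION`, the helper `parse_version`, and the function signature are all assumptions made for illustration, and the comparison only handles plain numeric `X.Y.Z` versions.

```python
import warnings

# Hypothetical constant; the real module may name and store this differently.
REQUIRED_PANDAS_VERSION = "2.3.1"


def parse_version(version: str) -> tuple:
    """Turn a plain numeric 'X.Y.Z' string into a comparable tuple of ints."""
    return tuple(int(part) for part in version.split(".")[:3])


def check_pandas_version(installed: str, required: str = REQUIRED_PANDAS_VERSION) -> None:
    """Warn and then raise ImportError if the installed version is older than required."""
    if parse_version(installed) < parse_version(required):
        msg = f"Warning: pandas {required} is required; your version is {installed}"
        warnings.warn(msg, UserWarning)
        raise ImportError(msg)
```

Under these assumptions, calling `check_pandas_version("2.3.0")` raises the same message seen in the log, which is consistent with the docs image still having pandas 2.3.0 installed while the code expected 2.3.1.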

@@ -91,10 +91,10 @@ RUN mkdir -p /usr/local/pypy/pypy3.10 && \
 ln -sf /usr/local/pypy/pypy3.10/bin/pypy /usr/local/bin/pypy3.10 && \
 ln -sf /usr/local/pypy/pypy3.10/bin/pypy /usr/local/bin/pypy3
 RUN curl -sS https://bootstrap.pypa.io/get-pip.py | pypy3
-RUN pypy3 -m pip install numpy 'six==1.16.0' 'pandas==2.3.0' scipy coverage matplotlib lxml
+RUN pypy3 -m pip install numpy 'six==1.16.0' 'pandas==2.3.1' scipy coverage matplotlib lxml
Contributor

This file is only used for old branches, so let's skip it.

Contributor Author

OK, I changed all pandas==2.3.1 back to pandas==2.3.0 in this file.

@bjornjorgensen
Contributor Author

@zhengruifeng I updated the pandas version in dev/spark-test-image/docs/Dockerfile and now all tests pass.
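
Since the failure came from one image still pinning the old version, a small script can help spot leftover pins before CI does. The sketch below is only an illustration: `find_pandas_pins` and `stale_pins` are hypothetical helpers, not part of the Spark repository, and the regular expression assumes pins are written as `pandas==<version>`.

```python
import re
from typing import List

# Match 'pandas==<version>', but not longer package names such as 'geopandas==...'.
_PANDAS_PIN = re.compile(r"(?<![\w-])pandas==([0-9][0-9A-Za-z.]*)")


def find_pandas_pins(text: str) -> List[str]:
    """Return every pinned pandas version found in the given file contents."""
    return _PANDAS_PIN.findall(text)


def stale_pins(text: str, expected: str) -> List[str]:
    """Return the pinned versions that differ from the expected one."""
    return [version for version in find_pandas_pins(text) if version != expected]
```

For example, passing the contents of a Dockerfile and `"2.3.1"` to `stale_pins` would list any pins still at another version. Files that are intentionally left on the old version, such as the one skipped in the review above, would need to be excluded by the caller.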

@xinrong-meng xinrong-meng changed the title [SPARK-52897[PYTHON] Update pandas to 2.3.1 [SPARK-52897][PYTHON] Update pandas to 2.3.1 Jul 22, 2025
@the-sakthi
Member

LGTM!

@yaooqinn yaooqinn closed this in 8b889ce Jul 24, 2025
@yaooqinn
Member

Merged to master. Thank you, @bjornjorgensen and all.

@bjornjorgensen bjornjorgensen deleted the pandas2_3-1 branch July 24, 2025 08:08

6 participants