[SPARK-54276][BUILD] Bump Hadoop 3.4.3#54029
Conversation
JIRA Issue Information=== Sub-task SPARK-54276 === This comment was automatically generated by GitHub Actions |
|
Nice! Thank you, @pan3793 . |
|
this is actually caused by my local maven repo dirty cache, sorry for making noise, the jars in the staging repo are good. |
|
@pan3793 sometimes it's good to rm -r all of ~/m2/repository/org/apache/hadoop (or any other project you actively work on). Saves disk space, even if your next few builds are slow. |
|
@pan3793 thanks for testing this. |
|
@steveloughran, thanks for tips, yes, I fixed it by For integration tests, I don't see any issue with default JDK 17, and I'm trying with JDK 25, so far, no issues are related to Hadoop. |
|
Looks like all failed tests with Java 25 already have solutions or are easy to fix, except for datasketches-java 6.2.0 - it does not work with Java 25, upgrading involves API changes, which breaks the compile, opened apache/datasketches-memory#270, and hope that datasketches-memory 3.0.2 can have a new patch version to solve the Java 25 compatibility issues. |
|
Thank you for pinging me, @steveloughran , and sorry for the late reply. I was traveling from South Korea to USA last weekend . I'm going to take a look at this PR. I don't think there is an Hadoop issue here. It seems that @pan3793 just wanted to verify the result on Java 25. The
|
|
@dongjoon-hyun, let me revert unrelated changes an keep this a simple Hadoop version upgrade, and I will open a new draft PR for Java 25 integration. BTW, I think I already have a solution for |
|
Thank you always, @pan3793 . |
|
Although the failures seem flaky ones, could you re-run the failed test pipelines to make it sure, @pan3793 ? |
|
Hi, @pan3793 . I sent you an email (chengpan@apache.org). |
|
thank you, @dongjoon-hyun, it's really a great news! |
|
Oh, my bad. I mistakenly send you an PMC template. It should be an Apache Spark Commiter invitation. Let me send out once more a correct one for the official committment. Very sorry, @pan3793 ~ |
|
Definitely, I'll help you to the member of PMC later. But you know that it should start from the committer first. |
|
I sent a new one to chengpan@apache.org . Could you please accept once more in the correctly email, @pan3793 ? |
|
@dongjoon-hyun, I have replied to the email. Thank you again. |
|
Now, I added you to the committer list. Please check your Whimsy, @pan3793 . It's my pleasure to cowork with you in the community.
|
|
It's announced too at BTW, do you have an LinkedIn account, @pan3793 ? |
|
@dongjoon-hyun, thanks! I'm not active on LinkedIn |
|
Got it. No problem~ |
|
there's a new RC out now; maven staging repo is |
|
@steveloughran, thanks for the information, I found it and have updated here to use it a few days ago, so far, the test results look good. but I didn't find the vote mail in common-dev_at_hadoop, am I missed something? |
|
Thank you, @steveloughran . The following seems to be the new RC1 email, @pan3793 . https://lists.apache.org/thread/pwntvvrxc6vb5sod74qmsjtb9wq0cn18 |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
Could you use the official Apache Hadoop 3.4.3 since the vote succeeded, @pan3793 ?
|
@dongjoon-hyun, I see, but it seems the jars are not available on Maven Central yet, I'm waiting for that. |
|
Oh, ya. It's not synced yet. Thanks for checking. BTW, for Java 25, we need Apache Hadoop 3.5.0 still for your HADOOP-19821, right? |
|
@dongjoon-hyun, I can't say full Java 25 support, but Spark is already able to bootstrap and pass GHA (there are some issues unrelated to Hadoop need to fix though) with Hadoop 3.4.3 with Java 25. |
|
Now, it's ready. $ curl -I https://maven-central.storage-download.googleapis.com/maven2/org/apache/hadoop/hadoop-client-api/3.4.3/hadoop-client-api-3.4.3.pom
HTTP/2 200
... |
|
@dongjoon-hyun, I contacted the ASF infra team, and it seems they fixed the Maven sync issue. Removed the staging repo and rebased on the latest master. Now we just need to wait for CI pass (it should) |
|
CI is green now, it's ready to go. |
dongjoon-hyun
left a comment
There was a problem hiding this comment.
+1, LGTM. Thank you for working on this and collaborating the Apache Hadoop community, @pan3793 .
|
Merged to master for Apache Spark 4.2.0. I hope this unblocks the previous items. |

What changes were proposed in this pull request?
Upgrade Hadoop dependency to 3.4.3.
Why are the changes needed?
This release includes HADOOP-19212, which makes UGI work with Java 25.
https://hadoop.apache.org/release/3.4.3.html
Does this PR introduce any user-facing change?
No.
How was this patch tested?
Pass CI. Also verified
spark-sqlcan successfully bootstrap on JDK 25 nowWas this patch authored or co-authored using generative AI tooling?
No.