Skip to content

Conversation

@sfc-gh-jkew
Copy link
Contributor

@sfc-gh-jkew sfc-gh-jkew commented Sep 11, 2025

Addresses a performance issue with hybrid execution where a sequence of merge operations using the same DataFrames could result in transfer thrashing. In this PR we have the arguments of the operation move in-place so that subsequent operations all stay on the same backend. This behavior can be turned off by setting the BackendMergeCastInPlace variable to false.

  • first commit message and PR title follow format outlined here

    NOTE: If you edit the PR title to match this format, you need to add another commit (even if it's empty) or amend your last commit for the CI job that checks the PR title to pick up the new PR title.

  • passes flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
  • passes black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
  • signed commit with git commit -s
  • Resolves Convert the QueryCompilerCaster to In-Place Casting to Reduce Data Migration Churn #7676
  • tests added and passing
  • module layout described at docs/development/architecture.rst is up-to-date

@sfc-gh-jkew sfc-gh-jkew force-pushed the jkew/hybrid-merge-in-place branch from 1df6492 to 55141ee Compare September 19, 2025 23:25
@sfc-gh-jkew sfc-gh-jkew changed the title WIP in-place casting between dataframe engines FEAT-#7676: in-place casting between dataframe engines Sep 22, 2025
@sfc-gh-jkew sfc-gh-jkew changed the title FEAT-#7676: in-place casting between dataframe engines FEAT-#7676: in-place casting between DataFrame engines Sep 22, 2025
@sfc-gh-jkew sfc-gh-jkew marked this pull request as ready for review September 22, 2025 23:44
Copy link
Contributor

@sfc-gh-joshi sfc-gh-joshi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few more minor nits, LGTM after they're addressed.

sfc-gh-jkew and others added 12 commits September 25, 2025 14:40
@sfc-gh-jkew sfc-gh-jkew merged commit 6e861d0 into main Sep 26, 2025
75 of 76 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Convert the QueryCompilerCaster to In-Place Casting to Reduce Data Migration Churn

4 participants