chore: Add memory reservation debug logging #3489
Open
+161
−3
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Which issue does this PR close?
Part of #3445
Rationale for this change
These changes were originally in #2521, which also included Python scripts for analyzing the data. This PR just adds the logging changes. I am hoping this makes review easier. We can discuss adding analysis scripts in a separate issue or PR in the future.
When debugging memory reservation issues in Comet, it is helpful to see all memory pool interactions (grow, shrink, try_grow) with task IDs and consumer names. This PR adds a lightweight opt-in logging decorator for the native memory pool.
What changes are included in this PR?
spark.comet.debug.memory(default:false)LoggingPoolthat wraps the existing memory pool when the config is enabledgrow,shrink, andtry_growcalls with task attempt ID, consumer name, and result (Ok/Err)Example output:
How are these changes tested?
Manual testing with TPC-DS/TPC-H benchmarks. The logging pool is a thin decorator that delegates all operations to the underlying pool, so it does not change any behavior.
🤖 Generated with Claude Code