[SPARK-55663][PYTHON] Unify __module__ for data source functions #54457

Open
gaogaotiantian wants to merge 3 commits into apache:master from gaogaotiantian:unify-module-name

@gaogaotiantian
Contributor

What changes were proposed in this pull request?

Always set __module__ to a meaningful value for data source functions and workers.

Why are the changes needed?

The data source profiler depends on the module name of the worker or function. When invoked with the simple worker, __module__ is __main__, which is not informative enough for the profiler. We should avoid leaving it as __main__.
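
To illustrate the problem, here is a minimal sketch in plain Python. It is not the code in this patch: `read_partition`, `profile_key`, `with_module`, and the module name `example_pkg.datasource_worker` are all hypothetical names chosen for the example. It only shows that a key derived from `__module__` is uninformative when the value is `__main__`, and that assigning a meaningful value fixes it.

```python
def read_partition():
    """Stand-in for a data source function executed by a worker (hypothetical)."""
    return [1, 2, 3]


def profile_key(func):
    # Hypothetical profiler key of the form "<module>.<name>".
    return f"{func.__module__}.{func.__name__}"


def with_module(func, module_name):
    # Function objects allow __module__ to be reassigned.
    func.__module__ = module_name
    return func


# Simulate a function defined in a script that was run as the entry point.
read_partition.__module__ = "__main__"
before = profile_key(read_partition)

# Assign a meaningful module name (an assumed value, for illustration only).
with_module(read_partition, "example_pkg.datasource_worker")
after = profile_key(read_partition)

print(before)
print(after)
```

With `__main__`, every function run through a script entry point shares the same module prefix, so a profiler that groups or filters by module cannot tell which worker it came from.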

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Tested locally: with the simple worker, the profiler recognizes the module name.

Was this patch authored or co-authored using generative AI tooling?

No.

2 participants