@bdunahu commented Aug 6, 2025

Currently, it is impossible for Scalene to report over 100% for a program. Native code that executes in parallel is still attributed to a single line of Python, and even if that were not the case (the same applies to the multiprocessing library), samples can only be accounted for if they appear in the new_frames variable at each sampling point. Each of those frames is currently treated as a separate thread and assigned a normalized share of the elapsed time based on how many frames there are.
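
Roughly speaking, that attribution looks like the following sketch (illustrative only, not Scalene's actual code; `attribute_sample` and its arguments are hypothetical names):

```python
# Illustrative sketch of per-sample attribution (not Scalene's actual code).
# Every sampled frame is treated as a separate thread and receives an equal,
# normalized share of the elapsed interval, so the shares can never sum to
# more than the interval itself -- hence the 100% ceiling.
def attribute_sample(new_frames, elapsed):
    if not new_frames:
        return {}
    share = elapsed / len(new_frames)
    return {frame: share for frame in new_frames}
```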

This makes the reporting problem a separate issue: it would be wrong to add the idle frames to new_frames, because that would imply they block the Python interpreter from doing anything else while they are waiting.

When an asynchronous task is suspended, we instead assume it is waiting for the entire sampling interval. The solution this PR implements for the reporting problem is to treat idle tasks as if they ran sequentially after the non-waiting code, one after the other (i.e., nothing truly happens in parallel). This means the total elapsed CPU time is adjusted to match at every sample.
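
One way to read that accounting, as a hypothetical sketch (`adjust_elapsed` and its arguments are illustrative names, not the PR's actual code):

```python
# Hypothetical sketch of the sequential-idle accounting described above.
# Each suspended task is assumed to wait for the full sampling interval, and
# those waits are stacked one after another behind the non-waiting code, so
# the total time a sample accounts for grows with the number of idle tasks.
def adjust_elapsed(elapsed, idle_task_frames):
    return elapsed + elapsed * len(idle_task_frames)
```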

Because asynchronous tasks run regardless of what the GIL is doing, the results are usually biased towards asynchronous code; even so, this leads to the behavior most users would likely expect.

Current state:

  • idle-task frame collection logic is implemented in scalene_asyncio (see the sketch after this list)
  • it is possible to prevent time from being assigned to the asyncio event loop by filtering out frames that belong to a thread which is running an event loop but has no current task. However, I do not do this, because throwing out frames complicates the reporting approach described above.
  • the passing of Scalene.should_trace is hacky. This function is also passed to scalene_utility.add_stack; could the implementation be moved to the utility file?
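
For reference, idle-task frame collection along these lines might look like the following sketch (the real logic lives in scalene_asyncio; `collect_idle_task_frames` and its structure here are assumptions, not the PR's code):

```python
# Hypothetical sketch: collect the suspension frame of every task on a loop
# that is not currently running and not yet finished.
import asyncio

def collect_idle_task_frames(loop):
    frames = []
    current = asyncio.current_task(loop)      # task running on this loop, if any
    for task in asyncio.all_tasks(loop):
        if task is current or task.done():
            continue                          # skip the active task and finished ones
        stack = task.get_stack(limit=1)       # frame where the task is suspended
        if stack:
            frames.append(stack[0])
    return frames
```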

@jaltmayerpizzorno (Collaborator) left a comment

Preliminary review while I wait for an answer to a question I posed on Slack.

@emeryberger (Member) commented

With the recent refactoring of scalene_profiler.py, this is going to take some work to bring up to date.

@bdunahu (Author) commented Dec 22, 2025

> With the recent refactoring of scalene_profiler.py, this is going to take some work to bring up to date.

Is there room for improving how Scalene reports asyncio code? Attributing CPU time to event loop internals is not very useful to the user, but that is where the CPU actually spends time when the event loop sits in the select call with nothing to do (#805 shows how even correct results are still unintuitive).
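
As an illustration of the kind of program involved (this example is mine, not taken from #805):

```python
# A program that is almost entirely idle: while main() awaits, the event loop
# blocks inside its selector's select() call, so CPU-time attribution points
# at event loop internals rather than at this line of user code.
import asyncio

async def main():
    await asyncio.sleep(5)  # 5 seconds of "asynchronous time" on this line

asyncio.run(main())
```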

There didn't seem to be an easy way to work 'asynchronous time' into the profile results the way other profilers do, unless we are considering adding a new column/flag? If not, this pull request can probably be closed.
