
Conversation

@Paul1365972
Contributor

This PR introduces several optimizations and refactoring improvements to the redpiler backend:

Changes:

  • Optimize flush performance: track changed nodes in a separate vector to avoid unnecessarily iterating through the entire nodes array during flush operations
  • Extract the tick scheduler and the events queue into a dedicated ExecutionContext (which also contains the new changed-nodes vec)
  • Pin compiler options at the start of compilation. This prevents io_only mode from being switched mid-execution; we never used or really supported this anyway, so it should be fine
  • I also noticed that the Node layout was a bit odd: we were wasting 18 bytes on padding. By reducing the updates inline vec from 10 to 9 entries we save 16 bytes (the alternative being to add 3 entries for free); a way to check this is sketched right after this list.
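As a rough illustration of the last point, struct sizes can be checked with std::mem::size_of; the struct below is a made-up stand-in (the field names and the u64 item type are assumptions), not the real Node layout:

```rust
// Illustrative only: a stand-in struct showing how the inline capacity of a
// SmallVec field affects the overall struct size (requires the `smallvec` crate).
use smallvec::SmallVec;
use std::mem::size_of;

struct NodeWith10 {
    updates: SmallVec<[u64; 10]>,
    output_power: u8,
    pending_tick: bool,
}

struct NodeWith9 {
    updates: SmallVec<[u64; 9]>,
    output_power: u8,
    pending_tick: bool,
}

fn main() {
    // Print both sizes to see how much of the difference is eaten by padding.
    println!("10 inline entries: {} bytes", size_of::<NodeWith10>());
    println!(" 9 inline entries: {} bytes", size_of::<NodeWith9>());
}
```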

The flush optimization reduces overhead by only processing nodes that have actually changed, which should improve performance especially for large redstone circuits.
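To make the changed-nodes tracking concrete, here is a minimal sketch; the field names, the shape of ExecutionContext, and the sync callback are assumptions for illustration and do not match the actual redpiler structs:

```rust
// Minimal sketch of the changed-nodes tracking described above.
// All names and fields here are illustrative, not the real redpiler code.
#[derive(Clone)]
struct Node {
    powered: bool,
    changed: bool, // set when the node has an unsynced state change
}

#[derive(Default)]
struct ExecutionContext {
    // Indices of nodes whose state changed since the last flush.
    changed_nodes: Vec<usize>,
}

struct Backend {
    nodes: Vec<Node>,
    context: ExecutionContext,
}

impl Backend {
    fn set_powered(&mut self, idx: usize, powered: bool) {
        let node = &mut self.nodes[idx];
        if node.powered != powered {
            node.powered = powered;
            // Record each node at most once per flush interval.
            if !node.changed {
                node.changed = true;
                self.context.changed_nodes.push(idx);
            }
        }
    }

    fn flush(&mut self, mut sync: impl FnMut(usize, bool)) {
        // Only visit the nodes that actually changed instead of scanning
        // the whole `nodes` array.
        for idx in self.context.changed_nodes.drain(..) {
            let node = &mut self.nodes[idx];
            node.changed = false;
            sync(idx, node.powered);
        }
    }
}

fn main() {
    let mut backend = Backend {
        nodes: vec![Node { powered: false, changed: false }; 3],
        context: ExecutionContext::default(),
    };
    backend.set_powered(1, true);
    backend.flush(|idx, powered| println!("node {idx} -> powered = {powered}"));
}
```

The key point is that the flush cost now scales with the number of nodes that changed per interval rather than with the total node count.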

We also still have the bug where the reset function does not properly sync every component's block state.
To further optimize it, I also decided to skip syncing the comparators' block entities, as we weren't doing that properly at the moment anyway.
This should probably be addressed in a different PR.

@BramOtte
Contributor

BramOtte commented Oct 6, 2025

From a few quick tests on my laptop, this seems to have performance comparable to my implementation (https://github.com/BramOtte/MCHPRS/tree/changed-vec),
but it is implemented a bit more cleanly and also tidies up some other parts.

I ran iris mandelbrot at 50'000 ticks per flush and got these results:

master run 1: DONE in 49.558927159s processing 135000000 ticks, effective rtps: 2724029.912247278
master run 2: DONE in 49.795871091s processing 135000000 ticks, effective rtps: 2711068.1476641465
paul   run 1: DONE in 45.279627796s processing 135000000 ticks, effective rtps: 2981473.2711192006
paul   run 2: DONE in 45.881503924s processing 135000000 ticks, effective rtps: 2942362.1384255304
bram   run 1: DONE in 45.920522083s processing 135000000 ticks, effective rtps: 2939862.0459059994
bram   run 2: DONE in 45.068884875s processing 135000000 ticks, effective rtps: 2995414.694071682
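(For reference, the effective rtps figure is just the tick count divided by the wall time, e.g. 135000000 / 49.559 s ≈ 2724030 for the first master run.)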

@Paul1365972
Contributor Author

Thanks for testing. I still haven't set up my benchmark environment again, so I just prayed my changes would work as well as yours did. Someone should also probably verify what impact the inline vec size has, although I feel like this was done already (it should probably be its own PR, if I'm being honest).

@Paul1365972
Contributor Author

I decided it's a good idea to separate this and will open a separate PR for the node struct change, although it will cause a merge conflict as far as I can see. Just ping me and I will rebase either one.

@Paul1365972
Contributor Author

Paul1365972 commented Oct 22, 2025

Some ideas for changes:

  1. Rename ExecutionContext to something more meaningful
  2. We currently run self.redpiler.flush(&mut self.world) directly after self.tickn(batch_size as u64) (Reference). Would it make sense (and not introduce bugs/regressions) to flush only just before we send the data to the player, according to the world send rate schedule? This could potentially be a big improvement in certain cases (a rough sketch is below, after the edit).

EDIT: One reason idea 2 is questionable is that we can imagine a future backend that runs ticks asynchronously (for example a multi-threaded or FPGA-based one). The change would then remove our "sync point" where we ensure the ticks actually ran, and it would mess with our ability to measure how long it took to execute those ticks. Although there must be a better way to handle this.
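To make idea 2 concrete, here is a rough sketch of what deferring the flush to the world-send schedule might look like; the last_send/send_interval bookkeeping and the method names are invented for illustration and do not match the actual plot tick loop:

```rust
use std::time::{Duration, Instant};

// Rough sketch of idea 2: flush only when a world send is due instead of
// after every tick batch. All names here are illustrative.
struct TickLoop {
    last_send: Instant,
    send_interval: Duration,
}

impl TickLoop {
    fn run_batch(&mut self, batch_size: u64) {
        self.tickn(batch_size);
        // Instead of flushing unconditionally after every batch, only flush
        // when the next world send to players is due.
        if self.last_send.elapsed() >= self.send_interval {
            self.flush_to_world();
            self.send_to_players();
            self.last_send = Instant::now();
        }
    }

    fn tickn(&mut self, _n: u64) { /* run redpiler ticks */ }
    fn flush_to_world(&mut self) { /* redpiler.flush(&mut world) */ }
    fn send_to_players(&mut self) { /* send block/chunk updates */ }
}
```

This only illustrates the control-flow change; it does not address the sync-point concern from the edit above.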
