@ChaoWao
Owner

@ChaoWao ChaoWao commented Feb 11, 2026

Port paged attention from host_build_graph to aicpu_build_graph, where the AICPU device builds the task graph through a dlopen'd orchestration plugin and graph building runs concurrently with scheduling (build||schedule). The kernels are identical; only the orchestration and kernel_config differ.
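
To illustrate the concurrent build||schedule idea, here is a minimal host-side Python sketch of the pattern only. It is not the code in this PR: the real orchestration runs on the AICPU inside a dlopen'd plugin, which is not shown. All names (`Task`, `TaskGraph`, `build`, `schedule`, `run`) and the per-block task names `qk`, `softmax`, `pv` are hypothetical and chosen for illustration. The sketch assumes a single scheduler thread that dispatches a task as soon as its dependencies have completed, while a builder thread is still appending tasks.

```python
import threading
from dataclasses import dataclass


@dataclass
class Task:
    name: str
    deps: tuple = ()  # names of tasks that must complete first


class TaskGraph:
    """Shared graph (hypothetical): the builder appends, the scheduler drains."""

    def __init__(self):
        self._cond = threading.Condition()
        self._pending = []    # submitted but not yet dispatched
        self._done = set()    # names of completed tasks
        self._sealed = False  # True once the builder has finished

    def submit(self, task):
        with self._cond:
            self._pending.append(task)
            self._cond.notify_all()

    def seal(self):
        with self._cond:
            self._sealed = True
            self._cond.notify_all()

    def next_ready(self):
        """Block until a task is ready; return None once the graph is drained."""
        with self._cond:
            while True:
                for i, task in enumerate(self._pending):
                    if all(d in self._done for d in task.deps):
                        return self._pending.pop(i)
                if self._sealed:
                    if not self._pending:
                        return None
                    # Single scheduler, nothing in flight: deps can never resolve.
                    raise RuntimeError("unsatisfiable dependency in task graph")
                self._cond.wait()

    def complete(self, task):
        with self._cond:
            self._done.add(task.name)
            self._cond.notify_all()


def build(graph, num_blocks):
    # Assumed per-block chain; the real kernel set is defined by the PR, not here.
    for i in range(num_blocks):
        graph.submit(Task(f"qk{i}"))
        graph.submit(Task(f"softmax{i}", deps=(f"qk{i}",)))
        graph.submit(Task(f"pv{i}", deps=(f"softmax{i}",)))
    graph.seal()


def schedule(graph, order):
    while (task := graph.next_ready()) is not None:
        order.append(task.name)  # stand-in for launching a kernel
        graph.complete(task)


def run(num_blocks):
    graph = TaskGraph()
    order = []
    scheduler = threading.Thread(target=schedule, args=(graph, order))
    builder = threading.Thread(target=build, args=(graph, num_blocks))
    scheduler.start()  # scheduling starts before the graph is complete
    builder.start()
    builder.join()
    scheduler.join()
    return order
```

Calling `run(4)` returns the dispatch order for four blocks. The scheduler never waits for the full graph; it only waits when no submitted task is ready yet, which is the property the build||schedule overlap relies on.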

Tested: Small/Case1/Case2 all pass on a2a3 device.

Co-Authored-By: Claude Opus 4.6 <[email protected]>
@ChaoWao ChaoWao merged commit 5334d3d into main Feb 11, 2026
3 checks passed
@ChaoWao ChaoWao deleted the fix-page-attention-aicpu branch February 11, 2026 03:42