Enable end2end affine-to-neura lowering #53

ShangkunLi · 2025-06-20T07:00:50Z

In this PR:

Add lowering from the memref and builtin dialects
Add neura.load_indexed/store_indexed operations for memref-like memory access
Fix some typos in files
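To make "memref-like memory access" concrete, here is a minimal sketch of what an indexed load/store means for a buffer with the default row-major layout. This is not the neura implementation: the helper names `linearOffset`, `loadIndexed` and `storeIndexed` are hypothetical, and the sketch assumes an identity (row-major) layout with a known shape.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of neura): row-major linear offset of a
// multi-dimensional index into a buffer with the given shape.
std::size_t linearOffset(const std::vector<std::size_t> &shape,
                         const std::vector<std::size_t> &indices) {
  assert(shape.size() == indices.size());
  std::size_t offset = 0;
  for (std::size_t d = 0; d < shape.size(); ++d) {
    assert(indices[d] < shape[d]);
    // Horner-style accumulation: earlier dimensions vary slowest.
    offset = offset * shape[d] + indices[d];
  }
  return offset;
}

// Models an indexed load: read the element addressed by `indices`.
std::int8_t loadIndexed(const std::vector<std::int8_t> &buffer,
                        const std::vector<std::size_t> &shape,
                        const std::vector<std::size_t> &indices) {
  return buffer[linearOffset(shape, indices)];
}

// Models an indexed store: write `value` to the element addressed by `indices`.
void storeIndexed(std::vector<std::int8_t> &buffer,
                  const std::vector<std::size_t> &shape,
                  const std::vector<std::size_t> &indices,
                  std::int8_t value) {
  buffer[linearOffset(shape, indices)] = value;
}
```

For a shape like 2x1x1x1x1x128 (taking the dynamic dimension as 2 purely for illustration), the index (1, 0, 0, 0, 0, 5) maps to offset 128 + 5 = 133.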

Now we can write code in C++ and lower it to the affine dialect using Polygeist. Higher-level transforms can then be implemented at the affine level, such as polyhedral optimization and loop unrolling, fusion, fission, interchange, tiling, and vectorization.
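As an illustration of the kind of input this flow targets, here is a hypothetical C++ kernel (not taken from this PR) whose loop bounds are constants and whose subscripts are affine in the induction variables. The names `copy_rows`, `kRows` and `kCols` are made up for the example, and whether a given Polygeist version raises it to affine loops depends on its options, which are not shown here.

```cpp
#include <cstdint>

// Hypothetical sizes, chosen only for the example.
constexpr int kRows = 4;
constexpr int kCols = 8;

// Broadcasts one input row into every row of the output.
// Both loops have constant bounds and the accesses in[j] and out[i][j]
// are affine functions of (i, j), which is what affine-level loop
// transforms such as tiling or interchange rely on.
void copy_rows(const std::int8_t in[kCols], std::int8_t out[kRows][kCols]) {
  for (int i = 0; i < kRows; ++i) {
    for (int j = 0; j < kCols; ++j) {
      out[i][j] = in[j];
    }
  }
}
```

The nested loop here has the same shape as the loop in the IR discussed later in this thread: a counted loop whose body does an indexed load followed by an indexed store.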

ShangkunLi · 2025-06-20T12:16:58Z

Sorry, some errors occurred when I tried to transform bert_nodex.mlir into dataflow MLIR.

Will fix it soon.

ShangkunLi · 2025-06-20T13:09:36Z

The problem arises from TransformCtrlToDataFlowPass.cpp incorrectly handling nested loops. I may try to solve it.

The current pass cannot handle a cond_br that does not pass arguments, such as:

^bb3(%8: !neura.data<i64, i1>):  // 2 preds: ^bb2, ^bb4
    %9 = "neura.cast"(%8) <{cast_type = "int_to_index"}> : (!neura.data<i64, i1>) -> !neura.data<index, i1>
    %10 = "neura.icmp"(%9, %1) <{cmpType = "slt"}> : (!neura.data<index, i1>, !neura.data<index, i1>) -> !neura.data<i1, i1>
    neura.cond_br %10 : !neura.data<i1, i1> then to ^bb4 else to ^bb5
^bb4:  // pred: ^bb3
    %11 = neura.load_indexed %arg0[%2, %2, %2, %2, %2, %9 : !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>] memref<?x1x1x1x1x128xi8> : !neura.data<i8, i1>
    neura.store_indexed %11 to %arg1[%2, %2, %5, %2, %2, %9 : !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>, !neura.data<index, i1>] memref<?x1x128x1x1x128xi8> : !neura.data<i8, i1>
    %12 = "neura.add"(%9, %0) : (!neura.data<index, i1>, !neura.data<index, i1>) -> !neura.data<index, i1>
    %13 = "neura.cast"(%12) <{cast_type = "index_to_int"}> : (!neura.data<index, i1>) -> !neura.data<i64, i1>
    neura.br %13 : !neura.data<i64, i1> to ^bb3

The errors occur when inserting reserve and ctrl_mov for the nested loop.

lib/Conversion/ArithToNeura/ArithToNeuraPass.cpp

lib/Conversion/BuiltinToNeura/BuiltinToNeuraPass.cpp

lib/Conversion/MemRefToNeura/MemRefToNeuraPass.cpp

test/affine2neura/bert/bert_node0/bert_node0.mlir

tancheng · 2025-06-20T15:09:28Z

Thanks @ShangkunLi. I didn't handle this case because GPT/Gemini told me that in MLIR a basic block only has its block arguments as live-ins (rather than directly using values defined in other blocks), but that does not always hold. Do you want to fix this in this PR or later? I don't see --transform-ctrl-to-data being used in bert_node0.mlir?
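To make the live-in point concrete: in MLIR a block may use any value whose definition dominates it, so a block can have live-ins that are not block arguments. The sketch below is a toy model, not the MLIR or neura API. The `Op` and `Block` structs and `implicitLiveIns` are hypothetical, it works on value names only, and it ignores dominance checks and nested regions. It lists the values that ^bb4 from the IR quoted in this thread uses without defining them or receiving them as arguments.

```cpp
#include <set>
#include <string>
#include <vector>

// Toy model of an operation: names of the values it defines and uses.
struct Op {
  std::vector<std::string> results;
  std::vector<std::string> operands;
};

// Toy model of a basic block (hypothetical, not the MLIR API).
struct Block {
  std::vector<std::string> arguments;
  std::vector<Op> ops;
};

// Returns the values a block uses but neither defines nor receives as a
// block argument, i.e. its implicit live-ins.
std::set<std::string> implicitLiveIns(const Block &block) {
  std::set<std::string> defined(block.arguments.begin(),
                                block.arguments.end());
  std::set<std::string> liveIns;
  for (const Op &op : block.ops) {
    for (const std::string &operand : op.operands)
      if (!defined.count(operand))
        liveIns.insert(operand);
    for (const std::string &result : op.results)
      defined.insert(result);
  }
  return liveIns;
}

// ^bb4 from the quoted IR, reduced to value names. It has no arguments.
Block makeBb4() {
  Block bb4;
  bb4.ops = {
      {{"%11"}, {"%arg0", "%2", "%2", "%2", "%2", "%2", "%9"}},    // load_indexed
      {{}, {"%11", "%arg1", "%2", "%2", "%5", "%2", "%2", "%9"}},  // store_indexed
      {{"%12"}, {"%9", "%0"}},                                     // add
      {{"%13"}, {"%12"}},                                          // cast
      {{}, {"%13"}},                                               // br
  };
  return bb4;
}
```

For this block the result is %0, %2, %5, %9, %arg0 and %arg1: six values that reach ^bb4 without being passed by the cond_br in ^bb3, which is the case the current pass does not cover.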

ShangkunLi · 2025-06-20T15:16:54Z

Filed issue #54. I may try to fix this in the next PR. For this PR, I only tested these lowering patterns.

tancheng

Thanks for the PR. Let's wait for the GitHub Actions check to pass before merging :-)

add memref/builtin lowering

1e2f469

ShangkunLi marked this pull request as ready for review June 20, 2025 08:55

ShangkunLi requested a review from tancheng June 20, 2025 08:55

ShangkunLi mentioned this pull request Jun 20, 2025

[P1] Fix the problem when lowering nested loop into data flow #54

Closed

[fix] change the assembly format of ops and enbale predicated load/store_indexed

9ef9c4c

tancheng reviewed Jun 20, 2025

tancheng assigned Yfeng-44, HobbitQia, ShangkunLi, YanzhouTang and MeowMJ and unassigned Yfeng-44, HobbitQia, YanzhouTang and MeowMJ Jun 20, 2025

tancheng requested review from YanzhouTang, Yfeng-44 and yuqisun June 20, 2025 15:10

tancheng added the "new feature" label Jun 20, 2025

ShangkunLi and others added 2 commits June 21, 2025 00:05

Merge branch 'coredac:main' into memref-builtin-lower

6d64e84

[fix] fix some typos

93620bc

tancheng approved these changes Jun 20, 2025

ShangkunLi merged commit 805548c into coredac:main Jun 20, 2025
1 check passed

ShangkunLi mentioned this pull request Jul 14, 2025

[P1] Fuse some non-sense cast operations #77

Closed
