Hey!
it looks like there is a mess in the tensor arrangements in a few places here:
```python
q, k, v = rearrange_many((q, k, v), 'b (h d) n -> b h n d', h = self.heads)
```
It does not make sense to go to `'b h n d'` here: the last dim ends up being the channels dim `d`, not the sequence dim.
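A minimal sketch of that shape question, using plain `einops.rearrange` on dummy tensors (all sizes below are assumptions, for illustration only):

```python
import torch
from einops import rearrange

# assumed sizes, for illustration only
b, heads, dim_head, n = 2, 8, 16, 32
x = torch.randn(b, heads * dim_head, n)  # (b, (h d), n), channels-first as in the quoted line

q = rearrange(x, 'b (h d) n -> b h n d', h = heads)
print(q.shape)  # torch.Size([2, 8, 32, 16]) -- last dim is d (channels), not the sequence

# keeping the sequence dim last would leave the tensor conv-friendly:
q_alt = rearrange(x, 'b (h d) n -> b h d n', h = heads)
print(q_alt.shape)  # torch.Size([2, 8, 16, 32]) -- last dim is n (sequence)
```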
Then next:
```python
projs = rearrange_many(projs.split(self.heads // self.groups, dim = 1), 'b h n d -> (b h) n d')
```
we split the heads into groups and merge them into the batch, then try to apply a convolution that expects channels as the second dim, which in our case is now the seq-len dim (see the sketch below).
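A hedged sketch of that mismatch, assuming the depthwise conv follows the standard `nn.Conv1d` convention of `(batch, channels, length)` input; `rearrange_many` is replaced with a plain list comprehension to keep this self-contained:

```python
import torch
from torch import nn
from einops import rearrange

# assumed sizes, for illustration only
b, heads, groups, dim_head, n = 2, 8, 4, 16, 32
projs = torch.randn(b, heads, n, dim_head)  # 'b h n d', as produced above

chunks = projs.split(heads // groups, dim = 1)  # `groups` tensors of shape (b, h/g, n, d)
merged = [rearrange(t, 'b h n d -> (b h) n d') for t in chunks]
print(merged[0].shape)  # torch.Size([4, 32, 16]) -- dim 1 is seq-len n, not channels

conv = nn.Conv1d(dim_head, dim_head, kernel_size = 3, groups = dim_head)
# conv(merged[0]) would raise: the layer expects dim_head (= 16) input channels,
# but dim 1 of the merged tensor is the sequence length (= 32)
```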
There is also:
```python
ds_convs.append(CausalDepthwiseConv1d(inner_dim, kernel_size))
```
which somehow sets up the convolution layer to expect the full `inner_dim` as input channels, even though the grouped tensors above carry fewer channels each.
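To make the channel-count point concrete, here is a hedged sketch; the `CausalDepthwiseConv1d` below is my own stand-in (a left-padded depthwise `nn.Conv1d`), not the repo's actual implementation:

```python
import torch
from torch import nn
import torch.nn.functional as F

class CausalDepthwiseConv1d(nn.Module):
    """Stand-in assumption: depthwise conv with left padding so it stays causal."""
    def __init__(self, dim, kernel_size):
        super().__init__()
        self.pad = kernel_size - 1
        self.conv = nn.Conv1d(dim, dim, kernel_size, groups = dim)

    def forward(self, x):              # x: (batch, channels, seq_len)
        x = F.pad(x, (self.pad, 0))    # pad only on the left
        return self.conv(x)

heads, dim_head, kernel_size = 8, 16, 3
inner_dim = heads * dim_head  # 128

conv_full = CausalDepthwiseConv1d(inner_dim, kernel_size)  # as in the quoted line
conv_head = CausalDepthwiseConv1d(dim_head, kernel_size)   # what a per-group tensor carries

x = torch.randn(4, dim_head, 32)  # one merged group, transposed to channels-first
print(conv_head(x).shape)  # torch.Size([4, 16, 32])
# conv_full(x) would raise: it expects inner_dim (= 128) input channels, not 16
```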