In the case of a learnable codebook, it seems to me that the encoder outputs are not completely detached before the codebook loss is computed: they are still connected to the loss indirectly, via the distance matrix and the quantized vectors. The encoder therefore still accumulates gradients (even though the optimizer in use does not apply them). Shouldn't the encoder outputs also be detached before they are used to compute the distances?
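To make the concern concrete, here is a minimal sketch (assuming a PyTorch-style VQ layer; the names `z_e`, `codebook`, and `beta`-style losses are illustrative, not taken from any particular implementation). The distance matrix is computed from the non-detached encoder outputs, while the codebook loss uses `z_e.detach()`, so one can inspect directly which tensors end up with gradients after `backward()`:

```python
import torch

torch.manual_seed(0)

# Hypothetical encoder outputs and learnable codebook (names are illustrative).
z_e = torch.randn(8, 4, requires_grad=True)        # (N, D) encoder outputs
codebook = torch.nn.Parameter(torch.randn(16, 4))  # (K, D) learnable codes

# Distance matrix between encoder outputs and codebook entries.
# Note: z_e is NOT detached here, as in the setup the question describes.
d = torch.cdist(z_e, codebook)                     # (N, K)
idx = d.argmin(dim=1)                              # nearest-code indices
z_q = codebook[idx]                                # quantized vectors

# Codebook loss: pull selected codes toward the (detached) encoder outputs.
codebook_loss = (z_q - z_e.detach()).pow(2).mean()
codebook_loss.backward()

# Inspect where gradients actually landed. The argmin produces integer
# indices, so the backward pass does not traverse the distance matrix.
print("z_e.grad:", z_e.grad)
print("codebook has grad:", codebook.grad is not None)
```

Running this shows that, in this particular formulation, the codebook receives gradients while `z_e.grad` remains unset, because `argmin` is non-differentiable and the distance matrix `d` is only used to select indices, not as an input to the loss itself. Whether that holds for the implementation the question refers to depends on whether its quantized vectors are produced by hard index selection or by some differentiable function of the distances.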