Sigmoid activation on FiLM layer outputs #1

@neilthefrobot

There is a sigmoid activation on gamma and beta in the FiLM layers. Because sigmoid squashes both values into (0, 1), the affine transformation can only shift features in the positive direction, and the scaling is limited to shrinking them: it can never amplify a feature or flip its sign. The paper actually tested different activations on the affine transformation parameters, and they all hurt performance. If you just leave the outputs as they are, without any activation, you should see a significant improvement.
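
To make the range problem concrete, here is a minimal sketch in plain Python. It is not this repo's code: `film`, `squash`, and the sample values are hypothetical names and numbers I made up for illustration, and I am assuming the usual per-channel form `gamma * x + beta`.

```python
import math


def sigmoid(x):
    """Logistic function; output is always strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-x))


def film(features, gamma, beta, squash=False):
    """Per-channel FiLM: gamma[c] * features[c] + beta[c].

    squash=True passes gamma and beta through a sigmoid first, which is the
    setup described in this issue (assumed, not copied from the repo).
    """
    if squash:
        gamma = [sigmoid(g) for g in gamma]
        beta = [sigmoid(b) for b in beta]
    return [g * x + b for x, g, b in zip(features, gamma, beta)]


# Hypothetical values: three channels, each with the same input feature.
features = [2.0, 2.0, 2.0]
raw_gamma = [-1.5, 0.0, 3.0]
raw_beta = [-4.0, 0.0, 4.0]

unconstrained = film(features, raw_gamma, raw_beta)
squashed = film(features, raw_gamma, raw_beta, squash=True)

print(unconstrained)  # [-7.0, 0.0, 10.0]
print(squashed)       # every value stays between 0 and 3
```

With the raw parameters the layer can negate, zero out, or amplify a channel. With the sigmoid, a positive input `x` can only end up somewhere in `(0, x + 1)`, whatever the conditioning network outputs.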
