Why were the self-attention layers removed from Janner's UNet?

I was looking at the modifications made to Janner's UNet to implement FiLM conditioning. One thing I noticed was that the self-attention layers were removed. Compare for instance these:
1) https://github.com/jannerm/diffuser/blob/7ea422860cc0106e5ca5949d980f04b799d5462c/diffuser/models/temporal.py#L85
2) https://github.com/real-stanford/diffusion_policy/blob/5ba07ac6661db573af695b419a7947ecb704690f/diffusion_policy/model/diffusion/conditional_unet1d.py#L140

Apart from the missing self-attention and the introduction of FiLM conditioning, everything else is identical. Was there any reason for this design choice? Does the self-attention mess up the FiLM conditioning in the conv-1d layers?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why were the self-attention layers removed from Janner's UNet? #147

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Why were the self-attention layers removed from Janner's UNet? #147

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions