`GPT2Attention()` class with `_attn()` method when `add_cross_attention=True` and therefore `is_cross_attention=True`. #35430

CHLEE-Leo · 2024-12-27T04:54:13Z

Feature request

Model description

It seems like GPT2Attention() class allows_attn() method with causal_mask only when is_cross_attention=False, but not when is_cross_attention=True.

It would be more productive if GPT2Attention() supports _attn() method with causal_mask even with is_cross_attention=True.

Motivation

When developing EncoderDecoderModel where the encoder is ViTModel and the decoder isGPT2Model, the current GPT2Model class does not support causal_mask if add_cross_attention=True and therefore is_cross_attention=True.

Your contribution

No contribution.

The text was updated successfully, but these errors were encountered:

CHLEE-Leo added the Feature request Request for a new feature label Dec 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`GPT2Attention()` class with `_attn()` method when `add_cross_attention=True` and therefore `is_cross_attention=True`. #35430

`GPT2Attention()` class with `_attn()` method when `add_cross_attention=True` and therefore `is_cross_attention=True`. #35430

CHLEE-Leo commented Dec 27, 2024 •

edited

Loading

GPT2Attention() class with _attn() method when add_cross_attention=True and therefore is_cross_attention=True. #35430

GPT2Attention() class with _attn() method when add_cross_attention=True and therefore is_cross_attention=True. #35430

Comments

CHLEE-Leo commented Dec 27, 2024 • edited Loading

Feature request

Model description

Motivation

Your contribution

`GPT2Attention()` class with `_attn()` method when `add_cross_attention=True` and therefore `is_cross_attention=True`. #35430

`GPT2Attention()` class with `_attn()` method when `add_cross_attention=True` and therefore `is_cross_attention=True`. #35430

CHLEE-Leo commented Dec 27, 2024 •

edited

Loading