jmvcoelho/GPTNeoX-160m

jmvcoelho committed on Nov 3, 2024

Commit e3618df · verified · 1 Parent(s): 8ae54ea

Update modeling_custom.py

Files changed (1)

modeling_custom.py +1 -1

@@ -410,7 +410,7 @@ class GPTNeoXFlashAttention2(GPTNeoXAttention):
         attention_dropout = self.config.attention_dropout if self.training else 0.0
-        #TODO: Compute attention
+        #TODO: Compute attention with _flash_attention_forward
         attn_weights = ...
         #TODO: Reshape outputs before projection
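
Reviewer note: the two TODOs in this hunk name a step each, attention computation and an output reshape before the projection. The sketch below is not the author's implementation and does not call `_flash_attention_forward`. It is a dependency-free reference, under the assumption that the attention is causal and dropout is 0.0 (as in the non-training branch above), that a filled-in version could be checked against on tiny inputs. The helper names `causal_attention` and `merge_heads` are hypothetical and do not appear in `modeling_custom.py`.

```python
import math


def causal_attention(q, k, v):
    """Reference causal attention for a single head (hypothetical helper).

    q, k, v: lists of seq_len vectors, each of length head_dim.
    Returns a list of seq_len output vectors of length head_dim.
    """
    head_dim = len(q[0])
    scale = 1.0 / math.sqrt(head_dim)
    out = []
    for i, q_i in enumerate(q):
        # Causal mask: position i only attends to positions 0..i.
        scores = [scale * sum(a * b for a, b in zip(q_i, k[j])) for j in range(i + 1)]
        # Numerically stable softmax over the visible positions.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the visible value vectors.
        out.append([sum(w * v[j][d] for j, w in enumerate(weights)) for d in range(head_dim)])
    return out


def merge_heads(per_head):
    """Concatenate per-head outputs (hypothetical helper).

    per_head: [num_heads][seq_len][head_dim]
    Returns:  [seq_len][num_heads * head_dim], the layout a dense projection consumes.
    """
    seq_len = len(per_head[0])
    return [[x for head in per_head for x in head[t]] for t in range(seq_len)]
```

The first position can only see itself, so its output equals its own value vector; later positions produce convex combinations of the value vectors they can see. `merge_heads` illustrates the kind of reshape the second TODO refers to: per-head outputs are flattened into one hidden-size vector per token before the projection.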