support use_flash_attn in from_pretrained

#2

This adds a shortcut to enable flash attention and xformer attention.

michael-guenther changed pull request status to open
gmastrapas changed pull request status to merged
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment