Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
c1b741d
qwerrwe
/
src
/
axolotl
/
monkeypatch
121 kB
100 contributors
History:
51 commits
winglian
optimize calculation of cu_seqlens from position_ids (#1084) [skip ci]
90036eb
unverified
almost 2 years ago
mixtral
bump transformers and update attention class map name (#1023)
almost 2 years ago
btlm_attn_hijack_flash.py
Safe
2.32 kB
flash_attention + sample packing for stablelm 3b (#671)
about 2 years ago
fastchat_conversation_turns.py
Safe
8.33 kB
Added chatglm3 conversation type for training models like TinyLLama (#1036)
almost 2 years ago
llama_attn_hijack_flash.py
Safe
27.1 kB
adds llama and mistral dropout support (#858)
almost 2 years ago
llama_attn_hijack_sdp.py
Safe
4.81 kB
various bugfixes (#856)
almost 2 years ago
llama_attn_hijack_xformers.py
Safe
5.69 kB
various bugfixes (#856)
almost 2 years ago
llama_expand_mask.py
Safe
1.92 kB
Attention mask and position id fixes for packing (#285)
about 2 years ago
mistral_attn_hijack_flash.py
Safe
22.4 kB
adds llama and mistral dropout support (#858)
almost 2 years ago
relora.py
Safe
14 kB
fix checkpints on multigpu (#481)
about 2 years ago
stablelm_attn_hijack_flash.py
Safe
15.4 kB
flash_attention + sample packing for stablelm 3b (#671)
about 2 years ago
utils.py
4.22 kB
optimize calculation of cu_seqlens from position_ids (#1084) [skip ci]
almost 2 years ago