Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
Dovakiins
/
qwerrwe
Build error

App Files Files Community
Fetching metadata from the HF Docker repository...
qwerrwe / src /axolotl /monkeypatch
115 kB
  • 100 contributors
History: 57 commits
winglian's picture
winglian
Mixtral fixes 20240124 (#1192) [skip ci]
54d2ac1 unverified almost 2 years ago
  • falcon
    Falcon embeddings (#1149) [skip docker] almost 2 years ago
  • mixtral
    Mixtral fixes 20240124 (#1192) [skip ci] almost 2 years ago
  • phi
    Phi2 multipack (#1173) almost 2 years ago
  • qwen2
    Qwen2 (#1166) almost 2 years ago
  • btlm_attn_hijack_flash.py
    2.32 kB
    flash_attention + sample packing for stablelm 3b (#671) about 2 years ago
  • fastchat_conversation_turns.py
    8.33 kB
    Added chatglm3 conversation type for training models like TinyLLama (#1036) almost 2 years ago
  • llama_attn_hijack_flash.py
    32 kB
    Add shifted sparse attention (#973) [skip-ci] almost 2 years ago
  • llama_attn_hijack_sdp.py
    4.81 kB
    various bugfixes (#856) almost 2 years ago
  • llama_attn_hijack_xformers.py
    5.69 kB
    various bugfixes (#856) almost 2 years ago
  • llama_expand_mask.py
    1.92 kB
    Attention mask and position id fixes for packing (#285) about 2 years ago
  • mistral_attn_hijack_flash.py
    22.4 kB
    adds llama and mistral dropout support (#858) almost 2 years ago
  • relora.py
    14 kB
    fix checkpints on multigpu (#481) about 2 years ago
  • stablelm_attn_hijack_flash.py
    15.4 kB
    flash_attention + sample packing for stablelm 3b (#671) about 2 years ago
  • utils.py
    5.3 kB
    Multipack simplify for Mixtral (#1142) almost 2 years ago