Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
vishaljoshi24
/
trl-4-dnd
Paused

App Files Files Community
Fetching metadata from the HF Docker repository...
trl-4-dnd / examples /research_projects /stack_llama /scripts
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
vishaljoshi24's picture
vishaljoshi24
Initial Commit
a080fe0 12 days ago
  • README.md
    1.87 kB
    Initial Commit 12 days ago
  • merge_peft_adapter.py
    2.61 kB
    Initial Commit 12 days ago
  • reward_modeling.py
    11.9 kB
    Initial Commit 12 days ago
  • rl_training.py
    10.3 kB
    Initial Commit 12 days ago
  • supervised_finetuning.py
    7.73 kB
    Initial Commit 12 days ago