The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published Feb 3 • 112
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-10 Text Generation • Updated Aug 14, 2024 • 164
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-5 Text Generation • Updated Aug 14, 2024 • 162
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-0 Text Generation • Updated Aug 14, 2024 • 131
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-10 Text Generation • Updated Aug 14, 2024 • 166
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-5 Text Generation • Updated Aug 14, 2024 • 135
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-0 Text Generation • Updated Aug 14, 2024 • 171
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-10 Text Generation • Updated Aug 14, 2024 • 166
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 164
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 161
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-10 Text Generation • Updated Aug 13, 2024 • 167
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 124
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 124
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-10 Text Generation • Updated Aug 13, 2024 • 161
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 120
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 161
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-10 Text Generation • Updated Aug 13, 2024 • 160
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 162
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-0 Text Generation • Updated Aug 13, 2024 • 163
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-5 Text Generation • Updated Aug 13, 2024 • 122