autoprogrammer/olmoe_densebackward0125_lr1e-06_ESFT-summary_epoch_0 Text Generation • 7B • Updated May 8 • 12
autoprogrammer/OLMoE-1B-7B-0125_lr2e-05_ESFT-summary_epoch_3_freeze Text Generation • 7B • Updated May 8 • 13
autoprogrammer/OLMoE-1B-7B-0125_lr2e-05_ESFT-summary_epoch_2_freeze Text Generation • 7B • Updated May 8 • 3
autoprogrammer/OLMoE-1B-7B-0125_lr2e-05_ESFT-summary_epoch_1_freeze Text Generation • 7B • Updated May 8 • 12
autoprogrammer/OLMoE-1B-7B-0125_lr2e-05_ESFT-summary_epoch_0_freeze Text Generation • 7B • Updated May 8 • 12
autoprogrammer/olmoe_densebackward0125_lr1e-04_ESFT-law_epoch_2 Text Generation • 7B • Updated May 8 • 3
autoprogrammer/olmoe_densebackward0125_lr1e-04_ESFT-law_epoch_1 Text Generation • 7B • Updated May 8 • 3
autoprogrammer/olmoe_densebackward0125_lr1e-04_ESFT-law_epoch_0 Text Generation • 7B • Updated May 8 • 2
autoprogrammer/OLMoE-1B-7B-0125_ESFT-intent_lr2e-05_epoch4_freeze Text Generation • 7B • Updated May 8 • 12
autoprogrammer/OLMoE-1B-7B-0125_ESFT-translation_r2e-05_epoch4_freeze Feature Extraction • 7B • Updated May 7 • 1
autoprogrammer/Llama-3.2-1B-Instruct-commonsense_qa-medmcqa-linear Text Generation • 1B • Updated Jan 7 • 18