MasterControlAIML
/

DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora

Text Generation

text-generation-inference

Model card Files Files and versions

bhaviktheslider commited on Jun 18

Commit

8faf870

·

verified ·

1 Parent(s): 3ecf53c

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ language:
 | **Field**             | **Value**                                  |
 |-----------------------|--------------------------------------------|
-| **Developed by**      | **bhaviktheslider**                        |
 | **License**           | Apache 2.0                                 |
 | **Finetuned from**    | `unsloth/Qwen2.5-3B-Instruct`              |
 | **Training Framework**| [Unsloth](https://github.com/unslothai/unsloth) × Hugging Face TRL |
@@ -204,7 +204,7 @@ Stay tuned—numbers landing faster than you can say “schema validation.”
 ```bibtex
 @misc{bhaviktheslider_2025_unsloth_qwen2.5_3b_grpo,
   title  = {An Unsloth-accelerated GRPO-trained Qwen 2.5-3B for JSON structuring},
-  author = {Bhaviktheslider},
   year   = {2025},
   howpublished = {\url{https://huggingface.co/MasterControlAIML/DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora}}
 }

 | **Field**             | **Value**                                  |
 |-----------------------|--------------------------------------------|
+| **Developed by**      | **MasterControlAIML**                        |
 | **License**           | Apache 2.0                                 |
 | **Finetuned from**    | `unsloth/Qwen2.5-3B-Instruct`              |
 | **Training Framework**| [Unsloth](https://github.com/unslothai/unsloth) × Hugging Face TRL |
 ```bibtex
 @misc{bhaviktheslider_2025_unsloth_qwen2.5_3b_grpo,
   title  = {An Unsloth-accelerated GRPO-trained Qwen 2.5-3B for JSON structuring},
+  author = {MasterControlAIML},
   year   = {2025},
   howpublished = {\url{https://huggingface.co/MasterControlAIML/DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora}}
 }