futureHQ committed · verified
Commit 7e7d945 · 1 Parent(s): ba1326f

Upload folder using huggingface_hub
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+.ipynb_checkpoints/tokenizer-checkpoint.json filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text
.ipynb_checkpoints/README-checkpoint.md ADDED
@@ -0,0 +1,136 @@
---
base_model: unsloth/gemma-3-27b-it
library_name: peft
license: gemma
---

# LOGIC-27B

LOGIC Framework (Lay Out Problem, Options, Groundwork, Inference, Conclusion)

Output format:
```xml
<layOutProblem>
Explain the problem or question in detail.
</layOutProblem>

<options>
List different strategies or paths you might take to solve the problem.
</options>

<groundwork>
Note important facts, formulas, or references needed. Establish constraints.
</groundwork>

<inference>
Perform detailed reasoning or computations. Validate steps or check for errors.
</inference>

<conclusion>
Present the clear answer or outcome.
</conclusion>
```

Why Use LOGIC?
- Emphasizes exploring multiple options.
- Encourages building a strong groundwork of facts or constraints.
- Includes inference to handle the step-by-step thinking process.
- Ends with a conclusion that ties it all together.
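For downstream use, the five tagged sections can be pulled out of a completion with a small parser. This is an illustrative sketch, not part of the model release; the tag names follow the output format above.

```python
import re

# The five LOGIC section tags, in the order the format prescribes.
LOGIC_TAGS = ["layOutProblem", "options", "groundwork", "inference", "conclusion"]

def parse_logic(completion: str) -> dict:
    """Extract each <tag>...</tag> section from a LOGIC-formatted completion."""
    sections = {}
    for tag in LOGIC_TAGS:
        match = re.search(rf"<{tag}>(.*?)</{tag}>", completion, re.DOTALL)
        sections[tag] = match.group(1).strip() if match else None
    return sections

# Hypothetical, abbreviated completion for illustration.
example = "<layOutProblem>Find R.</layOutProblem>\n<conclusion>R = 65/8</conclusion>"
print(parse_logic(example)["conclusion"])  # R = 65/8
```

Missing sections come back as `None`, which makes it easy to detect completions that drifted from the format.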
### Model Details
- **Model Name:** Bifröst-27B
- **Base Architecture:** gemma3
- **Application:** Reasoning using LOGIC Framework
- **Release Date:** 16-March-2025

System Prompt
```python
system_prompt = """You are given a problem.
You are LOGIC
Always follow the format
layOutProblem
options
groundwork
inference
conclusion

Explain the problem or question in detail.
Place it between <layOutProblem> and </layOutProblem>

List different strategies or paths you might take to solve the problem.
Place it between <options> and </options>

Note important facts, formulas, or references needed. Establish constraints.
Place it between <groundwork> and </groundwork>

Think about the problem and provide your working out.
Place it between <inference> and </inference>.

Then, provide your solution between <conclusion> and </conclusion>
"""
```
Example Prompt
```python
from transformers import TextStreamer

# `model` and `tokenizer` are assumed to be loaded already
# (the gemma-3 base model with this PEFT adapter applied).
messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": "A triangle has sides 13, 14, and 15. Find the radius of its circumcircle."},
]

text = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt = True,
    tokenize = False,
)
_ = model.generate(
    **tokenizer(text, return_tensors = "pt").to("cuda"),
    max_new_tokens = 1024,
    temperature = 1.0, top_p = 0.95, top_k = 64,
    streamer = TextStreamer(tokenizer, skip_prompt = True),
)
```

Example Output
```xml
<layOutProblem>
The problem asks us to find the radius of the circumcircle of a triangle with sides of length 13, 14, and 15. The circumcircle is the circle that passes through all three vertices of the triangle. The radius of the circumcircle is often denoted by *R*. We are given the lengths of the three sides of the triangle, and we need to determine the radius of the circle that circumscribes it.
</layOutProblem>

<options>
1. **Use the formula R = abc / 4K**, where a, b, and c are the side lengths and K is the area of the triangle. This requires us to first calculate the area of the triangle. We can use Heron's formula to find the area.
2. **Use the Law of Sines:** a/sin(A) = b/sin(B) = c/sin(C) = 2R. This requires us to find one of the angles of the triangle, which can be done using the Law of Cosines.
3. **Coordinate Geometry:** Place the triangle on a coordinate plane and find the circumcenter, then calculate the distance from the circumcenter to any vertex. This is generally more complex.
</options>

<groundwork>
* **Heron's Formula:** For a triangle with sides a, b, and c, the area K is given by K = sqrt(s(s-a)(s-b)(s-c)), where s is the semi-perimeter, s = (a+b+c)/2.
* **Law of Cosines:** c² = a² + b² - 2ab*cos(C)
* **Law of Sines:** a/sin(A) = b/sin(B) = c/sin(C) = 2R
* **Circumradius Formula:** R = abc / 4K
* **Side lengths:** a = 13, b = 14, c = 15
* **Constraint:** The radius must be a positive real number.
</groundwork>

<inference>
Let's use the formula R = abc / 4K. First, we need to find the area K using Heron's formula.
The semi-perimeter s = (13 + 14 + 15) / 2 = 42 / 2 = 21.
Now, we can calculate the area K:
K = sqrt(21(21-13)(21-14)(21-15)) = sqrt(21 * 8 * 7 * 6) = sqrt(3 * 7 * 2³ * 7 * 2 * 3) = sqrt(2⁴ * 3² * 7²) = 2² * 3 * 7 = 4 * 21 = 84.
Now we can find the circumradius R:
R = (13 * 14 * 15) / (4 * 84) = (13 * 14 * 15) / 336 = (13 * 2 * 7 * 3 * 5) / (2⁴ * 3 * 7) = (13 * 5) / 8 = 65 / 8 = 8.125
</inference>

<conclusion>
The radius of the circumcircle is 65/8 or 8.125.
</conclusion><end_of_turn>
```
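The arithmetic in the example output checks out; a short snippet (illustrative only, not part of the model card) reproduces it with Heron's formula and R = abc / 4K:

```python
import math

# Side lengths from the example prompt.
a, b, c = 13, 14, 15

# Heron's formula: area from the semi-perimeter.
s = (a + b + c) / 2
K = math.sqrt(s * (s - a) * (s - b) * (s - c))

# Circumradius formula.
R = (a * b * c) / (4 * K)

print(K)  # 84.0
print(R)  # 8.125
```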
.ipynb_checkpoints/adapter_config-checkpoint.json ADDED
@@ -0,0 +1,31 @@
{
  "alpha_pattern": {},
  "auto_mapping": null,
  "base_model_name_or_path": "unsloth/gemma-3-27b-it",
  "bias": "none",
  "corda_config": null,
  "eva_config": null,
  "exclude_modules": null,
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {},
  "lora_alpha": 64,
  "lora_bias": false,
  "lora_dropout": 0,
  "megatron_config": null,
  "megatron_core": "megatron.core",
  "modules_to_save": null,
  "peft_type": "LORA",
  "r": 64,
  "rank_pattern": {},
  "revision": null,
  "target_modules": "(?:.*?(?:language|text).*?(?:self_attn|attention|attn|mlp|feed_forward|ffn|dense).*?(?:k_proj|v_proj|q_proj|out_proj|fc1|fc2|o_proj|gate_proj|up_proj|down_proj).*?)|(?:\\bmodel\\.layers\\.[\\d]{1,}\\.(?:self_attn|attention|attn|mlp|feed_forward|ffn|dense)\\.(?:(?:k_proj|v_proj|q_proj|out_proj|fc1|fc2|o_proj|gate_proj|up_proj|down_proj)))",
  "task_type": "CAUSAL_LM",
  "trainable_token_indices": null,
  "use_dora": false,
  "use_rslora": false
}
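The `target_modules` value here is a regular expression: PEFT matches a string-valued `target_modules` against full module names with `re.fullmatch`. As a rough illustration (the module names below are hypothetical examples of Gemma-3-style naming, not taken from this repo):

```python
import re

# The target_modules regex from adapter_config.json (JSON string escaping removed).
pattern = (
    r"(?:.*?(?:language|text).*?(?:self_attn|attention|attn|mlp|feed_forward|ffn|dense)"
    r".*?(?:k_proj|v_proj|q_proj|out_proj|fc1|fc2|o_proj|gate_proj|up_proj|down_proj).*?)"
    r"|(?:\bmodel\.layers\.[\d]{1,}\.(?:self_attn|attention|attn|mlp|feed_forward|ffn|dense)"
    r"\.(?:(?:k_proj|v_proj|q_proj|out_proj|fc1|fc2|o_proj|gate_proj|up_proj|down_proj)))"
)

# Hypothetical module names for illustration.
candidates = [
    "model.layers.0.self_attn.q_proj",            # matches the second alternative
    "language_model.model.layers.3.mlp.up_proj",  # matches the first alternative
    "model.layers.0.self_attn.rotary_emb",        # no: not a targeted projection
    "vision_tower.blocks.0.attn.q_proj",          # no: not a language/text module
]
for name in candidates:
    print(name, bool(re.fullmatch(pattern, name)))
```

The two alternatives together restrict the LoRA adapters to attention and MLP projections in the language stack while leaving vision-tower modules untouched.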
.ipynb_checkpoints/chat_template-checkpoint.json ADDED
@@ -0,0 +1,3 @@
{
  "chat_template": "{{ bos_token }}\n{%- if messages[0]['role'] == 'system' -%}\n    {%- if messages[0]['content'] is string -%}\n        {%- set first_user_prefix = messages[0]['content'] + '\n\n' -%}\n    {%- else -%}\n        {%- set first_user_prefix = messages[0]['content'][0]['text'] + '\n\n' -%}\n    {%- endif -%}\n    {%- set loop_messages = messages[1:] -%}\n{%- else -%}\n    {%- set first_user_prefix = \"\" -%}\n    {%- set loop_messages = messages -%}\n{%- endif -%}\n{%- for message in loop_messages -%}\n    {%- if (message['role'] == 'user') != (loop.index0 % 2 == 0) -%}\n        {{ raise_exception(\"Conversation roles must alternate user/assistant/user/assistant/...\") }}\n    {%- endif -%}\n    {%- if (message['role'] == 'assistant') -%}\n        {%- set role = \"model\" -%}\n    {%- else -%}\n        {%- set role = message['role'] -%}\n    {%- endif -%}\n    {{ '<start_of_turn>' + role + '\n' + (first_user_prefix if loop.first else \"\") }}\n    {%- if message['content'] is string -%}\n        {{ message['content'] | trim }}\n    {%- elif message['content'] is iterable -%}\n        {%- for item in message['content'] -%}\n            {%- if item['type'] == 'image' -%}\n                {{ '<start_of_image>' }}\n            {%- elif item['type'] == 'text' -%}\n                {{ item['text'] | trim }}\n            {%- endif -%}\n        {%- endfor -%}\n    {%- else -%}\n        {{ raise_exception(\"Invalid content type\") }}\n    {%- endif -%}\n    {{ '<end_of_turn>\n' }}\n{%- endfor -%}\n{%- if add_generation_prompt -%}\n    {{'<start_of_turn>model\n'}}\n{%- endif -%}\n"
}
.ipynb_checkpoints/preprocessor_config-checkpoint.json ADDED
@@ -0,0 +1,29 @@
{
  "do_convert_rgb": null,
  "do_normalize": true,
  "do_pan_and_scan": null,
  "do_rescale": true,
  "do_resize": true,
  "image_mean": [
    0.5,
    0.5,
    0.5
  ],
  "image_processor_type": "Gemma3ImageProcessor",
  "image_seq_length": 256,
  "image_std": [
    0.5,
    0.5,
    0.5
  ],
  "pan_and_scan_max_num_crops": null,
  "pan_and_scan_min_crop_size": null,
  "pan_and_scan_min_ratio_to_activate": null,
  "processor_class": "Gemma3Processor",
  "resample": 2,
  "rescale_factor": 0.00392156862745098,
  "size": {
    "height": 896,
    "width": 896
  }
}
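Note that `rescale_factor` is 1/255, so with a mean and std of 0.5 the preprocessing maps 8-bit pixel values from [0, 255] to [-1, 1]. A quick check of that arithmetic (illustrative only):

```python
# Pixel preprocessing per preprocessor_config.json: rescale by 1/255
# (the config's 0.00392156862745098), then normalize with mean 0.5, std 0.5.
MEAN, STD = 0.5, 0.5

def preprocess_pixel(value: int) -> float:
    """Map an 8-bit pixel value from [0, 255] into the model's [-1, 1] range."""
    rescaled = value / 255
    return (rescaled - MEAN) / STD

print(preprocess_pixel(0))    # -1.0
print(preprocess_pixel(255))  # 1.0
```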
.ipynb_checkpoints/processor_config-checkpoint.json ADDED
@@ -0,0 +1,4 @@
{
  "image_seq_length": 256,
  "processor_class": "Gemma3Processor"
}
.ipynb_checkpoints/special_tokens_map-checkpoint.json ADDED
@@ -0,0 +1,33 @@
{
  "boi_token": "<start_of_image>",
  "bos_token": {
    "content": "<bos>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "eoi_token": "<end_of_image>",
  "eos_token": {
    "content": "<end_of_turn>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "image_token": "<image_soft_token>",
  "pad_token": {
    "content": "<pad>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "unk_token": {
    "content": "<unk>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
}
.ipynb_checkpoints/tokenizer-checkpoint.json ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4667f2089529e8e7657cfb6d1c19910ae71ff5f28aa7ab2ff2763330affad795
size 33384568
.ipynb_checkpoints/tokenizer_config-checkpoint.json ADDED
The diff for this file is too large to render. See raw diff
 
README.md ADDED
@@ -0,0 +1,136 @@
Identical to .ipynb_checkpoints/README-checkpoint.md above.
adapter_config.json ADDED
@@ -0,0 +1,31 @@
Identical to .ipynb_checkpoints/adapter_config-checkpoint.json above.
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f2fb0a4686e2f05caa33ccf8d4db249f3697c2f2abd55a5e58699118842c54c3
size 1816396048
added_tokens.json ADDED
@@ -0,0 +1,3 @@
{
  "<image_soft_token>": 262144
}
chat_template.json ADDED
@@ -0,0 +1,3 @@
Identical to .ipynb_checkpoints/chat_template-checkpoint.json above.
preprocessor_config.json ADDED
@@ -0,0 +1,29 @@
Identical to .ipynb_checkpoints/preprocessor_config-checkpoint.json above.
processor_config.json ADDED
@@ -0,0 +1,4 @@
{
  "image_seq_length": 256,
  "processor_class": "Gemma3Processor"
}
special_tokens_map.json ADDED
@@ -0,0 +1,33 @@
Identical to .ipynb_checkpoints/special_tokens_map-checkpoint.json above.
tokenizer.json ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4667f2089529e8e7657cfb6d1c19910ae71ff5f28aa7ab2ff2763330affad795
size 33384568
tokenizer.model ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1299c11d7cf632ef3b4e11937501358ada021bbdf7c47638d13c0ee982f2e79c
size 4689074
tokenizer_config.json ADDED
The diff for this file is too large to render. See raw diff