Update README.md
README.md
CHANGED
@@ -1,4 +1,4 @@
-Test network using differential attention instead of classical attention. Other than some alterations to the attention, this is otherwise the same configuration as https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct
+Test network using differential attention instead of classical attention (using nope). Other than some alterations to the attention, this is otherwise the same configuration as https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct
 
 # Scripts:
 - `inference.py` to run the model with some test prompts
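
For readers unfamiliar with the term, here is a rough sketch of what "differential attention" generally means: two softmax attention maps are computed from split query/key projections, and the second is subtracted from the first before weighting the values. This is not the repository's code. The function name, the weight shapes, and the fixed scalar `lam` are all assumptions made for illustration, and "nope" is read here as "no positional encoding", so no rotary or absolute position signal is applied. The published formulation also uses a learnable lambda and a per-head normalization, both omitted here.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax; masked (-inf) entries become exactly 0.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def differential_attention(x, wq, wk, wv, lam=0.5):
    """Single-head causal differential attention (illustrative sketch).

    x:  (seq, d_model) token embeddings, with no positional encoding added
    wq: (d_model, 2 * d_head) query projection, split into two halves
    wk: (d_model, 2 * d_head) key projection, split into two halves
    wv: (d_model, d_v) value projection
    lam: fixed scalar here (assumption); learnable in the published method
    """
    seq = x.shape[0]
    q1, q2 = np.split(x @ wq, 2, axis=-1)
    k1, k2 = np.split(x @ wk, 2, axis=-1)
    v = x @ wv
    d_head = q1.shape[-1]

    # Causal mask: position i may attend only to positions <= i.
    mask = np.where(np.tri(seq, dtype=bool), 0.0, -np.inf)

    a1 = softmax(q1 @ k1.T / np.sqrt(d_head) + mask)
    a2 = softmax(q2 @ k2.T / np.sqrt(d_head) + mask)

    # The difference of the two maps replaces the single softmax map
    # used in classical attention.
    return (a1 - lam * a2) @ v
```

Setting `lam=0` reduces this to ordinary causal softmax attention over the first query/key half, which makes it a convenient sanity check when comparing against a classical baseline.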