Stanisław Szymczyk
sszymczyk
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
RUC-AIBOX/Virgo-72B:Missing tokenizer.json and tokenizer_config.json files
Organizations
None yet
sszymczyk's activity
Requesting Support for GGUF Quantization of MiniMax-Text-01 through llama.cpp
4
#1 opened about 1 month ago
by
Doctor-Chad-PhD
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6776855be57a4c8f9e6e7aaf/rPvn5Og7NX7PXk1d1mnXP.jpeg)
Missing tokenizer.json and tokenizer_config.json files
1
#2 opened about 1 month ago
by
sszymczyk
Please add the "tokenizer.model" file
2
#3 opened about 1 month ago
by
ken133
CUDA out of memory error during fp8 to bf16 model conversion + fix
1
#17 opened about 2 months ago
by
sszymczyk
Hardware Requirements
6
#1 opened 3 months ago
by
Lightchain
Problem with specific output format
#15 opened 3 months ago
by
sszymczyk
Please test the QwQ-32B-Preview model
#3 opened 3 months ago
by
sszymczyk
Can you provide code for inference with MCTS?
6
#3 opened 3 months ago
by
sszymczyk
Reason behind not using special tokens in the prompt format?
2
#2 opened 3 months ago
by
Doctor-Shotgun
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1670736706483-632b9f9866f28bf34ae85487.jpeg)
The curse of the Consolidated Safetensors strikes again...
2
#4 opened 3 months ago
by
jukofyork
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65995c45539c808e84c38bf1/k0y3ULloWQEMvosQwHgrE.png)
What call() function parameters besides "query" can be used by the model when doing brave_search and wolfram_alpha tool calls?
#89 opened 6 months ago
by
sszymczyk
What form of the built-in brave_search and wolfram_alpha tool call output is expected by the model?
3
#88 opened 6 months ago
by
sszymczyk
The model often enters infinite generation loops
13
#32 opened 7 months ago
by
sszymczyk
Translation to German doesn't work in 3B model
#8 opened 8 months ago
by
sszymczyk
Calculation of _mscale during YARN RoPE scaling
1
#4 opened 9 months ago
by
sszymczyk
Wrong BOS and EOS tokens in tokenizer.model file
1
#12 opened 10 months ago
by
sszymczyk
Confusing ArcticDecoderLayer::forward() implementation
#11 opened 10 months ago
by
sszymczyk
Problem with repeated generation of newline characters
2
#3 opened 10 months ago
by
sszymczyk