Akarshan Biswas
qnixsynapse
AI & ML interests
NLP, models, quantization
Recent Activity
upvoted
a
collection
2 days ago
Gemma 3 Release
liked
a model
26 days ago
deepseek-ai/DeepSeek-R1
upvoted
a
paper
about 1 month ago
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth
Approach
Organizations
None yet
qnixsynapse's activity
Tool calling support in Gemma 2
2
#50 opened 3 months ago
by
qnixsynapse

Is this really an Instruct model?
#1 opened 6 months ago
by
qnixsynapse

[MODELS] Discussion
642
#372 opened about 1 year ago
by
victor

[TOOLS] Community Discussion
27
#455 opened 10 months ago
by
victor

Wrong number of tensors; expected 292, got 291
6
#69 opened 8 months ago
by
KingBadger
[FEATURE] Tools
69
#470 opened 10 months ago
by
victor

Utterly based
1
#9 opened 8 months ago
by
llama-anon

Add IQ Quantization support with the help of imatrix and GPUs
8
#35 opened 11 months ago
by
qnixsynapse

Suggestion: Host Gemma2 using keras_nlp instead of transformers library for the time being
2
#498 opened 9 months ago
by
qnixsynapse

The best 8B in the planet right now. PERIOD!
2
#22 opened 11 months ago
by
cyberneticos

How many active parameters does this model have?
3
#6 opened 11 months ago
by
lewtun

7B or 8B?
4
#24 opened about 1 year ago
by
amgadhasan
Which model is responsible for naming of the thread?
8
#402 opened 11 months ago
by
qnixsynapse

Consider adding <start_of_context> and <stop_of_context> or similar special tokens for context ingestion.
#13 opened 11 months ago
by
qnixsynapse

Number of parameters
8
#9 opened 11 months ago
by
HugoLaurencon

RMSNorm eps value is wrong
#20 opened about 1 year ago
by
qnixsynapse

RMSNorm eps value is wrong
#19 opened about 1 year ago
by
qnixsynapse

Loading the model
3
#3 opened over 1 year ago
by
PyrroAiakid
Looking for GGUF format for this model
1
#14 opened over 1 year ago
by
barha
Help needed to load model
19
#13 opened over 1 year ago
by
sanjay-dev-ds-28