Input Parameters
Hidden size:
Intermediate size:
Vocab size:
Number of key-value heads:
Number of attention heads:
Number of hidden layers:
Include bias?
Yes
No
Calculate
Model Parameter Results
1 Layer Parameters
1 Layer Parameters:
0
Full Layers Parameters:
0
Attention Parameters
Feed Forward Parameters
Embedding Parameters
Complete Model Size:
0