Triangle104
/

Herodotos-14B_V0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

We're all stories in the end

Merge Method

This model was merged using the TIES merge method using v000000/Qwen2.5-Lumen-14B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: v000000/Qwen2.5-Lumen-14B
    #no parameters necessary for base model
  - model: Qwen/Qwen2.5-Coder-14B
    parameters:
      density: 0.5
      weight: 0.5
  - model: Krystalan/DRT-o1-14B
    parameters:
      density: 0.5
      weight: 0.5

merge_method: ties
base_model: v000000/Qwen2.5-Lumen-14B
parameters:
  normalize: false
  int8_mask: true
dtype: float16

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	4.56
IFEval (0-Shot)	18.79
BBH (3-Shot)	2.95
MATH Lvl 5 (4-Shot)	0.00
GPQA (0-shot)	0.00
MuSR (0-shot)	3.81
MMLU-PRO (5-shot)	1.83

Downloads last month: 52

Safetensors

Model size

14.8B params

Tensor type

FP16

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for Triangle104/Herodotos-14B_V0.1

Krystalan/DRT-14B

Qwen/Qwen2.5-Coder-14B

v000000/Qwen2.5-Lumen-14B

Merge model

this model

Quantizations

Collections including Triangle104/Herodotos-14B_V0.1

Qwen

Alibaba Cloud-based models • 1246 items • Updated 6 days ago • 5

RP

Roleplaying Models • 1285 items • Updated 15 days ago • 5

Merges

Personal Merges • 104 items • Updated 18 days ago • 1

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard

18.790
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard

2.950
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard

0.000
acc_norm on GPQA (0-shot)
Open LLM Leaderboard

0.000
acc_norm on MuSR (0-shot)
Open LLM Leaderboard

3.810
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard

1.830

View on Papers With Code