gair-prox
/

Llama-2-7B-ProXMath

Text Generation

Model card Files Files and versions Community

SinclairWang commited on Sep 17, 2024

Commit

003bb45

·

verified ·

1 Parent(s): 36f112e

Update README.md

Files changed (1) hide show

README.md +39 -3

README.md CHANGED Viewed

@@ -1,3 +1,39 @@
----
-license: llama2
----

+---
+license: llama2
+datasets:
+- gair-prox/open-web-math-pro
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- llama2
+- math
+- reasoning
+base_model:
+- meta-llama/Llama-2-7b-hf
+---
+# Llama-2-7B-ProXMath
+<p align="center">
+  <img src="prox-teaser.png">
+</p>
+[ArXiv](http://arxiv.org/abs/xxxx) | [Data: OpenWebMath-Pro](https://huggingface.co/datasets/gair-prox/open-web-math-pro) | [Code](https://github.com/GAIR-NLP/program-every-example)
+**Llama-2-7B-ProXMath** is a math-adapted Llama-2-7B model that is continually pre-trained on [OpenWebMath-Pro](https://huggingface.co/datasets/gair-prox/open-web-math-pro) (a refined version by ProX) for **10**B tokens.
+## Evaluations
+ProX models are evaluated on 9 common math reasoning benchmarks.
+| Model               | asdiv | gsm8k | mathqa | mawps | minerva_math | mmlu_stem | sat_math | svamp | tabmwp | average |
+|---------------------|:-----:|:-----:|:------:|:-----:|:------------:|:---------:|:--------:|:-----:|:------:|:-------:|
+| Llama-2-7B          |  51.6 |  14.1 |  12.5  |  63.6 |      3.8     |    32.9   |   34.4   |  39.5 |  30.9  |  31.48  |
+| Llama-2-7B-ProXMath |  63.7 |  30.6 |  40.1  |  79.3 |     16.8     |    43.8   |   53.1   |  50.2 |  37.3  |   46.1  |
+### Citation
+```
+@misc{TBD
+}
+```