This is a d-Matrix functional reference of the OPT-6B7 model. The reference provides the following functional configurations:

Configuration    Explanation
BASELINE         a reference functionally equivalent to the original model
BASIC            all linear algebraic operands quantized to MXINT8-64, and all other operations transformed to approximated kernel simulations
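
To give a rough intuition for what block-wise integer quantization of an operand involves, here is a minimal pure-Python sketch. It assumes that "MXINT8-64" denotes 8-bit signed integers sharing one power-of-two scale per block of 64 elements; that reading, and the helper names `quantize_block`, `dequantize_block`, and `fake_quantize`, are illustrative assumptions and not the Dmx_Compressor implementation.

```python
import math

BLOCK_SIZE = 64   # assumed meaning of the "-64" suffix (elements per shared scale)
INT8_MAX = 127    # symmetric signed 8-bit range used in this sketch


def quantize_block(block):
    """Quantize one block to int8 values that share a power-of-two scale.

    Returns (integers, exponent) such that value ~= integer * 2**exponent.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return [0] * len(block), 0
    # Smallest exponent for which max_abs / 2**exp still fits in [-127, 127].
    exp = math.ceil(math.log2(max_abs / INT8_MAX))
    scale = 2.0 ** exp
    # Round to the nearest integer and clamp as a guard against float error.
    q = [max(-INT8_MAX, min(INT8_MAX, round(x / scale))) for x in block]
    return q, exp


def dequantize_block(q, exp):
    """Map the shared-scale integers back to floating point."""
    scale = 2.0 ** exp
    return [v * scale for v in q]


def fake_quantize(values, block_size=BLOCK_SIZE):
    """Quantize then dequantize a flat list, block by block (hypothetical helper)."""
    out = []
    for start in range(0, len(values), block_size):
        q, exp = quantize_block(values[start:start + block_size])
        out.extend(dequantize_block(q, exp))
    return out


if __name__ == "__main__":
    data = [math.sin(i) for i in range(130)]
    approx = fake_quantize(data)
    print(max(abs(a - b) for a, b in zip(data, approx)))
```

Because the scale is shared within a block, the rounding error of each element is at most half of that block's scale, so blocks with a large outlier lose more precision on their small values.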

Usage

Install d-Matrix Dmx_Compressor first.

pip install dmx_compressor

The following example loads the model, transforms it with Dmx_Compressor, and evaluates it with lm-evaluation-harness.

git clone https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
pip install -e .

Then, in Python:

from dmx.compressor.modeling import DmxModel
import lm_eval

model_args = "pretrained=d-matrix/opt-6b7,trust_remote_code=True"

lm = lm_eval.api.registry.get_model("hf").create_from_arg_string(model_args, {"batch_size": 1})

# Transform the model with DMX
lm._model = DmxModel.from_torch(lm._model)

eval_results = lm_eval.evaluate(lm, lm_eval.tasks.get_task_dict(["wikitext"]))  # Choose the desired task(s), e.g. "wikitext"
