CyberAgentLM2-7B-Chat (CALM2-7B-Chat)

Model Description

CyberAgentLM2-7B-Chat (CALM2-7B-Chat) is a fine-tuned version of CyberAgentLM2 for dialogue use cases.

Requirements

  • transformers >= 4.34.1
  • accelerate

Usage

import transformers
from packaging import version
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

# Compare versions numerically rather than lexicographically.
assert version.parse(transformers.__version__) >= version.parse("4.34.1")

# device_map="auto" places the weights on the available devices;
# torch_dtype="auto" loads the checkpoint in its native precision (bfloat16).
model = AutoModelForCausalLM.from_pretrained("cyberagent/calm2-7b-chat", device_map="auto", torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained("cyberagent/calm2-7b-chat")
# Stream generated tokens to stdout, skipping the prompt and special tokens.
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

prompt = """USER: AIใซใ‚ˆใฃใฆ็ง้”ใฎๆšฎใ‚‰ใ—ใฏใฉใฎใ‚ˆใ†ใซๅค‰ใ‚ใ‚Šใพใ™ใ‹๏ผŸ
ASSISTANT: """

token_ids = tokenizer.encode(prompt, return_tensors="pt")
output_ids = model.generate(
    input_ids=token_ids.to(model.device),
    max_new_tokens=300,   # generate at most 300 new tokens
    do_sample=True,       # sample instead of greedy decoding
    temperature=0.8,
    streamer=streamer,    # print tokens as they are generated
)
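
The streamer already prints the completion as it is generated. If the completion is also needed as a Python string, a minimal sketch is to slice the prompt tokens off the returned sequence and decode the rest:

# Keep only the newly generated tokens, i.e. everything after the prompt.
generated_ids = output_ids[0][token_ids.shape[1]:]
print(tokenizer.decode(generated_ids, skip_special_tokens=True))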

Chat Template

USER: {user_message1}
ASSISTANT: {assistant_message1}<|endoftext|>
USER: {user_message2}
ASSISTANT: {assistant_message2}<|endoftext|>
USER: {user_message3}
ASSISTANT: {assistant_message3}<|endoftext|>
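
For multi-turn conversations, a prompt can be assembled by following the template above. The helper below is an illustrative sketch (build_prompt is not part of the released code): each completed assistant turn ends with <|endoftext|>, and the prompt ends with "ASSISTANT: " so the model writes the next reply.

# Illustrative helper for the USER/ASSISTANT template above.
def build_prompt(turns, next_user_message):
    parts = []
    for user_message, assistant_message in turns:
        parts.append(f"USER: {user_message}\n")
        parts.append(f"ASSISTANT: {assistant_message}<|endoftext|>\n")
    parts.append(f"USER: {next_user_message}\n")
    parts.append("ASSISTANT: ")
    return "".join(parts)

prompt = build_prompt(
    turns=[("Hello!", "Hello! How can I help you?")],
    next_user_message="How will AI change our daily lives?",
)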

Model Details

  • Model size: 7B
  • Context length: 32,768 tokens
  • Model type: Transformer-based Language Model
  • Language(s): Japanese, English
  • Developed by: CyberAgent, Inc.
  • License: Apache-2.0

Author

Ryosuke Ishigami

Citations

@article{touvron2023llama,
  title={LLaMA: Open and Efficient Foundation Language Models},
  author={Touvron, Hugo and Lavril, Thibaut and Izacard, Gautier and Martinet, Xavier and Lachaux, Marie-Anne and Lacroix, Timoth{\'e}e and Rozi{\`e}re, Baptiste and Goyal, Naman and Hambro, Eric and Azhar, Faisal and Rodriguez, Aurelien and Joulin, Armand and Grave, Edouard and Lample, Guillaume},
  journal={arXiv preprint arXiv:2302.13971},
  year={2023}
}