Update README.md

9c489e6 verified 4 months ago

5.29 kB

	---
	license: gemma
	language:
	- ja
	- en
	base_model:
	- google/gemma-3-4b-it
	pipeline_tag: text-generation
	---
	# DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2 モデルカード

	モデルリポジトリ: [DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2](https://huggingface.co/DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2)

	ベースモデル: [google/gemma-3-4b-it](https://huggingface.co/google/gemma-3-4b-it)

	## Overview

	`DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2`は、Googleによって開発された強力なマルチモーダルモデルである`google/gemma-3-4b-it`をベースとしてファインチューニングされた指示応答モデルです。

	このモデルは、[Unsloth](https://github.com/unslothai/unsloth)ライブラリと高品質な合成データセットを用いてトレーニングされました。主な目的は、ベースモデルの持つ能力を維持・強化しつつ、特にプロンプト（指示）への追従能力とマルチターン対話における性能を向上させることです。

	Gemma 3ファミリーと同様に、テキスト入力と画像入力の両方に対応し、テキスト出力を生成することができます。

	## How to use

	このモデルを使用するには、Transformersライブラリ (バージョン 4.50.0 以降) が必要です。

	```bash
	pip install -U transformers accelerate Pillow requests torch
	```

	### 画像付き推論

	```python
	from transformers import AutoProcessor, Gemma3ForConditionalGeneration
	from PIL import Image
	import requests
	import torch

	model_id = "DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2"

	model = Gemma3ForConditionalGeneration.from_pretrained(
	model_id, device_map="auto"
	).eval()

	processor = AutoProcessor.from_pretrained(model_id)

	messages = [
	{
	"role": "system",
	"content": [{"type": "text", "text": "あなたは素晴らしい日本語アシスタントです。"}]
	},
	{
	"role": "user",
	"content": [
	{"type": "image", "image": "https://cs.stanford.edu/people/rak248/VG_100K_2/2399540.jpg"},
	{"type": "text", "text": "この画像を説明してください。"}
	]
	}
	]

	inputs = processor.apply_chat_template(
	messages, add_generation_prompt=True, tokenize=True,
	return_dict=True, return_tensors="pt"
	).to(model.device, dtype=torch.bfloat16)

	input_len = inputs["input_ids"].shape[-1]

	with torch.inference_mode():
	generation = model.generate(**inputs, max_new_tokens=100, do_sample=False)
	generation = generation[0][input_len:]

	decoded = processor.decode(generation, skip_special_tokens=True)
	print(decoded)
	```
	### 画像無し推論

	```python
	from transformers import AutoProcessor, Gemma3ForConditionalGeneration
	import torch

	model_id = "DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2"

	model = Gemma3ForConditionalGeneration.from_pretrained(
	model_id, device_map="auto"
	).eval()

	processor = AutoProcessor.from_pretrained(model_id)

	messages = [
	{
	"role": "system",
	"content": [{"type": "text", "text": "あなたは素晴らしい日本語アシスタントです。"}]
	},
	{
	"role": "user",
	"content": [
	{"type": "text", "text": "AI言語モデルであるLaMDAが意識があることを主張して弁護士を呼んだとのことです。LaMDAには意識があると思いますか？"}
	]
	}
	]

	inputs = processor.apply_chat_template(
	messages, add_generation_prompt=True, tokenize=True,
	return_dict=True, return_tensors="pt"
	).to(model.device, dtype=torch.bfloat16)

	input_len = inputs["input_ids"].shape[-1]

	with torch.inference_mode():
	generation = model.generate(**inputs, max_new_tokens=100, do_sample=False)
	generation = generation[0][input_len:]

	decoded = processor.decode(generation, skip_special_tokens=True)
	print(decoded)
	```

	注意点:

	* Gemma 3モデルは比較的大きなメモリを必要とします。特に4Bモデルでは、十分なVRAMを持つGPUが必要です。`torch_dtype=torch.bfloat16` や `device_map="auto"` を使用してメモリ使用量を削減・分散することを推奨します。
	* Gemma 3モデルはリモートコードの実行を要求する場合があります (`trust_remote_code=True`)。信頼できるソースからモデルをロードしていることを確認してください。
	* プロンプトの形式はモデルの性能に大きく影響します。ベースモデルである `google/gemma-3-4b-it` のチャットテンプレートに従うことを推奨します。上記コード例では `apply_chat_template` がこれを自動的に処理します。

	## License

	このモデルは、ベースモデルである `google/gemma-3-4b-it` のライセンスに基づいています。詳細については、[Gemma Terms of Use](https://ai.google.dev/gemma/terms) を参照してください。

	派生モデルである `DataPilot/ArrowMint-Gemma3-4B-ChocoMint-instruct-v0.2` の利用にあたっては、ベースモデルのライセンス条件、特に利用制限（[Gemma Prohibited Use Policy](https://ai.google.dev/gemma/prohibited_use_policy)）に従う必要があります。