Hibernates
/

Hibernates-JP-1.3b-Max

vision-language

Model card Files Files and versions Community

hibernatesai commited on Feb 9

Commit

24086a8

·

verified ·

1 Parent(s): 2626cb3

Update README.md

Files changed (1) hide show

README.md +51 -1

README.md CHANGED Viewed

@@ -1,8 +1,58 @@
 # Hibernates-JP-1.3b-Max Model Card
 Hibernates-JP-1.3b-Max は高解像度(1280x1280)に対応した日本語マルチモーダル言語モデルです。画像理解と自然な対話を組み合わせ、視覚的なコンテキストについて日本語で会話することができます。
-Hibernates-JP-1.3b-Max is a Japanese multimodal language model that supports high-resolution (1280x1280) images. It combines image understanding and natural dialogue, enabling conversations in Japanese about visual contexts.
 ## Updates in Latest Version
 - 高解像度(1280x1280)での画像処理に対応

+---
+language: ja
+tags:
+- japanese
+- vision-language
+- multimodal
+license: apache-2.0
+datasets:
+- custom
+model-index:
+- name: LLaVA-JP-1.3B
+  results: []
+---
+# Hibernates-JP-1.3b-Max
+This is a Japanese vision-language model based on LLaVA architecture with 1.3B parameters.
+## Model Details
+- Model Type: Vision-Language Model
+- Base Model: HibernatesGpt2 (1.3B parameters)
+- Vision Encoder: ConvNeXt Large
+- Training Data: Custom Japanese vision-language dataset
+- Context Length: 1024 tokens
+- Vision Resolution: 1280x1280
+- License: Apache 2.0
+## Usage
+[Add usage instructions here]
+## Training Details
+- Vision Encoder: ConvNeXt Large
+- Hidden Size: 2048
+- Number of Attention Heads: 16
+- Number of Layers: 24
+- Vision Feature Selection: patch
+- Vision Select Layer: -2
+- Multimodal Projector Type: mlp2x_gelu
+## Limitations
+[Add model limitations here]
+## Citation
+[Add citation information if applicable]
 # Hibernates-JP-1.3b-Max Model Card
 Hibernates-JP-1.3b-Max は高解像度(1280x1280)に対応した日本語マルチモーダル言語モデルです。画像理解と自然な対話を組み合わせ、視覚的なコンテキストについて日本語で会話することができます。
 ## Updates in Latest Version
 - 高解像度(1280x1280)での画像処理に対応