hibernatesai committed on
Commit 24086a8 · verified · 1 Parent(s): 2626cb3

Update README.md

  1. README.md +51 -1
README.md CHANGED
@@ -1,8 +1,58 @@
+ ---
+ language: ja
+ tags:
+ - japanese
+ - vision-language
+ - multimodal
+ license: apache-2.0
+ datasets:
+ - custom
+ model-index:
+ - name: LLaVA-JP-1.3B
+   results: []
+ ---
+
+ # Hibernates-JP-1.3b-Max
+
+ This is a Japanese vision-language model based on the LLaVA architecture with 1.3B parameters.
+
+ ## Model Details
+
+ - Model Type: Vision-Language Model
+ - Base Model: HibernatesGpt2 (1.3B parameters)
+ - Vision Encoder: ConvNeXt Large
+ - Training Data: Custom Japanese vision-language dataset
+ - Context Length: 1024 tokens
+ - Vision Resolution: 1280x1280
+ - License: Apache 2.0
+
+ ## Usage
+
+ [Add usage instructions here]
+
+ ## Training Details
+
+ - Vision Encoder: ConvNeXt Large
+ - Hidden Size: 2048
+ - Number of Attention Heads: 16
+ - Number of Layers: 24
+ - Vision Feature Selection: patch
+ - Vision Select Layer: -2
+ - Multimodal Projector Type: mlp2x_gelu
+
+ ## Limitations
+
+ [Add model limitations here]
+
+ ## Citation
+
+ [Add citation information if applicable]
+
  # Hibernates-JP-1.3b-Max Model Card


  Hibernates-JP-1.3b-Max is a Japanese multimodal language model with support for high resolution (1280x1280). It combines image understanding with natural dialogue, so it can converse in Japanese about visual context.
- Hibernates-JP-1.3b-Max is a Japanese multimodal language model that supports high-resolution (1280x1280) images. It combines image understanding and natural dialogue, enabling conversations in Japanese about visual contexts.
+
  ## Updates in Latest Version

  - Supports image processing at high resolution (1280x1280)
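
The figures in the added "Training Details" list can be sanity-checked with a little arithmetic. The sketch below is not part of the commit: it derives the per-head width from the listed hidden size and head count, and gives a rough weight count for the transformer blocks. The function names are hypothetical, and the `12 * layers * d^2` estimate assumes a standard GPT-2 style block, which the card does not confirm for HibernatesGpt2.

```python
# Values copied from the "Training Details" list in the model card.
HIDDEN_SIZE = 2048
NUM_HEADS = 16
NUM_LAYERS = 24


def head_dim(hidden_size: int, num_heads: int) -> int:
    """Per-head width; the hidden size must split evenly across heads."""
    if hidden_size % num_heads != 0:
        raise ValueError("hidden size must be divisible by the number of heads")
    return hidden_size // num_heads


def approx_block_params(hidden_size: int, num_layers: int) -> int:
    """Rough weight count for the transformer blocks only.

    ASSUMPTION: a standard GPT-2 style block, i.e. 4*d^2 weights for the
    attention projections (Q, K, V, output) plus 8*d^2 for an MLP with a
    4x expansion. Biases, layer norms, embeddings, the vision encoder and
    the multimodal projector are all ignored.
    """
    return 12 * num_layers * hidden_size ** 2


print(head_dim(HIDDEN_SIZE, NUM_HEADS))              # 128
print(approx_block_params(HIDDEN_SIZE, NUM_LAYERS))  # 1207959552
```

Under that assumption the blocks alone account for about 1.21B weights, which is in the same range as the stated 1.3B. The remainder would have to come from embeddings and other components, but the card does not give the vocabulary size, so this stays an estimate.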