---
title: README
emoji: 🐠
colorFrom: purple
colorTo: purple
sdk: static
pinned: false
license: cc-by-4.0
---

# 🏒 Welcome to Itbanque

**Itbanque** is dedicated to providing both high-quality fine-tuned models and structured datasets for AI, machine learning, and data-driven applications across various domains.

---

## 🧠 **Our Models**

Itbanque fine-tunes open-source foundation models for domain-specific tasks, with a current focus on speech translation and transcription.
We specialize in Whisper-based models adapted for accurate subtitle generation, especially for Japanese → Chinese translation.


### **Whisper-base-ja2zh**
A Whisper base model fully fine-tuned for Japanese speech to Chinese text translation.

- **BLEU Score** on Test Set: 0.72
- **Dataset**: ScreenTalk-JA2ZH

---

## 📊 **Our Datasets**
We offer datasets with **structured, high-quality, and continuously updated** data, making them ideal for training AI models. 

### 🔹 **ScreenTalk**
A large-scale transcribed/translated speech dataset sourced from screen content, suitable for ASR and NLP tasks.

- **XS Size** – Limited sample dataset.
- **Full Size** – Full access + real-time updates.

👉 [Explore ScreenTalk Dataset](https://huggingface.co/datasets/DataLabX/ScreenTalk-XS)
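
As a minimal sketch, the XS sample can probably be loaded with the Hugging Face `datasets` library. The repository id below is taken from the link above; the helper name `load_screentalk_xs` is hypothetical, and the split and column names are not documented here, so the example only prints whatever the Hub returns. Access may also require logging in or accepting terms on the dataset page.

```python
from typing import Any, Optional

# Repository id copied from the "Explore ScreenTalk Dataset" link.
SCREENTALK_XS_REPO = "DataLabX/ScreenTalk-XS"


def load_screentalk_xs(split: Optional[str] = None) -> Any:
    """Download ScreenTalk-XS from the Hugging Face Hub (needs network access).

    Hypothetical helper for illustration; split names are an assumption
    and should be checked against the dataset page.
    """
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # With split=None, load_dataset returns every available split.
    return load_dataset(SCREENTALK_XS_REPO, split=split)


if __name__ == "__main__":
    ds = load_screentalk_xs()
    print(ds)  # shows the available splits and column names
```

Install the dependency first with `pip install datasets`.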

---

## 🚀 **Why Choose DataLabX?**
✅ **High-quality, structured datasets** for AI training.  
✅ **Regular updates** to ensure fresh, relevant data.  
✅ **Different dataset sizes** to fit various user needs, from XS to the full version.

---

## 💡 **Support Our Work**
We are committed to providing high-quality datasets for AI research and development. Your support enables us to continue expanding and refining our datasets for better AI applications across multiple industries.

🔗 Donate & Support

<img src="https://cdn-uploads.huggingface.co/production/uploads/6781996a81e69ba91a2070f1/Bby8AOiyJ5MarpLttuKrF.jpeg" width="250" height="250"/>

---

## 📬 **Get in Touch**
If you have any questions, need a custom dataset, or require enterprise licensing, feel free to reach out:

📧 **Contact:** [fj11](mailto:[email protected])