metadata

base_model: mini1013/master_domain
library_name: setfit
metrics:
  - accuracy
pipeline_tag: text-classification
tags:
  - setfit
  - sentence-transformers
  - text-classification
  - generated_from_setfit_trainer
widget:
  - text: 김정문알로에 큐어+ 하이드라 릴리프크림 익스트림 50g 3개+미니2개 옵션없음 빈티지브릿지
  - text: 듀이셀 프라이빗케어 마스크 5매입 세트 옵션없음 다사다 유한책임회사
  - text: 코튼 만화 얼굴 수면 아이 마스크, 귀여운 재미있는 커버, 여행 휴식 밴드, 1 개 05 A006-5 글로젠
  - text: 마이크로바이옴 비건 모델링팩 모공 수축 수분 진정 마스크 팩 2. 모델링팩 2set (14매입) 주식회사 에이치티오인터내셔널
  - text: 린제이 팩도구 + 프리미엄 모델링 팩 쿨 티트리 1kg 01. 팩도구4종+지퍼백★_(미백영양)더마테크 나이아신아마이드 400g 예승
inference: true
model-index:
  - name: SetFit with mini1013/master_domain
    results:
      - task:
          type: text-classification
          name: Text Classification
        dataset:
          name: Unknown
          type: unknown
          split: test
        metrics:
          - type: accuracy
            value: 0.6094032549728752
            name: Accuracy

SetFit with mini1013/master_domain

This is a SetFit model that can be used for Text Classification. This SetFit model uses mini1013/master_domain as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.

The model has been trained using an efficient few-shot learning technique that involves:

Fine-tuning a Sentence Transformer with contrastive learning.
Training a classification head with features from the fine-tuned Sentence Transformer.

Model Details

Model Description

Model Type: SetFit
Sentence Transformer body: mini1013/master_domain
Classification head: a LogisticRegression instance
Maximum Sequence Length: 512 tokens
Number of Classes: 8 classes

Model Sources

Repository: SetFit on GitHub
Paper: Efficient Few-Shot Learning Without Prompts
Blogpost: SetFit: Efficient Few-Shot Learning Without Prompts

Model Labels

Label	Examples
1.0	'[듀이셀] 세라젯 마스크팩 5매 듀이셀' '[김정문알로에] 큐어 피토 앰플 마스크팩 시즌5 (30매)[34653634] NS홈쇼핑' '션리 일일일팩 광채, 보습, 미백, 주름, 콜라겐, 6종 대용량 마스크팩 100매 ShionLe'
6.0	'가르니에 블랙헤드 코팩 3박스 옵션없음 케이바겐' '장영란 아기코세트 멜팅클리어패드 30매 + 모공앰플 30ml 옵션없음 세일러15' '본트리 레몬티트리 블랙헤드 코팩 10개입 2개 옵션없음 건강드림'
5.0	'뉴스킨 에포크 글레이셜 머린 머드 / 머드팩 125g [특가]에포크 머린머드/24년 9월까지 (주)인포러스' '레몬꿀팩 라비오라 레몬 허니 브라이트닝 워시오프 팩 100g 외 제품들 워시오프 팩 100g + 코어탄력크림 50ml 쉼포니' '이즈앤트리 머그워트 카밍 클레이 마스크 100ml 1개 옵션없음 비래유통'
4.0	'[빌리프] 슈퍼나이츠 클리어 수딩 마스크 점보 120 mL 옵션없음 (주)엘지생활건강' '메노킨 30초 퀵 버블 마스크_브라이트 옵션없음 주식회사 포레스트에비뉴' '소노지아 스킨블링 버블마스크 콜라겐 수면팩 (80ML) 옵션없음 미네렐라샵'
7.0	'메디필 레드락토 콜라겐 랩핑마스크 70ml 물광 리프팅 팩 옵션없음 웬디스룸' '동국 벨프리모 필오프팩 4개+브러쉬TV광고상품 랩핑마스크 모델링팩 8개+브러쉬2개 주식회사 셀업' '끌레드벨 파워 리프트 프로그램 원킬 브이 리프팅 마스크 70ml 옵션없음 달달하우스'
2.0	'더마픽스 휴멀 콜라겐 마스크 23g 옵션없음 다판다' '시트까지 비건인증 달바 화이트 트러플 너리싱 트리트먼트 마스크 1매 옵션없음 오피스템' '오겔리 노워터 아이패치 누드 화이트 화이트 와이에스엠무역'
3.0	'린제이 프리미엄 쿨 티트리 모델링 마스크 고무팩 820g 1021908 옵션없음 메가랜드' '데쌍브르 골드 필 오프 모델링팩 1kg 데쌍브르 골드 필 오프 모델링팩 1kg 마가렛스킨' '데쌍브르 골드 모델링 마스크 1000g + 고무볼,스파츌라,앰플 골드 모델링마스크 반하다'
0.0	'더후 공진향 인양 넥앤페이스 탄력 리페어75ml 옵션없음 씨플랩' '마리꼬 인스턴트 수딩 마스크 50ml / 150ml 마마리꼬 인스턴트 수딩 마스크 150ml_마리꼬 튜브형 샘플 3ml (랜덤) 4개 오늘의뷰티' 'IS-SA 오이 바이탈라이징 마사지크림 500g 옵션없음 우리의쇼핑'

Evaluation

Metrics

Label	Accuracy
all	0.6094

Uses

Direct Use for Inference

First install the SetFit library:

pip install setfit

Then you can load this model and run inference.

from setfit import SetFitModel

# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("mini1013/master_cate_bt2_test")
# Run inference
preds = model("듀이셀 프라이빗케어 마스크 5매입 세트 옵션없음 다사다 유한책임회사")

Training Details

Training Set Metrics

Training set	Min	Median	Max
Word count	3	9.7453	23

Label	Training Sample Count
0.0	16
1.0	10
2.0	42
3.0	21
4.0	20
5.0	19
6.0	15
7.0	18

Training Hyperparameters

batch_size: (512, 512)
num_epochs: (50, 50)
max_steps: -1
sampling_strategy: oversampling
num_iterations: 60
body_learning_rate: (2e-05, 1e-05)
head_learning_rate: 0.01
loss: CosineSimilarityLoss
distance_metric: cosine_distance
margin: 0.25
end_to_end: False
use_amp: False
warmup_proportion: 0.1
l2_weight: 0.01
seed: 42
eval_max_steps: -1
load_best_model_at_end: False

Training Results

Epoch	Step	Training Loss	Validation Loss
0.0526	1	0.4874	-
2.6316	50	0.341	-
5.2632	100	0.0524	-
7.8947	150	0.0023	-
10.5263	200	0.0001	-
13.1579	250	0.0001	-
15.7895	300	0.0	-
18.4211	350	0.0	-
21.0526	400	0.0	-
23.6842	450	0.0	-
26.3158	500	0.0	-
28.9474	550	0.0	-
31.5789	600	0.0	-
34.2105	650	0.0	-
36.8421	700	0.0	-
39.4737	750	0.0	-
42.1053	800	0.0	-
44.7368	850	0.0	-
47.3684	900	0.0	-
50.0	950	0.0	-

Framework Versions

Python: 3.10.12
SetFit: 1.1.0
Sentence Transformers: 3.3.1
Transformers: 4.44.2
PyTorch: 2.2.0a0+81ea7a4
Datasets: 3.2.0
Tokenizers: 0.19.1

Citation

BibTeX

@article{https://doi.org/10.48550/arxiv.2209.11055,
    doi = {10.48550/ARXIV.2209.11055},
    url = {https://arxiv.org/abs/2209.11055},
    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
    title = {Efficient Few-Shot Learning Without Prompts},
    publisher = {arXiv},
    year = {2022},
    copyright = {Creative Commons Attribution 4.0 International}
}