metadata

base_model: mini1013/master_domain
library_name: setfit
metrics:
  - accuracy
pipeline_tag: text-classification
tags:
  - setfit
  - sentence-transformers
  - text-classification
  - generated_from_setfit_trainer
widget:
  - text: >-
      루나 롱래스팅 팁 컨실러 6.5/7.5g (SPF34 PA++) [챔피언십 에디션] [챔피언십] 팁 컨실러 0.7호+듀얼 컨실러
      애경산업 (주) 청양공장
  - text: 올리브영프라이머 VDL 컬러 코렉팅 프라이머 30ml 로즈쿼츠 오늘의 몰
  - text: 미니 튜브형 케이스 로션케이스 가정생활용품 50ml 셀러팩토리
  - text: 이자녹스 듀얼 커버 쿠션 15ml(SPF50+) 본품 23호 동의함 구일브라더스
  - text: 에스티로더 더블웨어 플로리스 하이드레이팅 프라이머 SPF45/PA++++ (30ml) 옵션없음 마이소브
inference: true
model-index:
  - name: SetFit with mini1013/master_domain
    results:
      - task:
          type: text-classification
          name: Text Classification
        dataset:
          name: Unknown
          type: unknown
          split: test
        metrics:
          - type: accuracy
            value: 0.7306034482758621
            name: Accuracy

SetFit with mini1013/master_domain

This is a SetFit model that can be used for Text Classification. This SetFit model uses mini1013/master_domain as the Sentence Transformer embedding model. A LogisticRegression instance is used for classification.

The model has been trained using an efficient few-shot learning technique that involves:

Fine-tuning a Sentence Transformer with contrastive learning.
Training a classification head with features from the fine-tuned Sentence Transformer.

Model Details

Model Description

Model Type: SetFit
Sentence Transformer body: mini1013/master_domain
Classification head: a LogisticRegression instance
Maximum Sequence Length: 512 tokens
Number of Classes: 7 classes

Model Sources

Repository: SetFit on GitHub
Paper: Efficient Few-Shot Learning Without Prompts
Blogpost: SetFit: Efficient Few-Shot Learning Without Prompts

Model Labels

Label	Examples
0.0	'아누아 매트벗 글로우 커버 베이지 50ml(SPF50+) 옵션없음 엠더블유스토어(MWstore)' '아누아 환해지는 매트벗 글로우 커버 베이지 50ml SPF50 PA++++ 1개 아누아 환해지는 매트벗 글로우 커버 베이지 1 호랑이커머스' '닥터슈라멕 슈라멕비비 슈라멕 블레미쉬 밤 클래식 30ml 옵션없음 이노포유'
3.0	'티핏 커버 업 프로 컨실러 웜(딥베이지/내추럴베이지/피치베이지) 옵션없음 주식회사 더한' '더샘 커버 퍼펙션 트리플 팟 컨실러 05 다크업 베이지 커버 퍼펙션 트리플 팟 컨실러 05 다크업 베 쇼핑천국이야기' '나스 소프트 매트 컴플리트 컨실러 6.2g 커스터드 블루밍컴퍼니'
1.0	'설화수 퍼펙팅 베일 베이스 30ml/1호/2호/SPF29 PA++ 1호 핑크베이지 케이상사' '가히 한겹 크림 30ml(SPF35) 옵션없음 엔에스유' '입큰 톤웨어 틴티드 베이스 40ml(SPF35) 소프트레몬 쑤기쓰마켓'
5.0	'티니올 크리미 팩트 견미리 동안 커버 본품 2개+리필 4개 티니올 팩트 오로라 에디션 23호_본품(케이스랜덤)+리필 2개 더블유앤더블유' '미샤 매직쿠션 본품+리필 - 모이스트/커버래스팅 커버래스팅 23호+리필 원팝' 'Make Up For Ever 매트 벨벳 스킨 풀 커버리지 R260 핑크 베이지 옵션없음 글랜가'
4.0	'MAJOLICA MAJORCA (마조리카 마조르카) 프레스토 포어 커버 10g 옵션없음 Thanks Auction' '동시에 잡는 트윈케익 한방 21호 지속성을케이크 커버력과케이크 옵션없음 평강이' 'Almay 프레스드 파우더 올 세트 노 샤인, 마이 베스트 라이트, [100] 0.20 oz 옵션없음 케이피스토어'
6.0	'앤디시도 유아몬드 애교살메이커 애교살쿠션 웜베이지 블랭크' '바비브라운 프라이머 플러스 프로텍션 40ml(SPF50+) 옵션없음 다사다 유한책임회사' '바닐라코 프라임 프라이머 하이드레이팅 30ml 옵션없음 비엠유통'
2.0	'V&A Beauty 쿠션+앰플+크림 SET 브이앤에이' 'AHC 누드 크림&스틱 세트 (크림 40ml+컨실링 스틱 10g+미니 2종+파우치) AHC 공식스토어' '[에뛰드] 님프 광채 볼류머+컨실러 2종 세트 에뛰드'

Evaluation

Metrics

Label	Accuracy
all	0.7306

Uses

Direct Use for Inference

First install the SetFit library:

pip install setfit

Then you can load this model and run inference.

from setfit import SetFitModel

# Download from the 🤗 Hub
model = SetFitModel.from_pretrained("mini1013/master_cate_bt4_test")
# Run inference
preds = model("미니 튜브형 케이스 로션케이스 가정생활용품 50ml 셀러팩토리")

Training Details

Training Set Metrics

Training set	Min	Median	Max
Word count	5	9.7872	19

Label	Training Sample Count
0.0	19
1.0	21
2.0	10
3.0	19
4.0	28
5.0	23
6.0	21

Training Hyperparameters

batch_size: (512, 512)
num_epochs: (30, 30)
max_steps: -1
sampling_strategy: oversampling
num_iterations: 60
body_learning_rate: (2e-05, 1e-05)
head_learning_rate: 0.01
loss: CosineSimilarityLoss
distance_metric: cosine_distance
margin: 0.25
end_to_end: False
use_amp: False
warmup_proportion: 0.1
l2_weight: 0.01
seed: 42
eval_max_steps: -1
load_best_model_at_end: False

Training Results

Epoch	Step	Training Loss	Validation Loss
0.0588	1	0.4976	-
2.9412	50	0.2385	-
5.8824	100	0.006	-
8.8235	150	0.0002	-
11.7647	200	0.0001	-
14.7059	250	0.0001	-
17.6471	300	0.0001	-
20.5882	350	0.0001	-
23.5294	400	0.0001	-
26.4706	450	0.0001	-
29.4118	500	0.0001	-

Framework Versions

Python: 3.10.12
SetFit: 1.1.0
Sentence Transformers: 3.3.1
Transformers: 4.44.2
PyTorch: 2.2.0a0+81ea7a4
Datasets: 3.2.0
Tokenizers: 0.19.1

Citation

BibTeX

@article{https://doi.org/10.48550/arxiv.2209.11055,
    doi = {10.48550/ARXIV.2209.11055},
    url = {https://arxiv.org/abs/2209.11055},
    author = {Tunstall, Lewis and Reimers, Nils and Jo, Unso Eun Seo and Bates, Luke and Korat, Daniel and Wasserblat, Moshe and Pereg, Oren},
    keywords = {Computation and Language (cs.CL), FOS: Computer and information sciences, FOS: Computer and information sciences},
    title = {Efficient Few-Shot Learning Without Prompts},
    publisher = {arXiv},
    year = {2022},
    copyright = {Creative Commons Attribution 4.0 International}
}