---
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
library_name: peft
---

# Model Card for Fine-Tuned LLaMA Empathy

## Model Summary

Fine-Tuned LLaMA Empathy is a large language model fine-tuned to enhance emotional understanding and generate needs-based responses. This model is designed for use in psychology, therapy, conflict resolution, human-computer interaction, and online moderation. It is based on the Meta-Llama-3.1-8B-Instruct model and utilizes LoRA (Low-Rank Adaptation) for efficient fine-tuning.

## Model Details

### Model Description

- **Developed by:** AI Medical in collaboration with Ruslanmv.com
- **Funded by:**
- **Shared by:** AI Medical
- **Model type:** Fine-tuned Meta-Llama-3.1-8B-Instruct
- **Language(s) (NLP):** English
- **License:** Creative Commons Attribution 4.0 International License (CC BY 4.0)
- **Fine-tuned from model:** meta-llama/Meta-Llama-3.1-8B-Instruct

### Model Sources

- **Repository:** [Hugging Face Model Repository](https://huggingface.co/ruslanmv/fine_tuned_llama_empathy)

## Uses

### Direct Use

- **Psychology & Therapy:** Assisting professionals in understanding and responding empathetically to patient emotions.
- **Conflict Resolution:** Helping mediators decode emotional expressions and address underlying needs.
- **Human-Computer Interaction:** Enhancing chatbots and virtual assistants with emotionally aware responses.
- **Social Media Moderation:** Reducing toxicity and improving online discourse through need-based responses.
- **Education:** Supporting emotional intelligence training and communication skill development.

### Downstream Use

- Fine-tuning for specialized applications in mental health, conflict resolution, or AI-driven assistance.
- Integration into virtual therapists, mental health applications, and online support systems.

### Out-of-Scope Use

- Not a substitute for professional psychological evaluation or medical treatment.
- Not suitable for high-risk applications requiring absolute accuracy in emotional interpretation.

## Bias, Risks, and Limitations

- **Bias:** As with any NLP model, biases may be present due to the dataset and training methodology; the LLaMA base model is known to reflect biases from its pretraining data.
- **Risk of Misinterpretation:** Emotional expressions are subjective and may be misclassified in complex scenarios.
- **Generalization Limitations:** May not fully capture cultural and contextual variations in emotional expression.

### Recommendations

Users should verify outputs before applying them in professional or high-stakes settings. Continuous evaluation and user feedback are recommended.

## How to Get Started with the Model

```python
from transformers import pipeline

# The repository contains a LoRA adapter, so loading it through `pipeline`
# requires the `peft` library to be installed alongside `transformers`.
model_name = "ruslanmv/fine_tuned_llama_empathy"
generator = pipeline("text-generation", model=model_name)

prompt = "I feel betrayed."
response = generator(prompt, max_new_tokens=50)
print(response[0]["generated_text"])
```
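
Because the repository ships a PEFT (LoRA) adapter rather than full model weights, the adapter can also be attached to the base model explicitly. A minimal sketch, assuming access to the gated meta-llama base checkpoint, a GPU, and `accelerate` installed for `device_map="auto"`:

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
adapter_id = "ruslanmv/fine_tuned_llama_empathy"

# Load the base model, then attach the LoRA adapter weights from this repository.
tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base_model, adapter_id)

inputs = tokenizer("I feel betrayed.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```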

## Training Details

### Training Data

- **Dataset:** Annotated dataset mapping evaluative expressions to emotions and needs.
- **Annotations:** 1,500+ labeled examples linking expressions to emotional states and corresponding needs.

### Training Procedure

#### Preprocessing

- Tokenized using the Hugging Face `transformers` library (see the sketch below).
- Augmented with synonym variations and paraphrased sentences.
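
The dataset schema is not published in this card, so the following is only a rough illustration of the tokenization step, using a hypothetical expression/emotion/need record:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")

# Hypothetical training record; the real annotation schema is not included in this card.
record = {
    "expression": "You never listen to me.",
    "emotion": "frustration",
    "need": "to be heard",
}
text = (
    f"Expression: {record['expression']}\n"
    f"Emotion: {record['emotion']}\n"
    f"Need: {record['need']}"
)

encoded = tokenizer(text, truncation=True, max_length=512)
print(len(encoded["input_ids"]))
```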

#### Training Hyperparameters

- **Training regime:** Mixed-precision training with LoRA.
- **Batch size:** 32
- **Learning rate:** 2e-5
- **Training steps:** 1,000
- **Hardware:** 1x A100 GPU with DeepSpeed ZeRO-3
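
As a rough, non-authoritative illustration of how these hyperparameters map onto a Hugging Face `TrainingArguments` object (the authors' actual training script is not published, and the DeepSpeed config path is a placeholder):

```python
from transformers import TrainingArguments

# Illustrative mapping of the hyperparameters listed above; not the authors' exact setup.
training_args = TrainingArguments(
    output_dir="./fine_tuned_llama_empathy",
    per_device_train_batch_size=32,    # batch size 32
    learning_rate=2e-5,                # learning rate 2e-5
    max_steps=1000,                    # ~1,000 optimizer steps
    bf16=True,                         # mixed-precision training on an A100
    deepspeed="ds_zero3_config.json",  # placeholder path to a DeepSpeed ZeRO-3 config
    logging_steps=50,
)
```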

## Evaluation

### Testing Data, Factors & Metrics

#### Testing Data

- Held-out dataset containing unseen evaluative expressions.

#### Factors

- Performance across different emotional expression categories.
- Sensitivity to nuanced phrasing and variations.

#### Metrics

- **Accuracy:** Share of expressions whose emotion and need are classified correctly.
- **Precision & Recall:** Balance between capturing true emotions and avoiding false positives.
- **F1-Score:** Harmonic mean of precision and recall.
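
For reference, these metrics can be computed over held-out predictions, for example with scikit-learn (which is not listed among this card's dependencies); the labels below are invented for illustration:

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Invented emotion labels, for illustration only.
y_true = ["anger", "sadness", "fear", "anger", "joy"]
y_pred = ["anger", "sadness", "anger", "anger", "joy"]

accuracy = accuracy_score(y_true, y_pred)
precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="weighted", zero_division=0
)
print(f"accuracy={accuracy:.3f} precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```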

### Results

- **Accuracy:** 89.5%
- **F1-Score:** 87.2%
- **Latency:** <500 ms response time

## Environmental Impact

- **Hardware Type:** A100 GPUs
- **Training Time:** hours
- **Carbon Emitted:** Estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute).

## Technical Specifications

### Model Architecture and Objective

- **Base model:** meta-llama/Meta-Llama-3.1-8B-Instruct
- **Fine-tuning:** LoRA for parameter-efficient training, with key parameters `r=8`, `lora_alpha=16`, `lora_dropout=0.2`, and `target_modules=["v_proj", "q_proj"]`.
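
A minimal sketch of how these LoRA parameters translate into a `peft` configuration (illustrative only; the authors' actual training code is not part of this card):

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# LoRA parameters as listed above.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.2,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)

# Wrapping the base model reports how few parameters LoRA actually trains.
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
peft_model = get_peft_model(base_model, lora_config)
peft_model.print_trainable_parameters()
```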

### Compute Infrastructure

- **Hardware:** AWS spot instances (1x A100 GPU)
- **Software:** Hugging Face `transformers`, PEFT, PyTorch

## Citation

If you use this model, please cite:

```bibtex
@misc{ai-medical_2025,
  author       = {{AI Medical} and {ruslanmv.com}},
  title        = {Fine-Tuned LLaMA Empathy},
  year         = {2025},
  howpublished = {\url{https://huggingface.co/ruslanmv/fine_tuned_llama_empathy}}
}
```

## More Information

- **Model Card Authors:** AI Medical Team, ruslanmv.com
- **Framework Versions:** PEFT 0.14.0
|