Gavit0 commited on
Commit
78cfa8f
Β·
verified Β·
0 Parent(s):

initial commit

Browse files
Files changed (2) hide show
  1. .gitattributes +55 -0
  2. README.md +128 -0
.gitattributes ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.lz4 filter=lfs diff=lfs merge=lfs -text
12
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
13
+ *.model filter=lfs diff=lfs merge=lfs -text
14
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
15
+ *.npy filter=lfs diff=lfs merge=lfs -text
16
+ *.npz filter=lfs diff=lfs merge=lfs -text
17
+ *.onnx filter=lfs diff=lfs merge=lfs -text
18
+ *.ot filter=lfs diff=lfs merge=lfs -text
19
+ *.parquet filter=lfs diff=lfs merge=lfs -text
20
+ *.pb filter=lfs diff=lfs merge=lfs -text
21
+ *.pickle filter=lfs diff=lfs merge=lfs -text
22
+ *.pkl filter=lfs diff=lfs merge=lfs -text
23
+ *.pt filter=lfs diff=lfs merge=lfs -text
24
+ *.pth filter=lfs diff=lfs merge=lfs -text
25
+ *.rar filter=lfs diff=lfs merge=lfs -text
26
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
27
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
29
+ *.tar filter=lfs diff=lfs merge=lfs -text
30
+ *.tflite filter=lfs diff=lfs merge=lfs -text
31
+ *.tgz filter=lfs diff=lfs merge=lfs -text
32
+ *.wasm filter=lfs diff=lfs merge=lfs -text
33
+ *.xz filter=lfs diff=lfs merge=lfs -text
34
+ *.zip filter=lfs diff=lfs merge=lfs -text
35
+ *.zst filter=lfs diff=lfs merge=lfs -text
36
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
37
+ # Audio files - uncompressed
38
+ *.pcm filter=lfs diff=lfs merge=lfs -text
39
+ *.sam filter=lfs diff=lfs merge=lfs -text
40
+ *.raw filter=lfs diff=lfs merge=lfs -text
41
+ # Audio files - compressed
42
+ *.aac filter=lfs diff=lfs merge=lfs -text
43
+ *.flac filter=lfs diff=lfs merge=lfs -text
44
+ *.mp3 filter=lfs diff=lfs merge=lfs -text
45
+ *.ogg filter=lfs diff=lfs merge=lfs -text
46
+ *.wav filter=lfs diff=lfs merge=lfs -text
47
+ # Image files - uncompressed
48
+ *.bmp filter=lfs diff=lfs merge=lfs -text
49
+ *.gif filter=lfs diff=lfs merge=lfs -text
50
+ *.png filter=lfs diff=lfs merge=lfs -text
51
+ *.tiff filter=lfs diff=lfs merge=lfs -text
52
+ # Image files - compressed
53
+ *.jpg filter=lfs diff=lfs merge=lfs -text
54
+ *.jpeg filter=lfs diff=lfs merge=lfs -text
55
+ *.webp filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,128 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - text-to-image
4
+ - lora
5
+ - diffusers
6
+ - template:diffusion-lora
7
+ widget:
8
+ - text: >-
9
+ Person with expression of pain due to a heart attack, Middle-aged woman in
10
+ her kitchen leaning on a counter, struggling to breathe, showing signs of a
11
+ cardiac emergency.
12
+ parameters:
13
+ negative_prompt: >-
14
+ blurry, deformed face, bad anatomy, poorly drawn face, out of focus, ugly,
15
+ noisy, extra fingers, distorted, grainy, worst quality, low quality, low
16
+ resolution, illustration, dull, watermark, close-up, 3d, 2d, painting,
17
+ sketch, render, cartoon, grain, kitsch
18
+ output:
19
+ url: images/sd-2.1-infarct-lora-019-623607189.jpg
20
+ - text: >-
21
+ Person with expression of pain due to a heart attack, Elderly man at a
22
+ sports stadium surrounded by a crowd, clutching his chest with a distressed
23
+ look, indicating a heart attack.
24
+ parameters:
25
+ negative_prompt: >-
26
+ blurry, deformed face, bad anatomy, poorly drawn face, out of focus, ugly,
27
+ noisy, extra fingers, distorted, grainy, worst quality, low quality, low
28
+ resolution, illustration, dull, watermark, close-up, 3d, 2d, painting,
29
+ sketch, render, cartoon, grain, kitsch
30
+ output:
31
+ url: images/sd-2.1-infarct-lora-084-766027158.jpg
32
+ base_model: stabilityai/stable-diffusion-2-1-base
33
+ instance_prompt: Person with expression of pain due to a heart attack, infarct
34
+ license: mit
35
+ ---
36
+ # Infarct Image
37
+
38
+ <Gallery />
39
+
40
+ ## Model description
41
+
42
+ # InfarctImage - LoRA Fine-Tuned Model for Heart Attack Simulation
43
+
44
+ ![sd-2.1-infarct-lora-051-3501614513.jpg](https:&#x2F;&#x2F;cdn-uploads.huggingface.co&#x2F;production&#x2F;uploads&#x2F;644580ada56444c355da1b15&#x2F;GGoidcsg95LqZrdlv4eMq.jpeg)
45
+
46
+ ## πŸ“Œ Description
47
+ **InfarctImage** is a LoRA-based model fine-tuned on **Stable Diffusion 2.1**, designed to generate realistic images of people simulating a heart attack. This model was developed as part of a study on synthetic dataset generation for human activity recognition and medical emergency monitoring applications.
48
+
49
+ πŸ”— **Related Article:** Coming soon.
50
+
51
+ ## 🎯 Objective
52
+ The model addresses the issue of data scarcity in medical and anomaly detection environments. Generating high-quality synthetic images enables:
53
+ - Expanding existing datasets without relying on real-world data.
54
+ - Overcoming ethical and logistical restrictions in medical image collection.
55
+ - Enhancing AI-based detection of critical events like heart attacks.
56
+
57
+ ## πŸ“₯ Download and Installation
58
+ To use this model with **Diffusers**, follow these steps:
59
+
60
+ &#x60;&#x60;&#x60;python
61
+ from diffusers import StableDiffusionPipeline
62
+ from peft import PeftModel
63
+ import torch
64
+
65
+ # Load the base model
66
+ base_model &#x3D; StableDiffusionPipeline.from_pretrained(&quot;stabilityai&#x2F;stable-diffusion-2-1&quot;)
67
+
68
+ # Load LoRA weights
69
+ lora_model &#x3D; PeftModel.from_pretrained(base_model, &quot;G
70
+ ![sd-2.1-infarct-lora-051-3501614513.jpg](https:&#x2F;&#x2F;cdn-uploads.huggingface.co&#x2F;production&#x2F;uploads&#x2F;644580ada56444c355da1b15&#x2F;vqPxaiq1iKpC1EQKp-gnD.jpeg)
71
+ avit0&#x2F;InfarctImage&quot;)
72
+
73
+ lora_model.to(torch.device(&quot;cuda&quot;)) # Move to GPU if available
74
+ &#x60;&#x60;&#x60;
75
+
76
+ ## πŸ“Š Training Data
77
+ The model was trained on a dataset of 100 manually annotated images, including:
78
+ - 50 images of people simulating heart attack symptoms.
79
+ - 50 images of people in neutral contexts.
80
+
81
+ The dataset was processed and annotated using **BLIP (Bootstrapping Language-Image Pretraining)** to enhance image descriptions and improve training prompts.
82
+
83
+ ## πŸ”§ Hyperparameters and Configuration
84
+ - **Base Model:** Stable Diffusion 2.1
85
+ - **Fine-Tuning Technique:** LoRA (Low-Rank Adaptation)
86
+ - **Learning Rate:** 0.0001
87
+ - **Batch Size:** 1
88
+ - **Epochs:** 10
89
+ - **Hardware:** NVIDIA RTX 4090 (24GB VRAM)
90
+
91
+ ## πŸ“ˆ Model Evaluation
92
+ **LPIPS (Learned Perceptual Image Patch Similarity)** was used to evaluate the quality of the generated images. Results show that LoRA fine-tuning improves the perceptual similarity of generated images compared to real training data.
93
+
94
+ | Model | LPIPS (↓ Better) |
95
+ |--------|---------------|
96
+ | SD 2.1 Base | 0.7366 |
97
+ | SD 2.1 + LoRA | 0.6919 |
98
+
99
+ ## πŸ† Usage Examples
100
+ You can generate images using prompts like:
101
+ &#x60;&#x60;&#x60;python
102
+ prompt &#x3D; &quot;Person with expression of pain due to a heart attack, A middle-aged man clutching his chest in pain, showing signs of a heart attack.&quot;
103
+ image &#x3D; lora_model(prompt&#x3D;prompt).images[0]
104
+ image.show()
105
+ &#x60;&#x60;&#x60;
106
+
107
+ ## πŸ“œ License
108
+ This model is distributed under the **MIT License**.
109
+
110
+ ## πŸ’‘ Contributions and Contact
111
+ If you want to contribute or have any questions, contact me at **[email protected]** or open an issue in this repository.
112
+
113
+
114
+
115
+
116
+
117
+ ## Trigger words
118
+
119
+ You should use `Person with expression of pain due to a heart attack` to trigger the image generation.
120
+
121
+ You should use `infarct` to trigger the image generation.
122
+
123
+
124
+ ## Download model
125
+
126
+ Weights for this model are available in Safetensors format.
127
+
128
+ [Download](/Gavit0/InfarctImage/tree/main) them in the Files & versions tab.