Hugging Face
Reda alami
RedaAlami
8 followers · 3 following
AI & ML interests
Reinforcement Learning
Recent Activity
published a model 1 day ago: RedaAlami/Qwen2-0.5B-GRPO-test
updated a dataset 9 days ago: RedaAlami/merged-dpo-safety
published a dataset 9 days ago: RedaAlami/merged-dpo-safety
Organizations
spaces (1)
TestRecommenderSystem • Sleeping
models (12)
RedaAlami/Qwen2-0.5B-GRPO-test • Updated 1 day ago
RedaAlami/zephyr-7b-dpo-qlora • Updated Oct 4, 2024 • 12
RedaAlami/zephyr-7b-dpo-full • Updated Aug 29, 2024
RedaAlami/merged-dataset0-dataset1 • Updated Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo • Updated Jul 31, 2024 • 6
RedaAlami/ultrafeedback_binarized_custom2 • Updated Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom • Updated Jul 17, 2024
RedaAlami/ultrafeedback_binarized_processed • Updated Jul 12, 2024
RedaAlami/falcon-11b-instruct-dpo-full • Updated Jul 1, 2024
RedaAlami/recommendation_sports_equipment_english • Updated May 29, 2024
datasets (139)
RedaAlami/merged-dpo-safety • Updated 9 days ago • 3.95k • 17
RedaAlami/eng-batch-3-dpo-safety_test • Updated 9 days ago • 36 • 20
RedaAlami/eng-batch-4-dpo-safety_test • Updated 9 days ago • 53 • 21
RedaAlami/eng-batch-5-dpo-safety_test • Updated 9 days ago • 63 • 18
RedaAlami/eng-batch-6-dpo-safety_test • Updated 9 days ago • 58 • 20
RedaAlami/eng-batch-6-dpo-safety_train • Updated 9 days ago • 1.11k • 18
RedaAlami/eng-batch-5-dpo-safety_train • Updated 9 days ago • 977 • 18
RedaAlami/eng-batch-4-dpo-safety_train • Updated 9 days ago • 1.06k • 20
RedaAlami/eng-batch-3-dpo-safety_train • Updated 9 days ago • 596 • 18
RedaAlami/hate_lgbtq_v3 • Updated Dec 10, 2024 • 393 • 68