Update README.md
Browse files
README.md
CHANGED
@@ -26,24 +26,27 @@
|
|
26 |
|
27 |
|
28 |
> Open-source test generation SDK for LLM applications.
|
|
|
29 |
|
30 |
-
Rhesis
|
31 |
|
32 |
-
###
|
33 |
|
34 |
-
|
35 |
|
36 |
-
|
|
|
|
|
37 |
|
|
|
38 |
|
39 |
-
## Features
|
40 |
|
41 |
-
|
|
|
|
|
42 |
|
43 |
-
|
44 |
-
- **Load Test Sets**: Load specific test sets for your use case
|
45 |
-
- **Download Test Sets**: Download test set data for offline use
|
46 |
-
- **Generate Test Sets**: Generate new test sets from basic prompts
|
47 |
|
48 |
### Example Use Cases:
|
49 |
|
@@ -61,7 +64,7 @@ The Rhesis SDK currently provides functionality to work with Rhesis test sets th
|
|
61 |
|
62 |
### Disclaimer
|
63 |
|
64 |
-
|
65 |
|
66 |
### Visit Us
|
67 |
For more details about our testing platform, datasets, and solutions, including the Rhesis AI SDK, visit [Rhesis AI](https://www.rhesis.ai/).
|
|
|
26 |
|
27 |
|
28 |
> Open-source test generation SDK for LLM applications.
|
29 |
+
|
30 |
|
31 |
+
Rhesis AI provides curated and dynamically generated test sets to evaluate LLM applications under diverse conditions. These datasets help assess robustness, reliability, and compliance in real-world scenarios.
|
32 |
|
33 |
+
### Using Our Datasets
|
34 |
|
35 |
+
Our datasets are designed to test various aspects of LLM application behavior, from reliability to safety and bias detection. To get started:
|
36 |
|
37 |
+
1. Browse the available test sets here on Hugging Face.
|
38 |
+
2. Select the dataset that aligns with your evaluation needs.
|
39 |
+
3. Load and apply the test cases to assess your application’s behavior.
|
40 |
|
41 |
+
For more advanced testing and seamless integration, the [Rhesis SDK](https://github.com/rhesis-ai/rhesis-sdk) provides tools to automate dataset handling, generate structured test cases, and streamline evaluation workflows.
|
42 |
|
43 |
+
## Key Features
|
44 |
|
45 |
+
- **Curated Test Sets** – Pre-built datasets covering diverse evaluation criteria.
|
46 |
+
- **Dynamic Test Generation** – Generate custom test sets tailored to specific use cases.
|
47 |
+
- **Scalability** – Use datasets for one-off evaluations or integrate them into automated testing pipelines.
|
48 |
|
49 |
+
For questions or custom datasets, reach out at **[email protected]**.
|
|
|
|
|
|
|
50 |
|
51 |
### Example Use Cases:
|
52 |
|
|
|
64 |
|
65 |
### Disclaimer
|
66 |
|
67 |
+
Some test cases may contain sensitive, challenging, or potentially upsetting content. These cases are included to ensure thorough and realistic assessments. Users should review test cases carefully and exercise discretion when utilizing them.
|
68 |
|
69 |
### Visit Us
|
70 |
For more details about our testing platform, datasets, and solutions, including the Rhesis AI SDK, visit [Rhesis AI](https://www.rhesis.ai/).
|