Update README.md
README.md
CHANGED
@@ -171,7 +171,7 @@ We report results derived from the Agentless scaffold. Departing from the origin
 "sphinx-doc__sphinx-8475"
 
 ### TAU-bench methodology
-We evaluate TAU-Bench with the average passrate of 5 samples for each query, with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is
+We evaluate TAU-Bench with the average passrate of 5 samples for each query, with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is 40.
 We prepend a general principle to the policy prompt.
 #### General
 - In each round, you need to carefully examine the tools provided to you to determine if any can be used.
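
The metric in the changed line (average pass rate over 5 samples per query) can be sketched as below. This is a minimal illustration, not the authors' evaluation code: the function name `average_pass_rate` and the input layout (a dict mapping a query id to its list of pass/fail outcomes) are assumptions made for the example.

```python
from statistics import mean

NUM_SAMPLES = 5  # samples per query, as stated in the README line above


def average_pass_rate(results: dict[str, list[bool]]) -> float:
    """Mean over queries of the per-query fraction of passing samples.

    `results` maps a hypothetical query id to its NUM_SAMPLES outcomes.
    """
    per_query = []
    for query_id, outcomes in results.items():
        if len(outcomes) != NUM_SAMPLES:
            raise ValueError(
                f"{query_id}: expected {NUM_SAMPLES} samples, got {len(outcomes)}"
            )
        # Fraction of the samples for this query that passed.
        per_query.append(sum(outcomes) / NUM_SAMPLES)
    # Unweighted average across queries.
    return mean(per_query)
```

For example, one query passing all 5 samples and another passing 1 of 5 gives per-query rates of 1.0 and 0.2. Because every query has the same number of samples, averaging per-query rates is equivalent to pooling all samples.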