Huanzhi Mao
committed on
Commit · 2b538b7
1 Parent(s): 383da93
update description
app.py
CHANGED
@@ -1029,7 +1029,7 @@ with gr.Blocks() as demo:
 "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
 )
 gr.Markdown(
-"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation
+"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
 
 **FC = native support for function/tool calling.**
 
@@ -1046,7 +1046,7 @@ with gr.Blocks() as demo:
 "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
 )
 gr.Markdown(
-"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation
+"""**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
 
 **FC = native support for function/tool calling.**
 
@@ -1064,8 +1064,8 @@ with gr.Blocks() as demo:
 
 We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
 
-**AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation
-
+**AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation by executing all the API calls the LLM generates.
+
 **Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
 
 **Latency** is measured in seconds.
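The updated description contrasts AST evaluation with Exec evaluation. The difference can be sketched roughly as below. This is a minimal illustration, not the leaderboard's actual checker: the helper names `ast_check` and `exec_check`, the toy `add` function, and the keyword-only argument format are all assumptions made for the example.

```python
import ast


def ast_check(call_src, expected_name, expected_args):
    """AST-style check (hypothetical helper): parse the generated call
    and compare its structure, without ever running it."""
    node = ast.parse(call_src, mode="eval").body
    if not isinstance(node, ast.Call) or not isinstance(node.func, ast.Name):
        return False
    if node.args:
        # This sketch only handles keyword arguments.
        return False
    try:
        args = {kw.arg: ast.literal_eval(kw.value) for kw in node.keywords}
    except ValueError:
        # An argument was not a plain literal.
        return False
    return node.func.id == expected_name and args == expected_args


def exec_check(call_src, functions, expected_result):
    """Exec-style check (hypothetical helper): run the generated call
    against real functions and compare the returned value."""
    # Evaluating model output is unsafe outside a sandbox; illustration only.
    result = eval(call_src, {"__builtins__": {}}, dict(functions))
    return result == expected_result


def add(a, b):
    """Toy API standing in for a real function the model may call."""
    return a + b


generated = "add(a=2, b=3)"  # pretend this string came from the LLM
ast_ok = ast_check(generated, "add", {"a": 2, "b": 3})
exec_ok = exec_check(generated, {"add": add}, 5)
print(ast_ok, exec_ok)
```

The AST-style check never runs the call, so it works even when the target API is unavailable. The Exec-style check needs a callable implementation and a known expected result.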