Spaces:

gorilla-llm
/

berkeley-function-calling-leaderboard

Running

App Files Files Community

Huanzhi Mao commited on Apr 2, 2024

Commit

2b538b7

1 Parent(s): 383da93

update description

Browse files

Files changed (1) hide show

app.py +4 -4

app.py CHANGED Viewed

@@ -1029,7 +1029,7 @@ with gr.Blocks() as demo:
                 "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
             )
             gr.Markdown(
-                """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation through execution.**
                 **FC = native support for function/tool calling.**
@@ -1046,7 +1046,7 @@ with gr.Blocks() as demo:
                 "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
             )
             gr.Markdown(
-                """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation through execution.**
                 **FC = native support for function/tool calling.**
@@ -1064,8 +1064,8 @@ with gr.Blocks() as demo:
                 We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
-                **AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation through execution.
                 **Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
                 **Latency** is measured in seconds.

                 "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
             )
             gr.Markdown(
+                """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
                 **FC = native support for function/tool calling.**
                 "**This live leaderboard evaluates the LLM's ability to call functions (aka tools) accurately. This leaderboard consists of real-world data and will be updated periodically. For more information on the evaluation dataset and methodology, please refer to our [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html) and [code](https://github.com/ShishirPatil/gorilla).**"
             )
             gr.Markdown(
+                """**AST means evaluation through Abstract Syntax Tree and Exec means evaluation by executing all the API calls the LLM generates.**
                 **FC = native support for function/tool calling.**
                 We provide a short summary here. For more details, please refer to our release [blog](https://gorilla.cs.berkeley.edu/blogs/8_berkeley_function_calling_leaderboard.html):
+                **AST** means evaluation through Abstract Syntax Tree, and **Exec** means evaluation by executing all the API calls the LLM generates.
                 **Cost** is calculated as an estimate of the cost per 1000 function calls, in USD.
                 **Latency** is measured in seconds.