Spaces:

nhop
/

L3Score

Running

Niklas Hoepner commited on Apr 16

Commit

0ca5bff

1 Parent(s): 9b652c8

Put title above input box

Files changed (1) hide show

app.py CHANGED Viewed

@@ -18,6 +18,9 @@ def compute_l3score(api_key, provider, model, questions, predictions, references
         return {"error": str(e)}
 with gr.Blocks() as demo:
     with gr.Row():
@@ -40,7 +43,6 @@ with gr.Blocks() as demo:
     )
     gr.Markdown(r"""
-    <h1 align="center"> Metric: L3Score </h1>
     ## 📌 Description
     **L3Score** evaluates how semantically close a model-generated answer is to a reference answer for a given question. It prompts a **language model as a judge** using:
@@ -62,7 +64,7 @@ with gr.Blocks() as demo:
     ## 🧮 Scoring Logic
-    Let $l_{\text{yes}}$ and $l_{\text{no}}$ be the log-probabilities of "Yes" and "No", respectively.
     - If neither token is in the top-5:

         return {"error": str(e)}
 with gr.Blocks() as demo:
+    gr.Markdown(r"""
+    <h1 align="center"> Metric: L3Score </h1>
+    """)
     with gr.Row():
     )
     gr.Markdown(r"""
     ## 📌 Description
     **L3Score** evaluates how semantically close a model-generated answer is to a reference answer for a given question. It prompts a **language model as a judge** using:
     ## 🧮 Scoring Logic
+    Let $ l_{\text{yes}} $ and $ l_{\text{no}} $ be the log-probabilities of "Yes" and "No", respectively.
     - If neither token is in the top-5: