Spaces: ServiceNow/browsergym-leaderboard
ligang-orby committed on Feb 25

Commit 0d9ae7c (1 parent: a17c3c8)

Update links to research blog

Files changed (2)

results/OrbyAgent-ActIO-72b/README.md CHANGED

@@ -4,4 +4,4 @@ This agent is developed by [Orby AI](https://www.orby.ai/).
 The agent does not use any benchmark-specific information in the prompts. For WebArena benchmark, we use the original evaluator and task definitions for fair comparison.
-It uses the ActIO model of 72B parameters as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog]().
+It uses the ActIO model of 72B parameters as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog](https://www.orby.ai/resources/elevating-automation-orby-ais-generic-agent-framework-and-self-adaptive-interface-learning-technique).

results/OrbyAgent-Claude-3.5-Sonnet/README.md CHANGED

@@ -4,4 +4,4 @@ This agent is developed by [Orby AI](https://www.orby.ai/).
 The agent does not use any benchmark-specific information in the prompts. For WebArena benchmark, we use the original evaluator and task definitions for fair comparison.
-It uses Claude-3.5-sonnet-20241022 as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog]().
+It uses Claude-3.5-sonnet-20241022 as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog](https://www.orby.ai/resources/elevating-automation-orby-ais-generic-agent-framework-and-self-adaptive-interface-learning-technique).