Commit
·
0d9ae7c
1
Parent(s):
a17c3c8
Update links to research blog
Browse files
results/OrbyAgent-ActIO-72b/README.md
CHANGED
@@ -4,4 +4,4 @@ This agent is developed by [Orby AI](https://www.orby.ai/).
|
|
4 |
|
5 |
The agent does not use any benchmark-specific information in the prompts. For WebArena benchmark, we use the original evaluator and task definitions for fair comparison.
|
6 |
|
7 |
-
It uses the ActIO model of 72B parameters as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog]().
|
|
|
4 |
|
5 |
The agent does not use any benchmark-specific information in the prompts. For WebArena benchmark, we use the original evaluator and task definitions for fair comparison.
|
6 |
|
7 |
+
It uses the ActIO model of 72B parameters as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog](https://www.orby.ai/resources/elevating-automation-orby-ais-generic-agent-framework-and-self-adaptive-interface-learning-technique).
|
results/OrbyAgent-Claude-3.5-Sonnet/README.md
CHANGED
@@ -4,4 +4,4 @@ This agent is developed by [Orby AI](https://www.orby.ai/).
|
|
4 |
|
5 |
The agent does not use any benchmark-specific information in the prompts. For WebArena benchmark, we use the original evaluator and task definitions for fair comparison.
|
6 |
|
7 |
-
It uses Claude-3.5-sonnet-20241022 as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog]().
|
|
|
4 |
|
5 |
The agent does not use any benchmark-specific information in the prompts. For WebArena benchmark, we use the original evaluator and task definitions for fair comparison.
|
6 |
|
7 |
+
It uses Claude-3.5-sonnet-20241022 as a backend, with both screenshot and HTML as inputs. More details can be found in our [research blog](https://www.orby.ai/resources/elevating-automation-orby-ais-generic-agent-framework-and-self-adaptive-interface-learning-technique).
|