Alex J. Chan committed
Commit 92af62b · 2 Parent(s): ad2c5c0 19e573e

Merge pull request #8 from convergence-ai/alex/image_update

Files changed (2)
  1. README.md +3 -6
  2. src/proxy_lite/cli.py +7 -3
README.md CHANGED
@@ -24,9 +24,6 @@
   <a href="https://github.com/convergence-ai/proxy-lite/issues/">
   <img src="https://img.shields.io/github/issues/convergence-ai/proxy-lite" alt="open issues" />
   </a>
- <a href="https://github.com/convergence-ai/proxy-lite/blob/master/LICENSE">
- <img src="https://img.shields.io/github/license/convergence-ai/proxy-lite.svg" alt="license" />
- </a>
   </p>
 
   </div>
@@ -159,7 +156,7 @@ result = asyncio.run(
   The `Runner` sets the solver and environment off in a loop, like in a traditional reinforcement learning setup.
 
   <div align="center">
- <img src="assets/loop.png" alt="Runner Loop" width="700" height="auto" style="margin-bottom: 20px;" />
+ <img src="assets/loop.png" alt="Runner Loop" width="800" height="auto" style="margin-bottom: 20px;" />
   </div>
 
 
@@ -186,9 +183,9 @@ message_history = [
   ```
   This would then build up the message history, alternating between the assistant (who takes the *action*) and the user (who provides the *observation*).
 
- > **Context-Window Management:** When making calls to the model, all the last observations other than the current one are discarded in order to reduce the large number of image tokens required. Since the model responses include reflection on the observations and are all included in the message history, the model is still aware of the entire history when planning new actions.
+ > **Context-Window Management:** When making calls to the model, all the observations other than the current one are discarded in order to reduce the large number of image tokens required. Since the model responses include reflection on the observations and are all included in the message history, the model is still aware of the entire history when planning new actions.
 
- The chat template will format this automatically. You should also pass the `Tools` that the model has access to, these will define the action space available to the model. You can do this with `transformers`:
+ You should also pass the `Tools` that the model has access to, these will define the action space available to the model. You can do this with `transformers`:
 
   ```python
   from qwen_vl_utils import process_vision_info
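
The context-window note in the hunk above describes the mechanism at a high level: before each model call, every screenshot except the most recent one is dropped, while the assistant's text turns (which reflect on those earlier observations) are kept. A minimal sketch of that trimming step, using hypothetical OpenAI-style message dictionaries rather than Proxy Lite's actual data structures, could look like this:

```python
from copy import deepcopy

def trim_old_observations(message_history: list[dict]) -> list[dict]:
    """Drop image content from every user observation except the latest one."""
    trimmed = deepcopy(message_history)
    # User turns carry the observations (screenshots); assistant turns carry
    # the model's reflections and actions, which stay untouched.
    user_turns = [i for i, msg in enumerate(trimmed) if msg["role"] == "user"]
    for i in user_turns[:-1]:  # every observation but the current one
        trimmed[i]["content"] = [
            part for part in trimmed[i]["content"] if part.get("type") != "image"
        ]
    return trimmed
```

The point of the pattern is that the image-token cost stays roughly constant per call, while the textual history still spans the whole episode.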
src/proxy_lite/cli.py CHANGED
@@ -15,6 +15,10 @@ def update_config_from_env(config: RunnerConfig) -> RunnerConfig:
         config.solver.client.api_base = os.getenv("PROXY_LITE_API_BASE")
     if os.getenv("PROXY_LITE_MODEL"):
         config.solver.client.model_id = os.getenv("PROXY_LITE_MODEL")
+    if os.getenv("PROXY_LITE_VIEWPORT_WIDTH"):
+        config.environment.viewport_width = int(os.getenv("PROXY_LITE_VIEWPORT_WIDTH"))
+    if os.getenv("PROXY_LITE_VIEWPORT_HEIGHT"):
+        config.environment.viewport_height = int(os.getenv("PROXY_LITE_VIEWPORT_HEIGHT"))
     return config
 
 
@@ -31,11 +35,11 @@ def do_command(args):
     if args.model:
         config.solver.client.model_id = args.model
     if args.homepage:
-        config.homepage = args.homepage
+        config.environment.homepage = args.homepage
     if args.viewport_width:
-        config.viewport_width = args.viewport_width
+        config.environment.viewport_width = args.viewport_width
     if args.viewport_height:
-        config.viewport_height = args.viewport_height
+        config.environment.viewport_height = args.viewport_height
     o = Runner(config=config)
     result = asyncio.run(o.run(do_text))
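
The new environment-variable handling follows the same override pattern as the existing PROXY_LITE_API_BASE and PROXY_LITE_MODEL checks: only overwrite a config field when the variable is actually set. A self-contained sketch of that pattern, using stand-in dataclasses instead of proxy_lite's real RunnerConfig, might look like:

```python
import os
from dataclasses import dataclass, field

@dataclass
class EnvironmentConfig:  # stand-in for the real browser-environment config
    viewport_width: int = 1280
    viewport_height: int = 720

@dataclass
class RunnerConfig:  # stand-in for proxy_lite's RunnerConfig
    environment: EnvironmentConfig = field(default_factory=EnvironmentConfig)

def update_config_from_env(config: RunnerConfig) -> RunnerConfig:
    # Same shape as the diff: leave defaults alone unless the variable is set.
    if os.getenv("PROXY_LITE_VIEWPORT_WIDTH"):
        config.environment.viewport_width = int(os.getenv("PROXY_LITE_VIEWPORT_WIDTH"))
    if os.getenv("PROXY_LITE_VIEWPORT_HEIGHT"):
        config.environment.viewport_height = int(os.getenv("PROXY_LITE_VIEWPORT_HEIGHT"))
    return config

os.environ["PROXY_LITE_VIEWPORT_WIDTH"] = "1440"
print(update_config_from_env(RunnerConfig()).environment.viewport_width)  # 1440
```

Casting with int() matters here because os.getenv always returns strings, which is why the diff wraps the viewport values but not the model id or API base.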