PipableAI
/

pip-code-bandit

Model card Files Files and versions Community

rvk7895 commited on May 13, 2024

Commit

83dd665

verified ·

1 Parent(s): c37fe02

Update README.md

Browse files

Files changed (1) hide show

README.md +11 -11

README.md CHANGED Viewed

@@ -32,7 +32,7 @@ widget:
 ---
 # pip-code-bandit
-[pipableAi](https://www.pipable.ai/)
 [colab_notebook](https://colab.research.google.com/drive/10av3SxFf0Psx_IkmZbcUhiVznStV5pVS?usp=sharing)
@@ -51,9 +51,9 @@ widget:
 Given a goal and tools, can AI intelligently use the tools to reach the goal?\
 What if it has a meagre 1.3b params/neurons akin to that of an owl? Can it follow instructions and plan to reach a goal?\
-Apparently it can!\
 Releasing **pip-code-bandit** and **pipflow**\
-A `model` and a `library` to manage and run goal oriented agentic system.
 ## Model attributes
@@ -77,16 +77,16 @@ A `model` and a `library` to manage and run goal oriented agentic system.
 ```
-## How we built it?
 We used a simulator to simulate environments where the model could play games to achieve goals, given a set of actions available to it.
-All the model could do was find the right action and config to incur positive reward.
-The reward policy is around the concept of model going to a stable state of zero net sum reward for both good and bad behaviour.
-In this set up the model, which was pre trained on code , function documentation and similar OS datasets ,was RL tuned for instruction following and reliability.
 ## License
 ```bash
-complete open sourced - apache 2.0. License
 ```
 ## Usage
@@ -124,15 +124,15 @@ curl -X 'POST' \
   -d 'model_name=PipableAI%2Fpip-code-bandit&prompt="YOUR PROMPT"&max_new_tokens=400'
 ```
-Alternatively, you can directly access UI endpoint at https://playground.pipable.ai/docs#/default/infer_infer_post.
 ### Library Usage
-For directly using the capabilities of model without putting extra efforts on schemas and prompts try to use [pipflow](https://github.com/PipableAI/pipflow).
-For detailed usage refer to the [colab_notebook](https://colab.research.google.com/drive/10av3SxFf0Psx_IkmZbcUhiVznStV5pVS?usp=sharing)

 ---
 # pip-code-bandit
+[PipableAI](https://www.pipable.ai/)
 [colab_notebook](https://colab.research.google.com/drive/10av3SxFf0Psx_IkmZbcUhiVznStV5pVS?usp=sharing)
 Given a goal and tools, can AI intelligently use the tools to reach the goal?\
 What if it has a meagre 1.3b params/neurons akin to that of an owl? Can it follow instructions and plan to reach a goal?\
+It can!\
 Releasing **pip-code-bandit** and **pipflow**\
+A `model` and a `library` to manage and run goal-oriented agentic system.
 ## Model attributes
 ```
+## How did we build it?
 We used a simulator to simulate environments where the model could play games to achieve goals, given a set of actions available to it.
+All the model could do was find the right action and config to incur a positive reward.
+The reward policy is around the concept of a model going to a stable state of zero net sum reward for both good and bad behaviour.
+In this setup, the model, which was pre-trained on code, function documentation, and similar OS datasets, was RL-tuned for reliability and instruction-following.
 ## License
 ```bash
+complete open-sourced - apache 2.0. License
 ```
 ## Usage
   -d 'model_name=PipableAI%2Fpip-code-bandit&prompt="YOUR PROMPT"&max_new_tokens=400'
 ```
+Alternatively, you can directly access the UI endpoint at https://playground.pipable.ai/docs#/default/infer_infer_post.
 ### Library Usage
+To directly use the model's capabilities without putting extra effort into schemas and prompts, try to use [pipflow](https://github.com/PipableAI/pipflow).
+For detailed usage, refer to the [colab_notebook](https://colab.research.google.com/drive/10av3SxFf0Psx_IkmZbcUhiVznStV5pVS?usp=sharing)