Update README.md
Browse files
README.md
CHANGED
@@ -117,6 +117,7 @@ input_tokens = tokenizer(
|
|
117 |
generated_ids = model.generate(
|
118 |
**input_tokens,
|
119 |
max_new_tokens=256,
|
|
|
120 |
)[0]
|
121 |
|
122 |
generated_response = tokenizer.decode(
|
@@ -133,17 +134,75 @@ Then our `generated_response` will look like this:
|
|
133 |
]</tool_call><|im_end|>
|
134 |
```
|
135 |
|
|
|
136 |
|
|
|
137 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
138 |
|
|
|
139 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
140 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
141 |
|
|
|
|
|
142 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
143 |
|
144 |
-
|
145 |
-
|
146 |
-
|
147 |
|
148 |
|
149 |
|
|
|
117 |
generated_ids = model.generate(
|
118 |
**input_tokens,
|
119 |
max_new_tokens=256,
|
120 |
+
do_sample=False,
|
121 |
)[0]
|
122 |
|
123 |
generated_response = tokenizer.decode(
|
|
|
134 |
]</tool_call><|im_end|>
|
135 |
```
|
136 |
|
137 |
+
## Usage (vLLM) <a name="usage_vllm"></a>
|
138 |
|
139 |
+
For tool calling to work correctly with online serving in vLLM, you additionally need to load [qwen2_tool_parser.py]() and [chat_template.jinja]() from this repository.
|
140 |
|
141 |
+
```
|
142 |
+
vllm serve Vikhrmodels/Qwen2.5-7B-Instruct-Tool-Planning \
|
143 |
+
--download-dir "/path/to/cache" \
|
144 |
+
--chat-template "/path/to/chat_template.jinja" \
|
145 |
+
--tool-parser-plugin "/path/to/qwen2_tool_parser.py" \
|
146 |
+
--tool-call-parser "qwen2" \
|
147 |
+
--enable-auto-tool-choice
|
148 |
+
```
|
149 |
|
150 |
+
After that, you can start making requests:
|
151 |
|
152 |
+
```python
|
153 |
+
from openai import OpenAI
|
154 |
+
import json
|
155 |
+
|
156 |
+
client = OpenAI(base_url="http://localhost:8000/v1", api_key="dummy")
|
157 |
+
tools = [
|
158 |
+
{
|
159 |
+
"type": "function",
|
160 |
+
"function": {
|
161 |
+
"name": "get_weather",
|
162 |
+
"description": "Get the current weather in a given location",
|
163 |
+
"parameters": {
|
164 |
+
"type": "object",
|
165 |
+
"properties": {
|
166 |
+
"location": {"type": "string", "description": "City and state."},
|
167 |
+
},
|
168 |
+
"required": ["location"]
|
169 |
+
}
|
170 |
+
}
|
171 |
+
}
|
172 |
+
]
|
173 |
|
174 |
+
response = client.chat.completions.create(
|
175 |
+
model=client.models.list().data[0].id,
|
176 |
+
messages=[
|
177 |
+
{"role": "user", "content": "What's the weather in Krasnodar and Moscow?"}
|
178 |
+
],
|
179 |
+
tools=tools,
|
180 |
+
)
|
181 |
|
182 |
+
print(response.choices[0].message)
|
183 |
+
```
|
184 |
|
185 |
+
```
|
186 |
+
ChatCompletionMessage(
|
187 |
+
content='<|start_thinking|>I need to get the weather for Krasnodar and Moscow.<|end_thinking|>',
|
188 |
+
refusal=None,
|
189 |
+
role='assistant',
|
190 |
+
audio=None,
|
191 |
+
function_call=None,
|
192 |
+
tool_calls=[
|
193 |
+
ChatCompletionMessageToolCall(
|
194 |
+
id='chatcmpl-tool-73646c73148e4af9ac53656d6aa3e3c6',
|
195 |
+
function=Function(arguments='{"location": "Krasnodar"}', name='get_weather'),
|
196 |
+
type='function'),
|
197 |
+
ChatCompletionMessageToolCall(
|
198 |
+
id='chatcmpl-tool-95d93590d1a24df6a4f44a87a83f7761',
|
199 |
+
function=Function(arguments='{"location": "Moscow"}', name='get_weather'),
|
200 |
+
type='function')
|
201 |
+
],
|
202 |
+
reasoning_content=None)
|
203 |
+
```
|
204 |
|
205 |
+
## Tool Planning Examples <a name="examples"></a>
|
|
|
|
|
206 |
|
207 |
|
208 |
|