ri-nat committed
Commit aa5b14d (verified)
1 Parent(s): 4d61c06

Update README.md

Files changed (1)
  1. README.md +66 -10
README.md CHANGED
@@ -10,9 +10,6 @@ library_name: transformers
 
  # StarfleetAI Polaris Small
 
- > [!NOTE]
- > Work on the model card is still in progress.
-
  > [!WARNING]
  > Even though the model is not yet meant for production usage, we still encourage you to give it a try and tell us what you think.
  >
@@ -26,14 +23,73 @@ This model was developed by StarfleetAI with the aim of it being used for autono
  * Additionally trained on coding tasks
  * Trained for function calling
 
- ## Features
-
- - [x] Function calling
- - [ ] Needs more training on examples where function call should be performed in the middle/end parts of the conversation, rather than in the beginning
- - [ ] Needs to eliminate some hallucinations (it, for example, sometimes regressing news articles instead of calling the `get_rss_feed` function, and stuff like that)
- - [ ] Correct typography (currently: `Answer:123` instead of `Answer: 123`)
- - [ ] Current date / time / timezone conversational abilities
- - [ ] Needs some DPO
 
  ### Model Description
 
 
 
  # StarfleetAI Polaris Small
 
  > [!WARNING]
  > Even though the model is not yet meant for production usage, we still encourage you to give it a try and tell us what you think.
  >

  * Additionally trained on coding tasks
  * Trained for function calling
 
+ ## Current Features
+
+ As of version `0.1`, the model is:
+
+ - Able to call functions
+ - Reasonably capable at coding tasks
+
+ ## Work To Do
+
+ - Needs more training on examples where the function call should happen in the middle or at the end of the conversation, rather than at the beginning
+ - Needs to eliminate some hallucinations (for example, it sometimes makes up news articles instead of calling the `get_rss_feed` function)
+ - Needs correct typography (`Answer: 123` instead of `Answer:123`)
+ - Needs current date / time / timezone conversational abilities (e.g. "Schedule a call tomorrow at 6PM")
+ - Needs some DPO
+ - Needs to be evaluated on relevant metrics
+
+ ## Prompt Format
+
+ We wanted our prompt to:
+
+ 1. Have a configurable `system` prompt part, which is meant to be changed by the end user.
+ 2. Describe the functions available to the model in a specific, static place in the prompt, making them easier for the model to identify.
+ 3. Be capable of carrying additional configuration options in the future, such as the current date/time or the user's name.
+
+ To fulfill these needs, we designed the following prompt format, which seems to tick all the boxes.
+
+ ### With Functions
+
+ ```
+ ## Configuration
+
+ Functions: enabled
+
+ ## Available Functions
+
+ { ... }
+ { ... }
+
+ ---
+
+ You are a helpful assistant.
+ ```
+
+ ### Without Functions
+
+ ```
+ ## Configuration
+
+ Functions: disabled
+
+ ---
+
+ You are a helpful assistant.
+ ```
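
The two layouts above differ only in the `Functions:` flag and the optional `## Available Functions` section, so they can be assembled by one helper. The following is a minimal sketch and not part of the commit. The helper name `build_system_prompt`, the one-JSON-object-per-line rendering, and the fields of the example function description are all assumptions, since the model card elides the function schema as `{ ... }`.

```python
import json


def build_system_prompt(system_text, functions=None):
    """Assemble a system prompt in the layout shown in the README.

    `functions` is a hypothetical list of JSON-serializable function
    descriptions; the real schema is not specified in the model card.
    """
    enabled = bool(functions)
    lines = [
        "## Configuration",
        "",
        f"Functions: {'enabled' if enabled else 'disabled'}",
        "",
    ]
    if enabled:
        lines += ["## Available Functions", ""]
        # Assumption: one JSON object per line, mirroring the `{ ... }` placeholders.
        lines += [json.dumps(fn) for fn in functions]
        lines.append("")
    # The user-configurable system text always follows the `---` separator.
    lines += ["---", "", system_text]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical function description, for illustration only.
    example_fn = {
        "name": "generate_password",
        "description": "Generate a random password",
        "parameters": {"length": {"type": "integer"}},
    }
    print(build_system_prompt("You are a helpful assistant.", [example_fn]))
```

Keeping the configuration block in one helper means future options (such as the current date/time mentioned above) would only need one extra line in the `lines` list.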
+
+ ## Function Calling
+
+ To let the model call a function, we have introduced two new tokens: `<|fn_start|>` and `<|fn_end|>`. It is safe to assume that if the model decides to call a function, its response will contain nothing but the function call between these special tokens. If this does not hold in some of your cases, please feel free to contact us with examples.
+
+ For now, the model is only capable of calling one function at a time.
+
+ ### Example Function Call
+
+ ```
+ <|fn_start|>{"name": "generate_password", "arguments": {"length": 42}}<|fn_end|>
+ ```
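
On the client side, a call like the one above has to be detected and decoded before it can be dispatched. The following is a minimal sketch and not part of the commit. It assumes the decoded model output still contains the `<|fn_start|>` and `<|fn_end|>` markers as literal text (decoding with special tokens skipped would likely strip them) and that the payload between them is a JSON object with `name` and `arguments` keys, as in the example. The helper name `parse_function_call` is hypothetical.

```python
import json
import re

# Matches the payload between the two special tokens described in the README.
FN_CALL_RE = re.compile(r"<\|fn_start\|>(.*?)<\|fn_end\|>", re.DOTALL)


def parse_function_call(text):
    """Return (name, arguments) if `text` contains a function call, else None."""
    match = FN_CALL_RE.search(text)
    if match is None:
        return None  # Plain-text response, no function call.
    try:
        payload = json.loads(match.group(1))
    except json.JSONDecodeError:
        return None  # Markers present, but the payload is not valid JSON.
    if not isinstance(payload, dict) or "name" not in payload:
        return None
    # Only the first call is read, since the model emits one call at a time.
    return payload["name"], payload.get("arguments", {})


if __name__ == "__main__":
    output = '<|fn_start|>{"name": "generate_password", "arguments": {"length": 42}}<|fn_end|>'
    print(parse_function_call(output))
```

Returning `None` for malformed payloads lets the caller fall back to treating the output as an ordinary text response.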
 
 
  ### Model Description