metadata
license: apache-2.0
datasets:
- StarfleetAI/function-calling
- ajibawa-2023/Code-290k-ShareGPT
language:
- en
library_name: transformers
StarfleetAI Polaris Small
Work on the model card is still in progress.
Even though the model is not yet meant for production usage, we still encourage you to give it a try and tell us what you think.
Any feedback or suggestions are welcomed!
This model was developed by StarfleetAI with the aim of it being used for autonomous AI agent scenarios.
Model Details
- Based on
Open-Orca/Mistral-7B-OpenOrca
- Additionally trained on coding tasks
- Trained for function calling
Features
- Function calling
- Needs more training on examples where function call should be performed in the middle/end parts of the conversation, rather than in the beginning
- Needs to eliminate some hallucinations (it, for example, sometimes regressing news articles instead of calling the
get_rss_feed
function, and stuff like that) - Correct typography (currently:
Answer:123
instead ofAnswer: 123
) - Current date / time / timezone conversational abilities
- Needs some DPO
Model Description
- Developed by: StarfleetAI
- Model type: Mistral
- Language(s) (NLP): English
- License: Apache license 2.0
- Finetuned from model:
Open-Orca/Mistral-7B-OpenOrca
Training Details
You can find the trainer code available on Github under the Apache 2.0 license.