Text Generation
Transformers
Safetensors
English
mistral
conversational
text-generation-inference
polaris-small / README.md
ri-nat's picture
Update README.md
c805b83 verified
|
raw
history blame
4.43 kB
metadata
license: apache-2.0
datasets:
  - StarfleetAI/function-calling
  - ajibawa-2023/Code-290k-ShareGPT
language:
  - en
library_name: transformers

StarfleetAI Polaris Small

Work on the model card is still in progress.

Even though the model is not yet meant for production usage, we still encourage you to give it a try and tell us what you think.

Any feedback or suggestions are welcomed!

This model was developed by StarfleetAI with the aim of it being used for autonomous AI agent scenarios.

Model Details

  • Based on Open-Orca/Mistral-7B-OpenOrca
  • Additionally trained on coding tasks
  • Trained for function calling

Features

  • Function calling
    • Needs more training on examples where function call should be performed in the middle/end parts of the conversation, rather than in the beginning
  • Needs to eliminate some hallucinations (it, for example, sometimes regressing news articles instead of calling the get_rss_feed function, and stuff like that)
  • Correct typography (currently: Answer:123 instead of Answer: 123)
  • Current date / time / timezone conversational abilities
  • Needs some DPO

Model Description

  • Developed by: StarfleetAI
  • Model type: Mistral
  • Language(s) (NLP): English
  • License: Apache license 2.0
  • Finetuned from model: Open-Orca/Mistral-7B-OpenOrca

Training Details

You can find the trainer code available on Github under the Apache 2.0 license.