Thank you for your dedication; it sounds great. I would like to share some additional information and perspectives so that everyone can better understand the issues we are addressing:
- With language models, in practical applications we only need the model to understand about 80% of the content, or to have a good overview; combining it with RAG then brings better accuracy. So what we need is a model that is reliably truthful and that understands and works with RAG very well.
- On Italian: I'm very happy that it speaks well. It shows that my training method and source code were correct, because this is actually live in the d0x5 version. This matters because Italian was only added later (at the same time as German), in response to the fact that its output could sometimes only be described as a rough translation.
- On the ability to reason, I hope you don't misunderstand. It still works well; it is just that, compared to some current superior models like GPT-4o or Claude 3, there will be some problems where it will "lose". It still outperforms a lot of other, much larger models. For example, take this question from the OpenAI GPT-4 home page: "Andrew is free from 11 am to 3 pm, Joanne is free from noon to 2 pm and then 3:30 pm to 5 pm. Hannah is available at noon for half an hour, and then 4 pm to 6 pm. What are some options for start times for a 30 minute meeting for Andrew, Hannah, and Joanne?"
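For anyone who wants to check a model's answer to the scheduling question above, here is a minimal sketch that computes the shared availability by intersecting intervals. The function names and the minutes-since-midnight representation are my own choices for illustration; they are not part of any model or library.

```python
def intersect(a, b):
    """Intersect two lists of (start, end) intervals, in minutes since midnight."""
    out = []
    for s1, e1 in a:
        for s2, e2 in b:
            s, e = max(s1, s2), min(e1, e2)
            if s < e:  # keep only non-empty overlaps
                out.append((s, e))
    return sorted(out)


def common_windows(schedules, duration):
    """Return the windows shared by everyone that are at least `duration` minutes long."""
    windows = schedules[0]
    for sched in schedules[1:]:
        windows = intersect(windows, sched)
    return [(s, e) for s, e in windows if e - s >= duration]


def fmt(minutes):
    """Format minutes since midnight as HH:MM (24-hour clock)."""
    return f"{minutes // 60:02d}:{minutes % 60:02d}"


# Availability exactly as stated in the question.
andrew = [(11 * 60, 15 * 60)]
joanne = [(12 * 60, 14 * 60), (15 * 60 + 30, 17 * 60)]
hannah = [(12 * 60, 12 * 60 + 30), (16 * 60, 18 * 60)]

DURATION = 30
windows = common_windows([andrew, joanne, hannah], DURATION)

# For each shared window, the meeting can start anywhere from the window
# start up to (window end - duration).
start_ranges = [(fmt(s), fmt(e - DURATION)) for s, e in windows]
print(start_ranges)
```

The only window shared by all three is noon to 12:30 pm, so the only possible start time for a 30 minute meeting is noon.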
One note: in reasoning tests, the temperature is often set to 0 for the model under test; with Ghost 8B Beta we always use 0.1 as the lowest setting. The reason is simple: if the model still reasons well at this level, then at 0.4 (the default of the current chat) it will often still achieve the same results, and we want to aim for practical effectiveness rather than scores. Try lowering the temperature on some reasoning questions to experiment.
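To illustrate why the temperature matters here, below is a small sketch of the standard softmax-with-temperature calculation. The logits are made-up values for three hypothetical candidate tokens, and this is not the actual sampling code of the Ghost 8B Beta chat; it only shows how lower temperatures concentrate probability on the top candidate.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert logits to probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive; 0 is usually handled as greedy decoding")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for three candidate next tokens (illustrative only).
logits = [2.0, 1.5, 0.5]

for t in (0.1, 0.4, 1.0):
    probs = softmax_with_temperature(logits, t)
    print(f"temperature={t}: {[round(p, 3) for p in probs]}")
```

At 0.1 almost all of the probability sits on the top token, which is close to the behavior at temperature 0, while at 0.4 the other candidates get a noticeable share. That is why a model that only reasons correctly at 0 may be less reliable at the chat default.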
Above all, you guys are great. Thank you so much, everyone.
An example of reasoning about time:
![Screenshot 2024-07-16 at 11.29.12.png](https://cdn-uploads.huggingface.co/production/uploads/600ae38cc92b79f54efd4556/72nk4rxI0Q0JKtrc08WXO.png)
![Screenshot 2024-07-16 at 11.29.25.png](https://cdn-uploads.huggingface.co/production/uploads/600ae38cc92b79f54efd4556/8dBdjYRdjL5NOY-AEg8Mg.png)
An example of a long context with extensive summarization capabilities: Paper: Point out the highlights and identify the ideal people to apply it.