Deci
/

DeciCoder-6B

@@ -10,7 +10,7 @@ programming_language:
   - JavaScript
   - Python
   - Rust
-  - Go
   - C++
   - C
   - C#
@@ -58,10 +58,10 @@ datasets:
 - bigcode/starcoderdata
 ---
-# Model Card for DeciCoder 6B
-DeciCoder 6B is a 6 billion parameter decoder-only code completion model
-trained on the Python, Java, Javascript, Go, Rust, C++, C, and C# subset of [Starcoder Training Dataset](https://huggingface.co/datasets/bigcode/starcoderdata)..
 The model uses variable Grouped Query Attention and has a context window of 4096
 tokens. It was trained using a Fill-in-the-Middle training objective. The model's
 architecture was generated by Deci's proprietary Neural Architecture
@@ -70,10 +70,17 @@ Search-based technology, AutoNAC.
 ## Model Details
 - **Developed by:** Deci
-- **Model type:** DeciCoder is an auto-regressive language model based on the transformer decoder architecture, using variable Grouped Query Attention.
-- **Language(s):** Python, Java, JavaScript, Go, Rust, C++, C, C#
 - **License:** Model checkpoints are licensed under the [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
 ## Model Architecture
 | Parameters | Layers | Heads  | Sequence Length  | GQA num_key_value_heads  | Hidden Size  |
@@ -81,12 +88,12 @@ Search-based technology, AutoNAC.
 | 6B    | 32    | 32    | 4096   | Variable  | 4096 |  |
-- **Decoder layer:** Variable Grouped Query Attention. Grouped Query Attention was introduced in [Ainslie et al., 2023](https://arxiv.org/abs/2305.13245)
 - **Position Embeddings:** Rotary Position Embeddings [Su et al., 2021](https://arxiv.org/abs/2104.09864)
 ## Uses
-The model is intended to do single/multiline code completion from a
 context window of up to 4096k tokens. It is *not* an instruction model
 and commands like \"Write a function that computes the absolute value of
 an integer,\" won't yield the desired results. A more effective approach
@@ -114,8 +121,8 @@ print(tokenizer.decode(outputs[0]))
 ### Attribution
-DeciCoder was trained on StarCoder Training Dataset, filtered for
-Python, Java, JavaScript, Rust, Go, C++, C, and C#. For additional information, please
 refer to [https://huggingface.co/datasets/bigcode/starcoderdata](https://huggingface.co/datasets/bigcode/starcoderdata).
 ```
@@ -123,34 +130,28 @@ refer to [https://huggingface.co/datasets/bigcode/starcoderdata](https://hugging
 ### Limitations
 The model has undergone training with source code from Python, Java,
-JavaScript, Go, Rust, C++, C, and C#. While the primary language in the source is English, it does
 contain other languages. Therefore, the model can produce code snippets
-given some context. However, there\'s no assurance that the resulting
 code will function as expected. It might be suboptimal, contain bugs, or
 even exploits.
 ## Evaluation
-Below are DeciCoder's pass@1 on MultiPL HumanEval scores
-| Python | JavaScript | Java  | C++  | C#  | Rust  | Go  | C  |
-|:----------|:----------|:----------|:----------|:----------|:----------|:----------|:----------|
-| 33.5%    | 29.3%    | 30.3%    |29.93%    |20.31%    |20.5%    |77.47%    |xx%    |
 ### Runtime Benchmarks
-|Inference Tool/Hardware | Qualcomm AI 100 (tokens/sec) |
-|:----------|:----------|
-| Infery LLM | xxx   |
-- Throughput (tokens/sec) - Measured with an optimal batch size of 96
-## Documentation
-- [Notebook](https://colab.research.google.com/drive/1JCxvBsWCZKHfIcHSMVf7GZCs3ClMQPjs) CHANGE
-- Blog post: [Introducing DeciCoder: The New Gold Standard in Efficient and Accurate Code Generation](https://deci.ai/blog/decicoder-efficient-and-accurate-code-generation-llm/)CHANGE
-- Questions:Feel free to contact us via our [Discord Community!](https://discord.com/invite/p9ecgRhDR8/)CHANGE
 ## How to Cite
@@ -158,9 +159,9 @@ Please cite this model using this format.
 ```bibtex
 @misc{DeciFoundationModels,
-title = {DeciCoder},
 author = {DeciAI Research Team},
 year = {2023}
-url={[https://huggingface.co/deci/decicoder-6b](https://huggingface.co/deci/decicoder-6b)},
 }
-```

   - JavaScript
   - Python
   - Rust
+  - Ruby
   - C++
   - C
   - C#
 - bigcode/starcoderdata
 ---
+# Model Card for DeciCoder-6B
+DeciCoder-6B is a 6 billion parameter decoder-only code completion model
+trained on the Python, Java, Javascript, Rust, C++, C, and C# subset of [Starcoder Training Dataset](https://huggingface.co/datasets/bigcode/starcoderdata).
 The model uses variable Grouped Query Attention and has a context window of 4096
 tokens. It was trained using a Fill-in-the-Middle training objective. The model's
 architecture was generated by Deci's proprietary Neural Architecture
 ## Model Details
 - **Developed by:** Deci
+- **Model type:** DeciCoder-6B is an auto-regressive language model based on the transformer decoder architecture, using variable Grouped Query Attention.
+- **Language(s):** Python, Java, JavaScript, Ruby, Rust, C++, C, C#
 - **License:** Model checkpoints are licensed under the [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
+## Documentation
+- Google Colab [Notebook](https://colab.research.google.com/drive/1ZxG9qMlom9vn4lSGlD8PrjwHBvag94ei?usp=sharing)
+- Blog Post: [Introducing DeciCoder-6B: The Best Multi-Language Code Generation LLM in Its Class](https://deci.ai/blog/decicoder-6b-the-best-multi-language-code-generation-llm-in-its-class/)
+- Tutorial: [How to Run DeciCoder-6B on Qualcomm AI 100](https://github.com/quic/cloud-ai-sdk/tree/1.12/models/language_processing/decoder)
+- Questions: Feel free to contact us via our [Discord Community!](https://discord.com/invite/p9ecgRhDR8/)
 ## Model Architecture
 | Parameters | Layers | Heads  | Sequence Length  | GQA num_key_value_heads  | Hidden Size  |
 | 6B    | 32    | 32    | 4096   | Variable  | 4096 |  |
+- **Decoder layer:** Variable Grouped Query Attention
 - **Position Embeddings:** Rotary Position Embeddings [Su et al., 2021](https://arxiv.org/abs/2104.09864)
 ## Uses
+The model is intended to perform single/multiline code completion from a
 context window of up to 4096k tokens. It is *not* an instruction model
 and commands like \"Write a function that computes the absolute value of
 an integer,\" won't yield the desired results. A more effective approach
 ### Attribution
+DeciCoder-6B was trained on StarCoder Training Dataset, filtered for
+Python, Java, JavaScript, Ruby, RUST, C++, C, and C#. For additional information, please
 refer to [https://huggingface.co/datasets/bigcode/starcoderdata](https://huggingface.co/datasets/bigcode/starcoderdata).
 ```
 ### Limitations
 The model has undergone training with source code from Python, Java,
+JavaScript, Ruby, RUST, C++, C, and C#. While the primary language in the source is English, it does
 contain other languages. Therefore, the model can produce code snippets
+given some context. However, there is no assurance that the resulting
 code will function as expected. It might be suboptimal, contain bugs, or
 even exploits.
 ## Evaluation
+Below are DeciCoder-6B's pass@1 on MultiPL HumanEval scores
+| Python | JavaScript | Java  | C++  | C#  | Rust  | Go  |
+|:----------|:----------|:----------|:----------|:----------|:----------|:----------|
+| 33.3%    | 29.3%    | 30.3%    |29.93%    |20.31%    |20.5%    |77.47%    |
 ### Runtime Benchmarks
+|Inference Tool | Hardware | Prompt Length | Generation Length | Throughput (tokens/sec) |
+|:----------|:----------|:----------|:----------|:----------|
+| Qualcomm SDK | Qualcomm AI 100 | 1024 | 1024 | 531.3 |
+- Measured for maximal batch size on the device
 ## How to Cite
 ```bibtex
 @misc{DeciFoundationModels,
+title = {DeciCoder-6B},
 author = {DeciAI Research Team},
 year = {2023}
+url={[https://huggingface.co/deci/decicoder-6B](https://huggingface.co/deci/decicoder-6B)},
 }
+```