Update README.md
Browse files
README.md
CHANGED
@@ -279,3 +279,12 @@ Here are guides on using llama-cpp-python and ctransformers with LangChain:
|
|
279 |
|
280 |
<!-- footer start -->
|
281 |
<!-- 200823 -->
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
279 |
|
280 |
<!-- footer start -->
|
281 |
<!-- 200823 -->
|
282 |
+
|
283 |
+
## Thanks, and how to contribute
|
284 |
+
|
285 |
+
|
286 |
+
# Original model card: Infosys's NT-Java-1.1B
|
287 |
+
|
288 |
+
# **NT-Java**
|
289 |
+
|
290 |
+
The Narrow Transformer (NT) model NT-Java-1.1B is an open-source specialized code model built by extending pre-training on StarCoderBase-1B, designed for coding tasks in Java programming. The model is a decoder-only transformer with Multi-Query Attention and with a context length of 8192 tokens. The model was trained with Java subset of the StarCoderData dataset, which is ~22B tokens.
|