Update app.py
app.py CHANGED
@@ -14,11 +14,7 @@ llm = Llama(
     n_ctx=2048,
     chat_format="llama-3",
     n_gpu_layers=-1,  # ensure all layers are on GPU
-
-    offload_kqv=True,  # store kqv on GPU
-    vocab_only=False,
-    use_mmap=True,
-    use_mlock=False,
+    split_mode="LLAMA_SPLIT_MODE_NONE",
 )
 
 # Placeholder responses for when context is empty
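For context, here is a minimal sketch of what the updated constructor call could look like after this commit. The model path and any other arguments outside this hunk are assumptions, not part of the diff; note also that in llama-cpp-python the split_mode parameter is normally the integer constant llama_cpp.LLAMA_SPLIT_MODE_NONE rather than the string literal added here, so the constant form is shown as one possible reading of the intent.

# Sketch only: model_path is a placeholder, and split_mode is shown as the
# integer enum exposed by llama-cpp-python (the commit itself passes a string).
import llama_cpp
from llama_cpp import Llama

llm = Llama(
    model_path="model.gguf",   # assumed placeholder path, not taken from the diff
    n_ctx=2048,
    chat_format="llama-3",
    n_gpu_layers=-1,           # ensure all layers are on GPU
    split_mode=llama_cpp.LLAMA_SPLIT_MODE_NONE,  # keep the whole model on one GPU
)

With LLAMA_SPLIT_MODE_NONE the model is not split across multiple devices, which is why the earlier offload_kqv / use_mmap / use_mlock tuning knobs are dropped in the same change.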