Spaces:

IronOne-AI-Labs
/

Annual_Report_Summarization_Demo

Sleeping

Lahiru Menikdiwela commited on Nov 26, 2024

Commit

3ef330a

1 Parent(s): d004546

remove quantization due to lack of cuda

Files changed (1) hide show

model.py CHANGED Viewed

@@ -26,7 +26,7 @@ def get_local_model(model_name_or_path:str)->pipeline:
     model = AutoModelForCausalLM.from_pretrained(
         model_name_or_path,
         torch_dtype=torch.bfloat16,
-        load_in_4bit = True,
         token = hf_token
     )
     pipe = pipeline(

     model = AutoModelForCausalLM.from_pretrained(
         model_name_or_path,
         torch_dtype=torch.bfloat16,
+        # load_in_4bit = True,
         token = hf_token
     )
     pipe = pipeline(