ZIISA2

Runtime error

akashmishra358 committed on Feb 6

Commit

b8298df

verified ·

1 Parent(s): 9f904c9

Update model.py

model_url = configs["model"]["generation_model"] # Fixed "genration_model" -> "generation_model"

# 3. Add authentication to model loading
self.model = AutoModelForCausalLM.from_pretrained(
    model_url,
    token=self.hf_token,  # Added authentication
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
    attn_implementation="sdpa",
    device_map="auto"  # Better device handling
)

Changed low_cpu_mem_usage from False to True.
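A minimal sketch of how the loading arguments above could be assembled and sanity-checked before the model is loaded. The helpers `build_load_kwargs` and `load_model` are hypothetical names and are not part of model.py. The guard that rejects `device_map` together with `low_cpu_mem_usage=False` is an assumption: it reflects a check that some transformers releases appear to perform inside `from_pretrained`, which would be consistent with this commit, but the exact behaviour may differ by version.

```python
from typing import Any, Dict, Optional


def build_load_kwargs(
    hf_token: Optional[str],
    low_cpu_mem_usage: bool = True,
    device_map: Optional[str] = "auto",
) -> Dict[str, Any]:
    """Collect the keyword arguments used for from_pretrained in this commit."""
    # Assumption: device_map placement needs the low-memory loading path,
    # so fail early with a clear message instead of during model loading.
    if device_map is not None and not low_cpu_mem_usage:
        raise ValueError("device_map requires low_cpu_mem_usage=True")

    kwargs: Dict[str, Any] = {
        "torch_dtype": "float16",  # dtype name; resolved to torch.float16 in load_model
        "low_cpu_mem_usage": low_cpu_mem_usage,
        "attn_implementation": "sdpa",
    }
    if device_map is not None:
        kwargs["device_map"] = device_map
    if hf_token:
        # Only pass a token when one is configured (needed for gated/private repos).
        kwargs["token"] = hf_token
    return kwargs


def load_model(model_url: str, hf_token: Optional[str] = None):
    """Hypothetical loader; imports are deferred so the helper above stays lightweight."""
    import torch
    from transformers import AutoModelForCausalLM

    kwargs = build_load_kwargs(hf_token)
    kwargs["torch_dtype"] = getattr(torch, kwargs["torch_dtype"])
    return AutoModelForCausalLM.from_pretrained(model_url, **kwargs)
```

Keeping the argument assembly separate from the actual load makes the configuration testable without downloading weights; `load_model(model_url, hf_token)` would then be called once at startup.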

Files changed (1)

model.py +1 -1

model.py CHANGED

@@ -33,7 +33,7 @@ class RAGModel:
             model_url,
             token=self.hf_token,  # Added authentication
             torch_dtype=torch.float16,
-            low_cpu_mem_usage=False,
+            low_cpu_mem_usage=True,
             attn_implementation="sdpa",
             device_map="auto"  # Better device handling
         )