Spaces:

ejschwartz
/

resym

Running on Zero

App Files Files Community

ejschwartz commited on Mar 22

Commit

8e536a8

1 Parent(s): a7c1b7f

Disable field model

Browse files

Files changed (1) hide show

app.py +21 -19

app.py CHANGED Viewed

@@ -20,6 +20,8 @@ This space simply performs inference on the two pretrained models available as
 part of the ReSym artifacts. It takes a variable name and some decompiled code
 as input, and outputs the variable type and other information.
 ## Disclaimer
 I'm not a ReSym developer and I may have messed something up.  In particular,
@@ -39,9 +41,9 @@ tokenizer = AutoTokenizer.from_pretrained("bigcode/starcoderbase-3b")
 vardecoder_model = AutoModelForCausalLM.from_pretrained(
     "ejschwartz/resym-vardecoder", torch_dtype=torch.bfloat16, device_map="auto"
 )
-fielddecoder_model = AutoModelForCausalLM.from_pretrained(
-    "ejschwartz/resym-fielddecoder", torch_dtype=torch.bfloat16, device_map="auto"
-)
 example = r"""__int64 __fastcall sub_410D81(__int64 a1, __int64 a2, __int64 a3)
 {
@@ -103,24 +105,24 @@ def infer(code):
         skip_special_tokens=True,
         clean_up_tokenization_spaces=True,
     )
-    field_output = fielddecoder_model.generate(
-        input_ids=input_ids,
-        max_new_tokens=1024,
-        num_beams=4,
-        num_return_sequences=1,
-        do_sample=False,
-        early_stopping=False,
-        pad_token_id=0,
-        eos_token_id=0,
-    )[0]
-    field_output = tokenizer.decode(
-        field_output[input_ids.size(1) :],
-        skip_special_tokens=True,
-        clean_up_tokenization_spaces=True,
-    )
     var_output = var_name + ":" + var_output
-    field_output = var_name + ":" + field_output
     return var_output, varstring

 part of the ReSym artifacts. It takes a variable name and some decompiled code
 as input, and outputs the variable type and other information.
+The examples are randomly selected from `vardecoder_test.jsonl`.
 ## Disclaimer
 I'm not a ReSym developer and I may have messed something up.  In particular,
 vardecoder_model = AutoModelForCausalLM.from_pretrained(
     "ejschwartz/resym-vardecoder", torch_dtype=torch.bfloat16, device_map="auto"
 )
+# fielddecoder_model = AutoModelForCausalLM.from_pretrained(
+#     "ejschwartz/resym-fielddecoder", torch_dtype=torch.bfloat16, device_map="auto"
+# )
 example = r"""__int64 __fastcall sub_410D81(__int64 a1, __int64 a2, __int64 a3)
 {
         skip_special_tokens=True,
         clean_up_tokenization_spaces=True,
     )
+    # field_output = fielddecoder_model.generate(
+    #     input_ids=input_ids,
+    #     max_new_tokens=1024,
+    #     num_beams=4,
+    #     num_return_sequences=1,
+    #     do_sample=False,
+    #     early_stopping=False,
+    #     pad_token_id=0,
+    #     eos_token_id=0,
+    # )[0]
+    # field_output = tokenizer.decode(
+    #     field_output[input_ids.size(1) :],
+    #     skip_special_tokens=True,
+    #     clean_up_tokenization_spaces=True,
+    # )
     var_output = var_name + ":" + var_output
+    #field_output = var_name + ":" + field_output
     return var_output, varstring