mgyigit commited on
Commit
540e177
·
verified ·
1 Parent(s): 21ae81c

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +13 -9
app.py CHANGED
@@ -225,13 +225,17 @@ with gr.Blocks(theme=gr.themes.Ocean()) as demo:
225
  ## Model Variations
226
 
227
  ### DrugGEN-AKT1
228
- This model is designed to generate molecules targeting the human AKT1 protein (UniProt ID: P31749).
 
229
 
230
  ### DrugGEN-CDK2
231
- This model is designed to generate molecules targeting the human CDK2 protein (UniProt ID: P24941).
 
232
 
233
  ### DrugGEN-NoTarget
234
- This is a general-purpose model that generates diverse drug-like molecules without targeting a specific protein.
 
 
235
  - Useful for exploring chemical space, generating diverse scaffolds, and creating molecules with drug-like properties.
236
 
237
  For more details, see our [paper on arXiv](https://arxiv.org/abs/2302.07868).
@@ -247,13 +251,13 @@ with gr.Blocks(theme=gr.themes.Ocean()) as demo:
247
  - **Runtime**: Time taken to generate or evaluate the molecules
248
 
249
  ### Novelty Metrics
250
- - **Novelty (Train)**: Percentage of molecules not found in the training set
251
- - **Novelty (Inference)**: Percentage of molecules not found in the test set
252
  - **Novelty (Real Inhibitors)**: Percentage of molecules not found in known inhibitors of the target protein
253
 
254
  ### Structural Metrics
255
- - **Average Length**: Average component length in the generated molecules
256
- - **Mean Atom Type**: Average distribution of atom types
257
  - **Internal Diversity**: Diversity within the generated set (higher is more diverse)
258
 
259
  ### Drug-likeness Metrics
@@ -261,8 +265,8 @@ with gr.Blocks(theme=gr.themes.Ocean()) as demo:
261
  - **SA Score (Synthetic Accessibility)**: Score from 1-10 indicating ease of synthesis (lower is better)
262
 
263
  ### Similarity Metrics
264
- - **SNN ChEMBL**: Similarity to ChEMBL molecules (higher means more similar to known drug-like compounds)
265
- - **SNN Real Inhibitors**: Similarity to known drugs (higher means more similar to approved drugs)
266
  """)
267
 
268
  model_name = gr.Radio(
 
225
  ## Model Variations
226
 
227
  ### DrugGEN-AKT1
228
+ This model is designed to generate molecules targeting the human AKT1 protein (UniProt ID: P31749). Trained with [2,607 bioactive compounds](https://drive.google.com/file/d/1B2OOim5wrUJalixeBTDKXLHY8BAIvNh-/view?usp=drive_link).
229
+ Molecules larger than 45 heavy atoms were excluded.
230
 
231
  ### DrugGEN-CDK2
232
+ This model is designed to generate molecules targeting the human CDK2 protein (UniProt ID: P24941). Trained with [1,817 bioactive compounds](https://drive.google.com/file/d/1C0CGFKx0I2gdSfbIEgUO7q3K2S1P9ksT/view?usp=drive_link)/
233
+ Molecules larger than 38 heavy atoms were excluded.
234
 
235
  ### DrugGEN-NoTarget
236
+ This is a general-purpose model that generates diverse drug-like molecules without targeting a specific protein. Trained with a general [ChEMBL dataset]((https://drive.google.com/file/d/1oyybQ4oXpzrme_n0kbwc0-CFjvTFSlBG/view?usp=drive_link)
237
+ Molecules larger than 45 heavy atoms were excluded.
238
+
239
  - Useful for exploring chemical space, generating diverse scaffolds, and creating molecules with drug-like properties.
240
 
241
  For more details, see our [paper on arXiv](https://arxiv.org/abs/2302.07868).
 
251
  - **Runtime**: Time taken to generate or evaluate the molecules
252
 
253
  ### Novelty Metrics
254
+ - **Novelty (Train)**: Percentage of molecules not found in the [training set](https://drive.google.com/file/d/1oyybQ4oXpzrme_n0kbwc0-CFjvTFSlBG/view?usp=drive_link)
255
+ - **Novelty (Inference)**: Percentage of molecules not found in the [test set](https://drive.google.com/file/d/1vMGXqK1SQXB3Od3l80gMWvTEOjJ5MFXP/view?usp=share_link)
256
  - **Novelty (Real Inhibitors)**: Percentage of molecules not found in known inhibitors of the target protein
257
 
258
  ### Structural Metrics
259
+ - **Average Length**: Normalized average number of atoms in the generated molecules, normalized by the maximum atom count (e.g., 45 for AKT1/NoTarget, 38 for CDK2)
260
+ - **Mean Atom Type**: Average number of distinct atom types in the generated molecules
261
  - **Internal Diversity**: Diversity within the generated set (higher is more diverse)
262
 
263
  ### Drug-likeness Metrics
 
265
  - **SA Score (Synthetic Accessibility)**: Score from 1-10 indicating ease of synthesis (lower is better)
266
 
267
  ### Similarity Metrics
268
+ - **SNN ChEMBL**: Similarity to [ChEMBL molecules](https://drive.google.com/file/d/1oyybQ4oXpzrme_n0kbwc0-CFjvTFSlBG/view?usp=drive_link) (higher means more similar to known drug-like compounds)
269
+ - **SNN Real Inhibitors**: Similarity to the real inhibitors of the selected target (higher means more similar to the real inhibitors)
270
  """)
271
 
272
  model_name = gr.Radio(