Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
@@ -225,13 +225,17 @@ with gr.Blocks(theme=gr.themes.Ocean()) as demo:
|
|
225 |
## Model Variations
|
226 |
|
227 |
### DrugGEN-AKT1
|
228 |
-
This model is designed to generate molecules targeting the human AKT1 protein (UniProt ID: P31749).
|
|
|
229 |
|
230 |
### DrugGEN-CDK2
|
231 |
-
This model is designed to generate molecules targeting the human CDK2 protein (UniProt ID: P24941).
|
|
|
232 |
|
233 |
### DrugGEN-NoTarget
|
234 |
-
This is a general-purpose model that generates diverse drug-like molecules without targeting a specific protein.
|
|
|
|
|
235 |
- Useful for exploring chemical space, generating diverse scaffolds, and creating molecules with drug-like properties.
|
236 |
|
237 |
For more details, see our [paper on arXiv](https://arxiv.org/abs/2302.07868).
|
@@ -247,13 +251,13 @@ with gr.Blocks(theme=gr.themes.Ocean()) as demo:
|
|
247 |
- **Runtime**: Time taken to generate or evaluate the molecules
|
248 |
|
249 |
### Novelty Metrics
|
250 |
-
- **Novelty (Train)**: Percentage of molecules not found in the training set
|
251 |
-
- **Novelty (Inference)**: Percentage of molecules not found in the test set
|
252 |
- **Novelty (Real Inhibitors)**: Percentage of molecules not found in known inhibitors of the target protein
|
253 |
|
254 |
### Structural Metrics
|
255 |
-
- **Average Length**:
|
256 |
-
- **Mean Atom Type**: Average
|
257 |
- **Internal Diversity**: Diversity within the generated set (higher is more diverse)
|
258 |
|
259 |
### Drug-likeness Metrics
|
@@ -261,8 +265,8 @@ with gr.Blocks(theme=gr.themes.Ocean()) as demo:
|
|
261 |
- **SA Score (Synthetic Accessibility)**: Score from 1-10 indicating ease of synthesis (lower is better)
|
262 |
|
263 |
### Similarity Metrics
|
264 |
-
- **SNN ChEMBL**: Similarity to ChEMBL molecules (higher means more similar to known drug-like compounds)
|
265 |
-
- **SNN Real Inhibitors**: Similarity to
|
266 |
""")
|
267 |
|
268 |
model_name = gr.Radio(
|
|
|
225 |
## Model Variations
|
226 |
|
227 |
### DrugGEN-AKT1
|
228 |
+
This model is designed to generate molecules targeting the human AKT1 protein (UniProt ID: P31749). Trained with [2,607 bioactive compounds](https://drive.google.com/file/d/1B2OOim5wrUJalixeBTDKXLHY8BAIvNh-/view?usp=drive_link).
|
229 |
+
Molecules larger than 45 heavy atoms were excluded.
|
230 |
|
231 |
### DrugGEN-CDK2
|
232 |
+
This model is designed to generate molecules targeting the human CDK2 protein (UniProt ID: P24941). Trained with [1,817 bioactive compounds](https://drive.google.com/file/d/1C0CGFKx0I2gdSfbIEgUO7q3K2S1P9ksT/view?usp=drive_link)/
|
233 |
+
Molecules larger than 38 heavy atoms were excluded.
|
234 |
|
235 |
### DrugGEN-NoTarget
|
236 |
+
This is a general-purpose model that generates diverse drug-like molecules without targeting a specific protein. Trained with a general [ChEMBL dataset]((https://drive.google.com/file/d/1oyybQ4oXpzrme_n0kbwc0-CFjvTFSlBG/view?usp=drive_link)
|
237 |
+
Molecules larger than 45 heavy atoms were excluded.
|
238 |
+
|
239 |
- Useful for exploring chemical space, generating diverse scaffolds, and creating molecules with drug-like properties.
|
240 |
|
241 |
For more details, see our [paper on arXiv](https://arxiv.org/abs/2302.07868).
|
|
|
251 |
- **Runtime**: Time taken to generate or evaluate the molecules
|
252 |
|
253 |
### Novelty Metrics
|
254 |
+
- **Novelty (Train)**: Percentage of molecules not found in the [training set](https://drive.google.com/file/d/1oyybQ4oXpzrme_n0kbwc0-CFjvTFSlBG/view?usp=drive_link)
|
255 |
+
- **Novelty (Inference)**: Percentage of molecules not found in the [test set](https://drive.google.com/file/d/1vMGXqK1SQXB3Od3l80gMWvTEOjJ5MFXP/view?usp=share_link)
|
256 |
- **Novelty (Real Inhibitors)**: Percentage of molecules not found in known inhibitors of the target protein
|
257 |
|
258 |
### Structural Metrics
|
259 |
+
- **Average Length**: Normalized average number of atoms in the generated molecules, normalized by the maximum atom count (e.g., 45 for AKT1/NoTarget, 38 for CDK2)
|
260 |
+
- **Mean Atom Type**: Average number of distinct atom types in the generated molecules
|
261 |
- **Internal Diversity**: Diversity within the generated set (higher is more diverse)
|
262 |
|
263 |
### Drug-likeness Metrics
|
|
|
265 |
- **SA Score (Synthetic Accessibility)**: Score from 1-10 indicating ease of synthesis (lower is better)
|
266 |
|
267 |
### Similarity Metrics
|
268 |
+
- **SNN ChEMBL**: Similarity to [ChEMBL molecules](https://drive.google.com/file/d/1oyybQ4oXpzrme_n0kbwc0-CFjvTFSlBG/view?usp=drive_link) (higher means more similar to known drug-like compounds)
|
269 |
+
- **SNN Real Inhibitors**: Similarity to the real inhibitors of the selected target (higher means more similar to the real inhibitors)
|
270 |
""")
|
271 |
|
272 |
model_name = gr.Radio(
|