Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
@@ -9,35 +9,35 @@ This application demonstrates a drug discovery pipeline that allows users to:
|
|
9 |
* Predict select ADMET properties for the generated molecules using ADMET-AI.
|
10 |
|
11 |
## Features
|
12 |
-
* Molecule Generation
|
13 |
* Generates up to 3 beta-lactam molecules at a time.
|
14 |
* Users can adjust the creativity (temperature) of the generation process. Higher number leads to more diverse output.
|
15 |
* Generated molecules are named 'Mol01' to 'Mol03'.
|
16 |
-
* Molecule Viewing
|
17 |
* Displays molecule structures using Streamlit.
|
18 |
* Option to view molecules as SMILES or SAFE encodings.
|
19 |
-
* ADMET Property Prediction
|
20 |
* Integrates ADMET-AI to predict select properties.
|
21 |
* Displays predicted properties of each molecule.
|
22 |
|
23 |
## How to Use the App
|
24 |
1. Set Generation Parameters:
|
25 |
-
|
26 |
-
|
27 |
2. Generate Molecules:
|
28 |
-
|
29 |
-
|
30 |
|
31 |
|
32 |
|
33 |
## Technical Details
|
34 |
-
* Generative Model
|
35 |
-
* ADMET Predictions
|
36 |
-
* Visualization
|
37 |
-
* Frameworks and Libraries
|
38 |
-
* Streamlit for the web interface.
|
39 |
-
* Transformers library for model loading and generation.
|
40 |
-
* RDKit for cheminformatics.
|
41 |
|
42 |
|
43 |
|
@@ -56,16 +56,16 @@ Zdrazil B, Felix E, Hunter F, et al. The ChEMBL Database in 2023: a drug discove
|
|
56 |
. doi:10.1093/nar/gkad1004
|
57 |
https://www.ebi.ac.uk/chembl/
|
58 |
|
59 |
-
### SAFE Encoding
|
60 |
This project uses the SAFE Encoding framework, licensed under the Apache License 2.0.
|
61 |
Noutahi E, Gabellini C, Craig M, Lim JS, Tossou P. Gotta be SAFE: A New Framework for Molecular Design. arXiv preprint arXiv:2310.10773, 2023.
|
62 |
https://github.com/datamol-io/safe
|
63 |
|
64 |
-
### This project utilizes the ADMET-AI platform for predicting ADMET properties
|
65 |
Swanson K, Walther P, Leitz J, et al. ADMET-AI: A machine learning ADMET platform for evaluation of large-scale chemical libraries. bioRxiv. 2023. doi:10.1101/2023.12.28.573531
|
66 |
https://admet.ai.greenstonebio.com/
|
67 |
|
68 |
-
### This project uses RDKit:
|
69 |
RDKit: Open-source cheminformatics. https://www.rdkit.org
|
70 |
|
71 |
|
|
|
9 |
* Predict select ADMET properties for the generated molecules using ADMET-AI.
|
10 |
|
11 |
## Features
|
12 |
+
* **Molecule Generation**:
|
13 |
* Generates up to 3 beta-lactam molecules at a time.
|
14 |
* Users can adjust the creativity (temperature) of the generation process. Higher number leads to more diverse output.
|
15 |
* Generated molecules are named 'Mol01' to 'Mol03'.
|
16 |
+
* **Molecule Viewing**:
|
17 |
* Displays molecule structures using Streamlit.
|
18 |
* Option to view molecules as SMILES or SAFE encodings.
|
19 |
+
* **ADMET Property Prediction**:
|
20 |
* Integrates ADMET-AI to predict select properties.
|
21 |
* Displays predicted properties of each molecule.
|
22 |
|
23 |
## How to Use the App
|
24 |
1. Set Generation Parameters:
|
25 |
+
* Use the sidebar to adjust the creativity (temperature) slider.
|
26 |
+
* Select the number of molecules to generate (maximum of 3).
|
27 |
2. Generate Molecules:
|
28 |
+
* Click the 'Generate Molecules' button.
|
29 |
+
* Generated molecules will appear with their structures, strings, and predicted ADMET properties.
|
30 |
|
31 |
|
32 |
|
33 |
## Technical Details
|
34 |
+
* **Generative Model**: Uses the model: 'seyonec/PubChem10M_SMILES_BPE_450k' fine-tuned on beta-lactam structures.
|
35 |
+
* **ADMET Predictions**: Uses the ADMET-AI library to predict molecular properties.
|
36 |
+
* **Visualization**: Employs RDKit and SAFE encoding for molecule rendering.
|
37 |
+
* **Frameworks and Libraries**:
|
38 |
+
* **Streamlit** for the web interface.
|
39 |
+
* **Transformers** library for model loading and generation.
|
40 |
+
* **RDKit** for cheminformatics.
|
41 |
|
42 |
|
43 |
|
|
|
56 |
. doi:10.1093/nar/gkad1004
|
57 |
https://www.ebi.ac.uk/chembl/
|
58 |
|
59 |
+
### SAFE Encoding:
|
60 |
This project uses the SAFE Encoding framework, licensed under the Apache License 2.0.
|
61 |
Noutahi E, Gabellini C, Craig M, Lim JS, Tossou P. Gotta be SAFE: A New Framework for Molecular Design. arXiv preprint arXiv:2310.10773, 2023.
|
62 |
https://github.com/datamol-io/safe
|
63 |
|
64 |
+
### This project utilizes the ADMET-AI platform for predicting ADMET properties:
|
65 |
Swanson K, Walther P, Leitz J, et al. ADMET-AI: A machine learning ADMET platform for evaluation of large-scale chemical libraries. bioRxiv. 2023. doi:10.1101/2023.12.28.573531
|
66 |
https://admet.ai.greenstonebio.com/
|
67 |
|
68 |
+
### This project uses RDKit:
|
69 |
RDKit: Open-source cheminformatics. https://www.rdkit.org
|
70 |
|
71 |
|