Commit 66d2bac (verified), committed by bcadkins01
1 Parent(s): fe9591f

Update README.md

Files changed (1):
  1. README.md +17 -17
@@ -9,35 +9,35 @@ This application demonstrates a drug discovery pipeline that allows users to:
  * Predict select ADMET properties for the generated molecules using ADMET-AI.
 
  ## Features
- * Molecule Generation:
+ * **Molecule Generation**:
  * Generates up to 3 beta-lactam molecules at a time.
  * Users can adjust the creativity (temperature) of the generation process. Higher number leads to more diverse output.
  * Generated molecules are named 'Mol01' to 'Mol03'.
- * Molecule Viewing:
+ * **Molecule Viewing**:
  * Displays molecule structures using Streamlit.
  * Option to view molecules as SMILES or SAFE encodings.
- * ADMET Property Prediction:
+ * **ADMET Property Prediction**:
  * Integrates ADMET-AI to predict select properties.
  * Displays predicted properties of each molecule.
 
  ## How to Use the App
  1. Set Generation Parameters:
- * Use the sidebar to adjust the creativity (temperature) slider.
- * Select the number of molecules to generate (maximum of 3).
+ * Use the sidebar to adjust the creativity (temperature) slider.
+ * Select the number of molecules to generate (maximum of 3).
  2. Generate Molecules:
- * Click the 'Generate Molecules' button.
- * Generated molecules will appear with their structures, strings, and predicted ADMET properties.
+ * Click the 'Generate Molecules' button.
+ * Generated molecules will appear with their structures, strings, and predicted ADMET properties.
 
 
 
  ## Technical Details
- * Generative Model: Uses the model: 'seyonec/PubChem10M_SMILES_BPE_450k' fine-tuned on beta-lactam structures.
- * ADMET Predictions: Uses the ADMET-AI library to predict molecular properties.
- * Visualization: Employs RDKit and SAFE encoding for molecule rendering.
- * Frameworks and Libraries:
- * Streamlit for the web interface.
- * Transformers library for model loading and generation.
- * RDKit for cheminformatics.
+ * **Generative Model**: Uses the model: 'seyonec/PubChem10M_SMILES_BPE_450k' fine-tuned on beta-lactam structures.
+ * **ADMET Predictions**: Uses the ADMET-AI library to predict molecular properties.
+ * **Visualization**: Employs RDKit and SAFE encoding for molecule rendering.
+ * **Frameworks and Libraries**:
+ * **Streamlit** for the web interface.
+ * **Transformers** library for model loading and generation.
+ * **RDKit** for cheminformatics.
 
 
 
@@ -56,16 +56,16 @@ Zdrazil B, Felix E, Hunter F, et al. The ChEMBL Database in 2023: a drug discove
  . doi:10.1093/nar/gkad1004
  https://www.ebi.ac.uk/chembl/
 
- ### SAFE Encoding
+ ### SAFE Encoding:
  This project uses the SAFE Encoding framework, licensed under the Apache License 2.0.
  Noutahi E, Gabellini C, Craig M, Lim JS, Tossou P. Gotta be SAFE: A New Framework for Molecular Design. arXiv preprint arXiv:2310.10773, 2023.
  https://github.com/datamol-io/safe
 
- ### This project utilizes the ADMET-AI platform for predicting ADMET properties. ADMET-AI is licensed under the MIT License.
+ ### This project utilizes the ADMET-AI platform for predicting ADMET properties:
  Swanson K, Walther P, Leitz J, et al. ADMET-AI: A machine learning ADMET platform for evaluation of large-scale chemical libraries. bioRxiv. 2023. doi:10.1101/2023.12.28.573531
  https://admet.ai.greenstonebio.com/
 
- ### This project uses RDKit: Open-source cheminformatics software.
+ ### This project uses RDKit:
  RDKit: Open-source cheminformatics. https://www.rdkit.org
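
The README in this diff says that a higher creativity (temperature) setting leads to more diverse output. The sketch below illustrates the general temperature-scaling idea behind that statement; it is not the app's code. The function names and the example scores are hypothetical, and it assumes the model samples tokens from a softmax over temperature-divided scores, which is how temperature is commonly applied in text generation.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; larger values mean a more spread-out distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical next-token scores (assumption: not taken from the real model).
example_logits = [2.0, 1.0, 0.1]

low_t = softmax_with_temperature(example_logits, 0.5)   # sharper distribution
high_t = softmax_with_temperature(example_logits, 1.5)  # flatter distribution

print("T=0.5:", [round(p, 3) for p in low_t], "entropy:", round(entropy(low_t), 3))
print("T=1.5:", [round(p, 3) for p in high_t], "entropy:", round(entropy(high_t), 3))
```

At the lower temperature most of the probability mass sits on the top-scoring token, so repeated sampling tends to give similar results. At the higher temperature the mass spreads across the alternatives, which is consistent with the README's note about more diverse output.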