Comments (2)
Hello @ankitskvmdam,
After some thought, I suggest the following:
- Let's use REINVENT in transfer learning mode, as you suggested. As a starting model, we can use the Mol2MolSimilarity.
- Let's use 3D shape search via CHEESE.
The process will be the following:
- A molecule A is passed as input.
- We do a 3D Shape Similarity search via the CHEESE API. We can use the Enamine REAL database as the reference library. From that search, we get the top N compounds (L).
- Of these N compounds (for example, N = 100), we use 80% as a training set for the transfer learning, 10% as a validation set, and 10% as a held-out test set.
- We train (fine-tune) REINVENT with transfer learning using the training set, monitoring its performance with the validation set.
- With the test set, we make sure that, indeed, the test compounds are similar in 3D shape to molecule A. This 3D shape comparison can be done with VSFlow, if that is easier.
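As a rough sketch of the 80/10/10 split described above, assuming the CHEESE search results have already been collected as a plain list of SMILES strings. The function name `split_hits`, the random shuffle, and the placeholder SMILES are illustrative assumptions, not part of CHEESE or REINVENT:

```python
import random


def split_hits(smiles, train_frac=0.8, valid_frac=0.1, seed=42):
    """Shuffle the hits and split them into train / validation / test lists.

    Hypothetical helper: the remaining fraction (here 10%) becomes the
    held-out test set.
    """
    if not (0 < train_frac < 1 and 0 <= valid_frac < 1 and train_frac + valid_frac < 1):
        raise ValueError("fractions must leave room for a non-empty test set")

    shuffled = list(smiles)
    # Fixed seed so the split is reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_train = round(len(shuffled) * train_frac)
    n_valid = round(len(shuffled) * valid_frac)

    train = shuffled[:n_train]
    valid = shuffled[n_train:n_train + n_valid]
    test = shuffled[n_train + n_valid:]
    return train, valid, test


# Placeholder SMILES standing in for the top N = 100 CHEESE hits (assumption).
hits = ["C" + "C" * i + "O" for i in range(100)]
train, valid, test = split_hits(hits)
print(len(train), len(valid), len(test))  # 80 10 10
```

The shuffle is a design choice: search results usually come back ranked by similarity, so splitting them in order would put the closest matches in training and the weakest in the test set. If a ranked or stratified split is preferred, that step would need to change.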
I hope this makes sense.
Then, I have more ideas to complicate things further (for example, to search against other databases such as ZINC in CHEESE, to do multiple similarity searches, to penalize molecules that are similar in 2D (favouring scaffold hopping), etc.). But let's go step by step.
Please let me know if something is not clear, @ankitskvmdam!
@miquelduranfrigola It makes sense. I will proceed with this.