Comments (2)
Hi, thank you for being interested in our work!
The Metal Ion Binding dataset with valid set of 1066 and test set of 1083 samples is an early version, which is not clustered by sequence identity. The lmdb file uploaded is the final data on which we test all baselines. We have updated our paper here. In the latest preprint, we revised the size of datasets to match the real situations.
Hope this could resolve your problem!
from saprot.
Thank you for your reply!
from saprot.
Related Issues (20)
- Wrong link to hugging face model HOT 1
- Getting protein embeddings HOT 5
- Additional input values generated by the tokenizer HOT 5
- Unable to Open Downstream Task MDB Files in Access After Downloading from GitHub Tutorial HOT 2
- Release code about Contact Head prediction and evaluation HOT 2
- Setting my configuration to for SaProt_650M_AF2 HOT 1
- Downstream Annotation task 'EC/GO' finetuning overfitting HOT 3
- The meaning of output size HOT 2
- Why is the length of the SA-token sequence not equal to the length of the model outputs? HOT 1
- EC GO results HOT 3
- pretraining esm2_t33_650M_UR50D HOT 3
- Foldseek can't be used HOT 2
- Finetuning GPU memory cost HOT 2
- Mismatch between ESM2 pretraining dataset and SaProt pretraining dataset HOT 1
- per residue representations HOT 2
- Problem of 'Some weights of EsmForMaskedLM were not initialized from the model checkpoint at ./SaProt_model and are newly initialized' HOT 1
- 'utf-8' codec can't decode byte 0x80 in position 64: invalid start byte HOT 1
- How to use my own dataset HOT 8
- Reproducibility issues - Different scores than the ones from ProteinGym HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from saprot.