Comments (3)
I've encountered a similar issue. I believe the recently released flash-attention version 2 significantly improves the execution time for DNA-BERT2. By the way, I greatly appreciate your work; it has been very helpful to me.
from dnabert_2.
Hey,
Thank you very much for your interest in our work and for this great suggestion!
However, I can't do it right away since I have been quite busy recently. But you are very welcome to submit PRs if you find a good way to do it. I will work on it after a few weeks.
from dnabert_2.
I think this might be harder than one would expect, since the current triton implementation (only one that allows for AliBi) is pretty well known to be broken, unless you have a specific dev version of triton. There's a lot of discussion on the flash repo pertaining to this. If someone really wants to give it a go, this might be promising to use instead.
But currently, implementation is a bit more involved than just switching out a few lines on DNABERT_2.
from dnabert_2.
Related Issues (20)
- While Doing Quick Start, I Encountered an AssertionError: HOT 2
- When will the pretraining code be available?
- .
- environment about torch version HOT 1
- problem stll in environment HOT 6
- hidden_states = model(inputs)[0] # [1, sequence_length, 768]-- Is the second dimension really the sequence length? HOT 1
- Discuss a question about k-mer
- When will the code for pre-training model and training BPE tokenizer be available?
- Quickstart Does not work and Embedding Dim is not 768
- Pretraining, Pretraining, Pretraining!!! HOT 2
- I always encounter this error during the fine-tuning evaluation phase HOT 1
- Fine-tune for continuous labels HOT 2
- How do I output the attention from the model? HOT 1
- Special token treatment.
- splice site predictions
- Unable to Retrieve ' hidden_states ' Despite ' Setting return_dict=True ' and ' output_hidden_states=True ' HOT 3
- Cannot Reproduce DNA-BERT2‘s Result HOT 1
- GUE+ datasets?
- Is it neccessary to train a specific BPE tokenizer on own datasets? HOT 1
- Getting embedding of a sequence
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dnabert_2.