Hey everyone, I've been using RoBERTa for the past year or so but have been looking in

Fine-tune DeBERTa v3 language model, worthwhile endeavour? about deberta HOT 5 OPEN

shensmobile commented on June 14, 2024

Fine-tune DeBERTa v3 language model, worthwhile endeavour?

from deberta.

Comments (5)

StephennFernandes commented on June 14, 2024

given that you have a significantly good amount of training data, i believe this could be a really good endevour as the DebERTa-v3 architecture and training procedure is insanely great. good h-param search and a nice continual pretraining should give great results. do let me know how it goes.

from deberta.

shensmobile commented on June 14, 2024

Would I use the deberta-v3-X-continue in rtd.sh or pretrain a model from scratch using my dataset?

from deberta.

StephennFernandes commented on June 14, 2024

do continual pretraining, i mean use the deberta-v3-X-continue. all medical domain LM are a result of continual pretraining

from deberta.

priamai commented on June 14, 2024

Hi all, I am in the exact same boat here. What is that rtd.sh is mentioned? I mean I know is a bash file but where is it ? Would be nice to see a python script that shows how the domain adaptation should be run and how to save the model.

from deberta.

fmobrj commented on June 14, 2024

do continual pretraining, i mean use the deberta-v3-X-continue. all medical domain LM are a result of continual pretraining

Hi, @StephennFernandes. How are you doing? Have you managed to sucessfully pretrain or continue pretraining a deberta V3 model in another language? Back when we were talking, my discriminator couldnt get better.

Best regards, Fabio.

from deberta.

Recommend Projects

Fine-tune DeBERTa v3 language model, worthwhile endeavour? about deberta HOT 5 OPEN

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent