Comments (6)
I modified the file above:
return self.decodeBytes(tokens).decode('utf-8', 'ignore')
Now the character λ is lost.
@cgisky1980 You can also try the 'replace' mode for errors here, which has a better chance of producing all characters properly when using the RWKV World models.
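For reference, a minimal sketch of how the two error modes differ in plain Python. The byte strings are made up for illustration and do not come from an actual model run; `decode_with` is a hypothetical helper, not part of rwkv.cpp.

```python
# "λ" is two bytes in UTF-8: 0xCE 0xBB.
full = "aλb".encode("utf-8")   # complete sequence
broken = b"a\xceb"             # illustrative input: second byte of λ is missing


def decode_with(data: bytes, errors: str) -> str:
    """Hypothetical helper: decode UTF-8 bytes with the given error mode."""
    return data.decode("utf-8", errors)


# 'ignore' silently drops the invalid byte, so the character disappears.
ignored = decode_with(broken, "ignore")

# 'replace' substitutes U+FFFD, so the damage stays visible in the output.
replaced = decode_with(broken, "replace")

# The default 'strict' mode raises UnicodeDecodeError instead.
try:
    decode_with(broken, "strict")
    strict_raised = False
except UnicodeDecodeError:
    strict_raised = True
```

Neither mode recovers the missing character; 'replace' only makes the loss visible instead of hiding it.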
from rwkv.cpp.
With the RWKV World FP16 model, the output is correct.
Do you mean that the same code works correctly when using FP16 model?
yes
I mean, models without quantization don't have this problem
Then I think this issue actually points at 2 separate problems:
1. The quantized model produces less correct text than the non-quantized model.
2. A UnicodeDecodeError is thrown.
For 1, there is no real solution. Quantization reduces quality; this is expected, since information is cut from the model to make it smaller.
2 is an actual bug that can be fixed. I'll put it into my backlog, but anyone can take it.
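One possible direction for problem 2, as a sketch only: if the error comes from decoding output piece by piece while a multi-byte character is split across pieces (an assumption, the thread does not confirm the cause), Python's incremental decoder can buffer incomplete bytes until the rest arrives. `stream_decode` and the sample chunks are hypothetical and not part of rwkv.cpp.

```python
import codecs
from typing import Iterable, Iterator


def stream_decode(chunks: Iterable[bytes], errors: str = "replace") -> Iterator[str]:
    """Hypothetical helper: decode UTF-8 byte chunks without failing on split characters."""
    decoder = codecs.getincrementaldecoder("utf-8")(errors=errors)
    for chunk in chunks:
        # Incomplete trailing bytes are kept inside the decoder until completed.
        text = decoder.decode(chunk)
        if text:
            yield text
    # Flush: anything still incomplete at the end goes through the error handler.
    tail = decoder.decode(b"", final=True)
    if tail:
        yield tail


# Illustrative chunks: the two bytes of "λ" (0xCE 0xBB) arrive separately.
pieces = list(stream_decode([b"a", b"\xce", b"\xbb", b"b"]))

# Illustrative chunks: the stream ends before "λ" is complete.
truncated = list(stream_decode([b"a", b"\xce"]))
```

With this approach a split character is emitted once its last byte arrives, and only a sequence that is never completed ends up as U+FFFD.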
Because this issue is 100% reproducible with the 3B and 7B models, I don't think it's a problem of accuracy loss.
ps. https:github.com//issues/19 this is the good first issue. LOL
Yes, it works.