Comments (5)
The older tokenizer file, saved as tokenizer.json on the model hub, imports fine.
diff tokenizer-works.json tokenizer-fails.json
71c71,72
< "add_prefix_space": true
---
> "prepend_scheme": "always",
> "split": true
157c158,159
< "add_prefix_space": true
---
> "prepend_scheme": "always",
> "split": true
1000171c1000173,1000174
< ]
---
> ],
> "byte_fallback": false
The file that fails to import introduces a new parameter, byte_fallback,
which could match up with the error
data did not match any variant of untagged enum PreTokenizerWrapper
from vespa.
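To see which sections of the two files the changed keys belong to, a structural comparison can be easier to read than a line diff. The following is a minimal sketch, not something taken from the thread: `flatten` and `diff_keys` are hypothetical helper names, and the nesting used in the toy input (`pre_tokenizer`, `model`) is an assumption about where the keys from the diff sit in the real files.

```python
import json


def flatten(node, prefix=""):
    """Yield (path, value) pairs for every scalar leaf in a parsed JSON tree."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from flatten(value, f"{prefix}/{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from flatten(value, f"{prefix}/{index}")
    else:
        yield prefix, node


def diff_keys(old, new):
    """Return (removed_paths, added_paths) between two parsed JSON documents."""
    old_paths = dict(flatten(old))
    new_paths = dict(flatten(new))
    removed = sorted(old_paths.keys() - new_paths.keys())
    added = sorted(new_paths.keys() - old_paths.keys())
    return removed, added


if __name__ == "__main__":
    # File names as used in the diff command above.
    with open("tokenizer-works.json", encoding="utf-8") as f:
        works = json.load(f)
    with open("tokenizer-fails.json", encoding="utf-8") as f:
        fails = json.load(f)
    removed, added = diff_keys(works, fails)
    print("removed:", removed)
    print("added:", added)
```

The printed paths show the parent section of each changed key, which helps judge whether a given key is a plausible cause of a PreTokenizerWrapper parsing error.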
Have they fixed it on HEAD? If so, I guess 0.28 will be out soon.
Created deepjavalibrary/djl#3141
Workaround: patch the tokenizer.json file to remove the key. See vespa-engine/sample-apps#1421
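A patch of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the change made in the linked PR: it assumes the key to remove is `byte_fallback` and that it sits in the top-level `model` object (the diff above shows it right after a long list, which is consistent with that but does not prove it). `drop_model_key` and `patch_tokenizer` are hypothetical helper names.

```python
import json
from pathlib import Path


def drop_model_key(tokenizer, key="byte_fallback"):
    """Return a copy of the parsed tokenizer.json without `key` in its "model" object.

    Assumption: the key lives under the top-level "model" object.
    If it is not there, the data is returned unchanged.
    """
    patched = dict(tokenizer)
    model = patched.get("model")
    if isinstance(model, dict) and key in model:
        patched["model"] = {k: v for k, v in model.items() if k != key}
    return patched


def patch_tokenizer(src, dst, key="byte_fallback"):
    """Read the tokenizer file at `src`, drop `key`, and write the result to `dst`."""
    data = json.loads(Path(src).read_text(encoding="utf-8"))
    patched = drop_model_key(data, key)
    # ensure_ascii=False keeps non-ASCII vocabulary entries readable in the output.
    Path(dst).write_text(json.dumps(patched, ensure_ascii=False, indent=2), encoding="utf-8")


if __name__ == "__main__":
    patch_tokenizer("tokenizer-fails.json", "tokenizer-patched.json")
```

Writing to a separate output file keeps the original around, so the patched copy can be diffed against the version that imports fine.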