Comments (1)
Thanks for logging this issue.
Unfortunately, if a session has items queued for inference and a load or save is attempted, this can cause an exception because the session is busy.
But this leaves you stuck with no way of knowing when to save, which was an oversight on my part.
Saving a queued session is a bit tricky, and I have implemented a fix. It may not be the final implementation, but it will allow you to save per message as you are doing now. It adds a SaveOnComplete flag to the 'QueueInferTextAsync' call; setting this to true on an item will save the ModelSessionState when the queue processes that item. You can provide this on every message or every x messages, it's up to you.
I also exposed an InferQueueCount property on the ModelSessionService so you can access the queue count if required.
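To illustrate why saving from inside the queue avoids the "session is busy" exception, here is a minimal, language-neutral sketch in Python. It is not the LLamaStack code or its C# API: `SessionSketch`, `queue_infer_text`, `process_queue`, `infer_queue_count` and `snapshots` are all hypothetical names that only mimic the idea of a per-item save flag and a queue count.

```python
import asyncio
from collections import deque
from dataclasses import dataclass


@dataclass
class QueueItem:
    prompt: str
    save_on_complete: bool = False  # hypothetical stand-in for the SaveOnComplete flag


class SessionSketch:
    """Toy session: NOT the LLamaStack implementation, only the queuing idea."""

    def __init__(self):
        self.history = []        # processed prompts, standing in for session state
        self.snapshots = []      # each "save" records a copy of the history
        self._pending = deque()  # items waiting for inference

    @property
    def infer_queue_count(self):
        # Hypothetical stand-in for the InferQueueCount property.
        return len(self._pending)

    def queue_infer_text(self, prompt, save_on_complete=False):
        self._pending.append(QueueItem(prompt, save_on_complete))

    async def _infer(self, prompt):
        await asyncio.sleep(0)   # placeholder for real inference work
        self.history.append(prompt)

    async def process_queue(self):
        # Items run one at a time, so a save triggered here cannot
        # overlap with inference on the same session.
        while self._pending:
            item = self._pending.popleft()
            await self._infer(item.prompt)
            if item.save_on_complete:
                self.snapshots.append(list(self.history))


async def demo():
    session = SessionSketch()
    session.queue_infer_text("first")
    session.queue_infer_text("second", save_on_complete=True)
    session.queue_infer_text("third")
    queued = session.infer_queue_count
    await session.process_queue()
    return queued, session.snapshots, session.infer_queue_count


queued, snapshots, remaining = asyncio.run(demo())
```

In this sketch the save happens only after the flagged item finishes, so the caller never has to guess when the session is idle; flagging every message or every x messages just changes how often a snapshot is taken.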