Comments (7)
We're using Bitnami's helm chart and they didn't upgrade yet (server-side on 2.11.3
). Will report back once they do.
from mlflow.
Could you share the code used for logging the metrics?
from mlflow.
We're using NVIDIA's NeMo, specifically the MTEncDec model. The logging call is this:
from mlflow.
@mlflow/mlflow-team Please assign a maintainer and start triaging this issue.
from mlflow.
Hi, I am having the same issue with mlflow tracking server and client 2.11.3 (using pytorch lightning MLFlowLogger logger.log(..., on_step=True, on_epoch=True)
). The issue appeared after upgrading mlflow. More precisely, the values logged at each gradient step are shown properly but the values logged at each epoch (average train loss and validation loss) show as a single point.
It seems the logging call refered to in the link above is a pytorch lightning logger call, so the root cause is likely the same.
from mlflow.
Does the issue still persist if you upgrade the tracking server to 2.12.1? There was a bug fix to sampling logic that should be shipped in the latest version.
from mlflow.
This appears to be fixed with 2.12.1 which is now available. Thanks!
from mlflow.
Related Issues (20)
- Improve `_init_server` HOT 2
- [FR] Improve UI stability to corrupt metric files HOT 2
- [BUG] UI Crash - Unterminated string in JSON at position 5000 for mlflow.log-model.history HOT 4
- [BUG] pyfunc.load_model ignores logged model with trust_remote_code set to True HOT 1
- [BUG] HOT 2
- [BUG]Prompt Engineering request from UI to Deployments Server Connection TimeOut HOT 5
- [FR]MLflow Deployments Server Support inside corporate proxy HOT 3
- Fix typos
- Fix docstrings in `mlflow/tracing` HOT 1
- [FR] Multiple retrievers with mlflow.langchain.log_model HOT 1
- [BUG] MLFlow Deployment Server for LLMs using chatCompletion on Azure OpenAI text-davinci-003 HOT 4
- [SETUP-BUG] Multi-Cloud artifact-destination migration HOT 3
- mlflow.pyfunc.load_model is loading model of class <class 'mlflow.pyfunc.PyFuncModel'> instead of original class HOT 2
- [BUG] ModuleNotFoundError: No module named 'fcntl' HOT 1
- Artifact files are not removed from tmp/ folder HOT 2
- [BUG] MLFlow infer signature requires transformers but the model is not a transformer HOT 2
- Add `trailing-whitespace` to remove trailing whitespace in `.rst` files HOT 4
- [BUG] ModuleNotFoundError: No module named 'opentelemetry.semconv' HOT 5
- [BUG] module 'PIL' has no attribute 'Image' when performing mlflow.log_image HOT 1
- [BUG] Unable to load images logged by mlflow.log_image HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mlflow.