Comments (8)
Yes, all CM scripts are modular and so we can do this. For example, the below command will download the full imagenet validation set and exports the downloaded paths.
cm run script - -tags=get,dataset,val,imagenet,original,_full -j
from ck.
Thank you !
I tried it but it is complaining, am I missing something ?
$ cm run script - -tags=get,dataset,val,imagenet,original, _full -j
* cm run script _full --tags="get,dataset,val,imagenet,original"
CM error: no scripts were found with above tags (when variations ignored)!
from ck.
Ok, the space before _full
had to be removed, that one is working now.
But I am trying this example and nothing is happening, do you know why or how I can debug further ?
https://github.com/mlcommons/ck/tree/master/cm-mlops/script/get-ml-model-retinanet#cm-cli
$ cm run script --tags=get,ml-model,raw,resnext50,retinanet,object-detection
* cm run script "get ml-model raw resnext50 retinanet object-detection"
Thanks !
from ck.
Hi @jdesfossez,
CM scripts installs artifacts to the CM cache and make them available to other CM scripts via API and/or ENV variables.
You can see the cache with all artifacts including above model as follows:
cm show cache
cm show cache --tags=get,ml-model,resnext50
You can find your model and extra CM meta files as follows:
cm find cache --tags=get,ml-model,raw,resnext50,retinanet,object-detection
Basically CM is a database of objects connected by tags, UIDs and ENV variables ...
Please check these 2 tutorials that may give you more ideas behind CM:
- https://github.com/mlcommons/ck/blob/master/docs/tutorials/modular-image-classification.md
- https://github.com/mlcommons/ck/blob/master/docs/tutorials/sc22-scc-mlperf.md
That's how we reuse individual CM scripts (and workflows assembled from those scripts) for reproducibility initiatives at conferences and other initiatives to make it easier to run AI on different platforms ...
We are interested to know your use cases and how CM can help - please feel free to talk to us via Discord server or we can set up a conf-call ...
Thank you for your interest and feedback!
from ck.
Hi !
Thank you, that helps a lot, I missed the first tutorial, I am glad you linked it here.
My current goal is to automate performance testing of GPUs in a public cloud environment. I need to easily and quickly compare the impact of various hypervisor-level changes, so this project seems perfect for that purpose. Eventually I will use it as well to submit results.
Another quick question, is there a clean way for me to specify at run-time the location of the data ? For example if I wanted to make a local mirror and have the VM download from there instead of hitting the public servers.
Thanks again !
from ck.
Sorry @jdesfossez for the typo -- I was typing on mobile :(
"is there a clean way for me to specify at run-time the location of the data "
I believe you want to use a private URL here right? Currently we are supporting multiple downloaded sources like this but not custom URLs - we can do this by next release.
But for most of the large datasets, there is an option to provide the path
to it via input/env variable and prevent a download like done here
from ck.
This solution can work for using custom URLs.
from ck.
ah perfect, thank you so much !
I will close this as it's not really an issue, but I appreciate the guidance !
from ck.
Related Issues (20)
- Update all tests in GitHub workflows for mlcommons@cm4mlops HOT 1
- running on ARM64? HOT 5
- Running dlrm cpu inference ends up using resnet50 HOT 2
- Print migration warning when using mlcommons@ck HOT 7
- Add universal check of env vars in cmind.utils HOT 1
- Add "prototype" flag to CM script meta
- `convert_path` is not part of setuptools API and will be removed HOT 7
- Refactor CM, CM for MLOps and CM for MLPerf docs and tutorials
- Improving CM core
- When dumping version info from dependencies, variations do not have _ HOT 1
- Warning Encountered During pip install cmind on Ubuntu via WSL HOT 3
- KeyError in MLPerf Inference with ResNet-50 HOT 5
- Could not identify license file for opentelemetry-cpp HOT 1
- CUDA version 12.4 not supported for this cm command HOT 1
- Support branch for cm pull repo
- How do you specify which GPU to run an Mlperf benchmark on with CM? HOT 6
- cm add script is failing on new CM repository HOT 1
- Support ssh URLs in cm pull repo HOT 3
- Improve the accessibility of the documentations HOT 1
- How to prevent caching? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ck.