Comments (12)
Giovanni,
OK, I will take a look at the API structure and see what I'm missing to call. Which version are you using ?
Regards, Mark
from pyslurm.
pyslurm-14.11.5
slurm 14.11.9
(going to 15.08 soon)
from pyslurm.
Thanks, looks like I will need to call out to the opaque data structure via select_nodeinfo. Most of the code is there but not complete or tested for some time. As I'm working on slurm-14.11.x for other people I will look to adding it in to that branch first and then 15.x later.
from pyslurm.
Great! Thanks. I can help test that portion of the code as well.
from pyslurm.
It is now coded up but I will need to test it tonight/tomorrow before I push some code to you and then I will upload the changes to git.
from pyslurm.
OK, here we go. I believe It appears to work.......which is nice ! Of course I was banging my head on the desk as I was still getting zero until I realised that I needed to turn on "SelectTypeParameters=CR_CORE_MEMORY" in the config......dooh.
Please find attached a zipped replacement pyslurm.pyx for you. Obviously you will need to replace the existing one and do a build/install etc.
from pyslurm.
Looks good! Just need to remove the print statement on line 2385.
from pyslurm.
Cool, I will remove the debug line as well.
from pyslurm.
Giovanni,
If you pull the latest 14.11.5 the changes as discussed previously have been committed. Please let me know if this now works and close the ticket if you are happy. I will look at the license gres handling now !
Mark
from pyslurm.
I pulled the latest 14.11.5 and rebuilt. alloc_memory is now not showing up for any nodes.
>>> pprint(nodes.get("cn0001").get("alloc_memory"))
None
Here's the diff between the latest 14.11.5 and yesterday's out-of-band patch:
$ diff 14.11.5/pyslurm/pyslurm.pyx patch/pyslurm/pyslurm.pyx
2148a2148,2149
> self.get()
>
2278c2279
< int i, total_used, cpus_per_node, rc
---
> int i, total_used, cpus_per_node, rc, alloc_mem
2281c2282
< uint32_t my_state, alloc_mem
---
> uint32_t my_state
2298,2300d2298
< if self._record.name is NULL:
< break
<
2302c2300,2301
< alloc_mem = alloc_cpus = err_cpus = 0
---
> alloc_cpus = err_cpus = 0
> alloc_mem = 0
2406c2405
< Host_dict['err_cpus'] = err_cpus
---
> Host_dict['alloc_memory'] = alloc_mem
2407a2407
> Host_dict['err_cpus'] = err_cpus
from pyslurm.
Yep, too much haste ! Have committed missing host dictionary entry, please try again
from pyslurm.
Yup, looks good. Thanks Mark!
from pyslurm.
Related Issues (20)
- pyslurm throw KeyError 5039 when statistics().get() called HOT 2
- symbol lookup error: undefined symbol: data_init HOT 7
- Installation fails for v22.5.x HOT 2
- Submittion of a batch job will be failed when argument "work_dir" contains a "_" HOT 1
- Unclear that `Jobs.load()` returns a dictionary. HOT 2
- Missing `Job` fields compared to the docs HOT 2
- slurmdbd: Implement Association API
- ValueError: invalid literal for int() with base 10: 'milton.m' HOT 1
- Copy/Paste error in Job class memory property HOT 1
- Cannot seem to get allocated gres info from Node HOT 4
- Node in Nodes collection do not report a MIXED state, only IDLE or ALLOCATED HOT 5
- A job is found with JobFilter if it is running on start_time HOT 6
- question: install on cluster without slurm-devel package and without root access HOT 2
- job mem returns as "None" HOT 3
- error: ‘KILL_JOB_ARRAY’ undeclared HOT 1
- Slurm 23.11 support HOT 3
- pyslurm is installation failed HOT 2
- Particular value of gres is causing cstr to crash HOT 1
- [guidance request] 'gres' : 'gpu' does not have the same behavior as when using bash HOT 2
- Problem in setup.py get_version() - packaging.version object is missing .major and .minor attributes in earlier module versions under Python 3.6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pyslurm.