Comments (7)
Hi,
Sorry for the delay; I have been distracted by other things. I will take a look at the weekend.
Mark
from pyslurm.
Thanks Mark. Good luck.
Is there any additional info you can provide? For example, the cluster/platform and which nodes were in the cluster.
Laparn,
I have traced the issue. This whole area of code needs some serious reworking, but I have a temporary fix for the moment. I will commit the change once I have checked the data more.
Regards
Mark
I found the error message in src/plugins/select/linear/select_linear.c, at the end of the function select_p_select_nodeinfo_get:
extern int select_p_select_nodeinfo_get(select_nodeinfo_t *nodeinfo,
                                        enum select_nodedata_type dinfo,
                                        enum node_states state,
                                        void *data)
{
    int rc = SLURM_SUCCESS;
    uint16_t *uint16 = (uint16_t *) data;
    char **tmp_char = (char **) data;
    select_nodeinfo_t **select_nodeinfo = (select_nodeinfo_t **) data;

    if (nodeinfo == NULL) {
        error("get_nodeinfo: nodeinfo not set");
        return SLURM_ERROR;
    }

    if (nodeinfo->magic != NODEINFO_MAGIC) {
        error("get_nodeinfo: nodeinfo magic bad");
        return SLURM_ERROR;
    }

    switch (dinfo) {
    case SELECT_NODEDATA_SUBGRP_SIZE:
        *uint16 = 0;
        break;
    case SELECT_NODEDATA_SUBCNT:
        if (state == NODE_STATE_ALLOCATED)
            *uint16 = nodeinfo->alloc_cpus;
        else
            *uint16 = 0;
        break;
    case SELECT_NODEDATA_PTR:
        *select_nodeinfo = nodeinfo;
        break;
    case SELECT_NODEDATA_RACK_MP:
    case SELECT_NODEDATA_EXTRA_INFO:
        *tmp_char = NULL;
        break;
    default:
        error("Unsupported option %d for get_nodeinfo.", dinfo);
        rc = SLURM_ERROR;
        break;
    }

    return rc;
}
The dinfo argument is an enum select_nodedata_type, which is defined in slurm.h.
The switch above has no case for SELECT_NODEDATA_BITMAP. When that value is passed in (as it is here), execution reaches the default branch and the error message is printed.
enum select_nodedata_type {
    SELECT_NODEDATA_BITMAP_SIZE, /* data-> uint16_t */
    SELECT_NODEDATA_SUBGRP_SIZE, /* data-> uint16_t */
    SELECT_NODEDATA_SUBCNT,      /* data-> uint16_t */
    SELECT_NODEDATA_BITMAP,      /* data-> bitstr_t * needs to be
                                  * freed with FREE_NULL_BITMAP */
    SELECT_NODEDATA_STR,         /* data-> char * needs to be
                                  * freed with xfree */
    SELECT_NODEDATA_PTR,         /* data-> select_nodeinfo_t *nodeinfo */
    SELECT_NODEDATA_EXTRA_INFO,  /* data-> char * needs to be
                                  * freed with xfree */
    SELECT_NODEDATA_RACK_MP,     /* data-> char * needs to be
                                  * freed with xfree */
};
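To make the mechanism concrete, here is a minimal standalone sketch of the same pattern. It is not the Slurm source: the demo_/DEMO_ names, the return codes and the placeholder value 4 are all hypothetical stand-ins. It only shows how an enum value with no matching case ends up in the default branch and produces the error.

```c
#include <assert.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical stand-ins for the Slurm definitions (illustration only). */
enum demo_nodedata_type {
    DEMO_BITMAP_SIZE,
    DEMO_SUBGRP_SIZE,
    DEMO_SUBCNT,
    DEMO_BITMAP,
};

#define DEMO_SUCCESS 0
#define DEMO_ERROR   (-1)

/* Same shape as the switch quoted above: DEMO_BITMAP has no case of its
 * own, so it is handled by default, which logs and returns an error. */
static int demo_nodeinfo_get(enum demo_nodedata_type dinfo, uint16_t *out)
{
    assert(out != NULL);

    switch (dinfo) {
    case DEMO_SUBGRP_SIZE:
        *out = 0;
        return DEMO_SUCCESS;
    case DEMO_SUBCNT:
        *out = 4; /* arbitrary placeholder value */
        return DEMO_SUCCESS;
    default:
        /* Reached for every value without an explicit case. */
        fprintf(stderr, "Unsupported option %d for get_nodeinfo.\n",
                (int) dinfo);
        return DEMO_ERROR;
    }
}
```

In this sketch, calling demo_nodeinfo_get(DEMO_BITMAP, &value) returns DEMO_ERROR and leaves value untouched, because only the default branch runs.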
This is clearly not a pyslurm bug, but a slurm one. It should be reported there. There is no impact on pyslurm.
Hi,
Thank you for the report. I will have a look, just to make sure PySlurm is not missing some code logic, once the pressures of work have subsided :(
Mark
I reworked the code for this a long time ago, so this issue should be closed. Please reopen it if it is still a problem.