Comments (9)
Maria Pena-Guerrero Which MIRI file are you using for the above?
from jwst.
Comment by Maria Pena-Guerrero on JIRA:
David Law the file is
jw01283001001_03101_00001_mirimage_uncal.fits
from jwst.
Hm, just to add to this puzzle I tried processing this file myself with all of the latest versions, through a default strun of calwebb_detector1
The entire process completed in about 12 minutes, with jump the longest running step at about 7 minutes.
I've got 64 GB of physical memory in my MacBook Pro; 61 GB of that got used at peak, though I saw the python usage spike up to 153 GB during the jump step (presumably using virtual memory to handle it).
from jwst.
Comment by Maria Pena-Guerrero on JIRA:
David Law more to the mystery.... did you run the Detector1 pipeline with a file or did you open the datamodel first? i.e.
option A -> det1.call(uncal_file.fits)
option B -> det1.call(uncal_model)
For me the run takes significantly more memory if I do option B
from jwst.
My initial run was just using strun from a terminal command line. Took 12 minutes, peaked at 153 GB of python memory (61 GB physical), and when it finished RAM usage returned to the normal baseline.
Using your option A (i.e., running on a file) within a jupyter notebook it took 12 minutes, peaked at 153 GB (with a brief excursion to 172 GB that I may have missed before) and 61 GB physical. When it finished though physical RAM usage stayed high at 34 GB until I explicitly halted the notebook.
Using your option B (i.e, running on a datamodel) within a jupyter notebook it peaked at 182 GB used, and stalled out in the jump step. After 20 minutes in the 'two point difference' alone I killed it.
So, I'm also seeing a big difference in whether I pass in a file or a datamodel. Datamodel presumably required enough additional resources that it stalled out for me as well.
from jwst.
Comment by Tyler Pauly on JIRA:
It may be worth testing in an environment with the dev version of stpipe - some of the memory issues using file vs. datamodel may be linked to the log referencing bug found and fixed here: spacetelescope/stpipe#171
I believe a patch release of stpipe is planned, but I don't know its status.
from jwst.
Just repeated the test as suggested by Tyler Pauly with the dev version of stpipe.
Major memory difference. Option A ran in 11 minutes, maxing at 60 GB used by python vs the 172 GB it used to take. Option B maxed at 94 GB vs previous 182 GB, but still hung in the jump step, where I killed it after taking over 20 min in the 2-pt-difference section that took option A just 2.5 minutes to complete.
So the memory has improved, but running with a datamodel input is still hanging despite the memory usage being far less than what ran successfully in the original option A. Also still about 12 GB hanging around in memory until the kernel is restarted, even with option A.
from jwst.
Comment by Tyler Pauly on JIRA:
Opened a PR on the first steps toward reducing memory usage in jump - flamegraph shows a run of standalone jump step on a refpix output. More work to be done, but for this input file it's a quick 10GB reduction.
from jwst.
Comment by Maria Pena-Guerrero on JIRA:
for documentation purposes, I am putting the link to a Help Desk ticket related to their machine being killed at the jump step due to memory issues: https://stsci.service-now.com/nav_to.do?uri=%2Fincident.do%3Fsys_id%3Da878bdf483e14e102c4f1a226daad3f1%26sysparm_record_list%3Dassignment_group%3D8dbea402dbf2620033b55dd5ce9619ba%5Estate%3D2%5EORstate%3D3%5EORstate%3D1%5Ecompany%3D%5EORwatch_listCONTAINSa89f418ddb8f620042685434ce961990%5EORcompany%3Dc3478b5edb013e0042685434ce961951%5EORcaller_id%3Da89f418ddb8f620042685434ce961990%5EORopened_by%3Da89f418ddb8f620042685434ce961990%5EORDERBYstate%26sysparm_record_row%3D8%26sysparm_record_rows%3D47%26sysparm_record_target%3Dincident%26sysparm_view%3DJWST%26sysparm_view_forced%3Dtrue
from jwst.
Related Issues (20)
- numpy version 2.0.1 not compatible with jwst.pipeline.Detector1Pipeline HOT 1
- High memory usage for Detector1Pipeline HOT 4
- Memory Leaks in Detector1Pipeline HOT 7
- Clean up unnecessary copies HOT 4
- Emicorr NaNs data for low ngroups HOT 2
- Memory leak in OLS_C ramp fitting
- JWST to send calibration version to CRDS for context setting, under DR paradigm
- Train TAC model (Tso/Ami/Coron)
- Refactor model_blender to take in only necessary metadata
- NIRSpec TSO filenames may be missing the slitname HOT 2
- cibuildwheel fails for python 3.13
- Enable python 3.13 wheels when python 3.13 support is added HOT 1
- Remove nlow and nhigh parameters from outlier_detection
- OLS_C Ramp Fitting HOT 10
- outlier detection get_sections should set buffer size based on total available memory HOT 10
- add options to reduce memory usage in resample
- ramp_fit: crmag not implemented in OLS_C HOT 1
- Use group name instead of index for `SkyGroup` during skymatch
- Zero-filled AREA extensions loaded into SlitModel by datamodels open
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from jwst.