Giter Club home page Giter Club logo

Comments (7)

chengzhuzhang avatar chengzhuzhang commented on August 12, 2024

I'm wondering it might not be an e3sm-unified problem. I just tried e3sm-diags from e3sm-unified through interactive jobs on haswell. It ran well. However knl has been problematic, it's a known issue: E3SM-Project/e3sm_diags#314. @wagmanbe would you try it again on haswell? If it still gives trouble, could you share your run script and I will try reproduce.

from e3sm-unified.

xylar avatar xylar commented on August 12, 2024

Are you not seeing these problems on knl when you use an E3SM_Diags development environment? I have always found python packages to run slowly on knl, so I would be surprised if this is specific to E3SM-Unified but can investigate if it appears to be. But I agree that haswell is the recommended option for all python codes.

from e3sm-unified.

wagmanbe avatar wagmanbe commented on August 12, 2024

It's affecting both knl and haswell. Maybe it's a NERSC issue?
salloc --nodes=1 --partition=debug --time=00:20:00 -C haswell
source /global/cfs/cdirs/e3sm/software/anaconda_envs/load_latest_e3sm_unified.sh <-- Hangs for minutes.
python <--slow
import os
from acme_diags.parameter.core_parameter import CoreParameter <--hangs for minutes.

from e3sm-unified.

darincomeau avatar darincomeau commented on August 12, 2024

NERSC was having problems yesterday afternoon/evening with very slow compute node performance that a few of us experienced, and was posted on their status page: https://www.nersc.gov/live-status/motd/
There's no notice now, so I'd recommend trying again.

from e3sm-unified.

wagmanbe avatar wagmanbe commented on August 12, 2024

Thank you, but this problem is occurring just the same today.

from e3sm-unified.

chengzhuzhang avatar chengzhuzhang commented on August 12, 2024

In this case, I suspect that the compute node problem is still there. I tried similar commands as below yesterday afternoon and got the same behavior. But tried again much later yesterday, everything looked fine...

It's affecting both knl and haswell. Maybe it's a NERSC issue?
salloc --nodes=1 --partition=debug --time=00:20:00 -C haswell
source /global/cfs/cdirs/e3sm/software/anaconda_envs/load_latest_e3sm_unified.sh <-- Hangs for minutes.
python <--slow
import os
from acme_diags.parameter.core_parameter import CoreParameter <--hangs for minutes.

from e3sm-unified.

wagmanbe avatar wagmanbe commented on August 12, 2024

It's at least 10x faster this afternoon.

from e3sm-unified.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.