hpc / hpc-collab Goto Github PK
View Code? Open in Web Editor NEWThis project provides provisioned HPC cluster models using underlying virtualization mechanisms.
License: Other
This project provides provisioned HPC cluster models using underlying virtualization mechanisms.
License: Other
in its most basic form, an empty OverlayFS would only include patches
ip command needs to be loaded vi brew on MacOS X, for example
Alternatives:
Templates of interest:
This issue arose from a discussion regarding the USRC Resilience team's requirements for fault detection and signature generation.
It may be a justification for a lighterweight underlying provider, such as libvirt/ovirt or docker/container.
This would allow node configuration changes to be tested in a CI/CD pipeline, self-validating an hpc-collab node recipe & configuration change.
Another alternative: , returning node back to an earlier full state
Perhaps an underlying vagrant snapshot/suspend mechanism could be utilized, provided:
there are sufficient hooks to trigger a mini-provision, consisting mostly of verification, and
there is a mechanism to do multi-machine snapshot/resume in the preserving dependencies.
Alternatives could be:
Issue #2, Issue #22, Issue #52
This issue arose from the shared system (muon) deployment.
This request came about in preparation for a student joining the project next week (18 May)
Current process is as follows:
up
see also issue #20
provisioning extension that just instantiates this hierarchy for the invoking user dynamically ("make clean" removes)
This issue arose during the muon (USRC) deployment.
nodename aliases vary depending upon whether the node has been provisioned correctly. If the node is not fully provisioned, the alias maps to 'make '. If the node is fully provisioned, the alias maps to 'ssh -o UserKnownHostsFile=/dev/null '
The ideal provider would be able to be run within an existing HPC job allocation, on an existing, unmodified HPC cluster.
Use case: on-going in-pipeline validation of upcoming, scheduled HPC changes
complexity when this is (mostly) preformed in the requires/* stage, before most services DNS, NFS
This issue arose out of the muon (USRC) deployment.
record machine's GUID into the state file
If the GUID doesn't exist, invalidate that state file
If a machine with that name and GUID exists, report EX_ALREADY or EX_OK (context defines)
If a machine with that name and a different GUID exists, EX_CANTCREAT, EX_TEMPFAIL, EX_ALREADY
implies deployment of an in-cluster configuration management system, such as ansible
interim step until node (re)construction is truly lightweight
See also issue #52
This should improve rpm loading timing, therefore reduce node rebuild time.
Needs to leave a usable hierarchy whether or not local repos, remote repos are flagged.
possibly combine cfg//rpms/early/... into a common (cluster-external) early-localrepo.repo
non-relocatable paths in Vagrantfile, generating errors such as the following if 'vagrant' is not invoked from the directory where Vagrantfile is located:
% vagrant ssh vcfs
nodes_file_path:/home/sts/hpc-collab/clusters/vc/cfg/vcfs/verify/firewall/Vagrantfile.firewall.nodes does not exist
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.