Open On-Demand provides a web gateway to cluster with support for terminal, desktop and file access via the browser. This feature is focused on instantiating an ood web gateway node for the provisioned ohpc cluster and making it the default interface for the cluster.
The initial work is to provide vagrant and ansible tooling to provision the node and integrate it with the Xsede basic cluster. Early work for this feature exists in a fork of the vagrantcluster codebase providing a foundation for getting vanilla ood running on a node via ansible.
This work effort will integrate this with the XCBC code base and associated cluster.
Edited
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Child items
...
Show closed items
Linked items
0
Link issues together to show that they're related.
Learn more.
The patch was hand edited to fix the offset of the role tree to the top level directory of the ansible project, vagrantcluster has the ansible repo embedded unlike our project which uses a submodule.
sed -e 's/ansiblerepo\///' < patch > patch-fix
Apply the modified patch to the CRI_XCBC project:
git am --whitespace=fix < ../../vagrantcluster/patch-fix
Building the ood node works with vagrant up ood but the slurm accounting configuration fails:
ood: TASK [ohpc_install : start and enable slurmdbd] ******************************** ood: changed: [ohpc] ood: ood: TASK [ohpc_install : load sacctmgr config] ************************************* ood: fatal: [ohpc]: FAILED! => {"changed": true, "cmd": ["sacctmgr", "-i", "load", "/etc/slurm/sacctmgr-heirarchy.cfg"], "delta": "0:00:00.057079", "end": "2018-09-28 13:21:56.033339", "msg": "non-zero return code", "rc": 1, "start": "2018-09-28 13:21:55.976260", "stderr": "sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message\nsacctmgr: error: slurmdbd: Sending PersistInit msg: No error\nsacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message\nsacctmgr: error: slurmdbd: Sending PersistInit msg: No error\nsacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message\nsacctmgr: error: slurmdbd: Sending PersistInit msg: No error\nsacctmgr: error: slurmdbd: DBD_GET_ACCOUNTS failure: No error\nsacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message\nsacctmgr: error: slurmdbd: Sending PersistInit msg: No error\nsacctmgr: error: slurmdbd: DBD_GET_QOS failure: No error\n Unknown option: QOS='normal'\n error: Problem with line(13)\n Problem with requests: Unspecified error", "stderr_lines": ["sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message", "sacctmgr: error: slurmdbd: Sending PersistInit msg: No error", "sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message", "sacctmgr: error: slurmdbd: Sending PersistInit msg: No error", "sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message", "sacctmgr: error: slurmdbd: Sending PersistInit msg: No error", "sacctmgr: error: slurmdbd: DBD_GET_ACCOUNTS failure: No error", "sacctmgr: error: slurm_persist_conn_open: Something happened with the receiving/processing of the persistent connection init message to ohpc:7031: Failed to unpack SLURM_PERSIST_INIT message", "sacctmgr: error: slurmdbd: Sending PersistInit msg: No error", "sacctmgr: error: slurmdbd: DBD_GET_QOS failure: No error", " Unknown option: QOS='normal'", " error: Problem with line(13)", " Problem with requests: Unspecified error"], "stdout": "", "stdout_lines": []} ood: to retry, use: --limit @/vagrant/CRI_XCBC/site.retry ood: ood: PLAY RECAP ********************************************************************* ood: ohpc : ok=33 changed=30 unreachable=0 failed=1 The SSH command responded with a non-zero exit status. Vagrantassumes that this means the command failed. The output for this commandshould be in the log above. Please read the output to determine whatwent wrong.