- Nov 06, 2024
-
-
Bo-Chun Chen authored
-
Bo-Chun Chen authored
-
Eesaan Atluri authored
Rename cheaha.yml playbook to cluster.yml. See merge request rc/hpc-factory!116
-
- Nov 04, 2024
-
-
Eesaan Atluri authored
Rename the playbook because cheaha is not the only cluster we deploy against, so give it a generic name.
-
Eesaan Atluri authored
Add a task to template DNS config. See merge request rc/hpc-factory!115
-
Eesaan Atluri authored
This lets you define, at runtime, variables whose values you may want to override. In this case, we want to override the DNS nameserver and search domain settings.
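A minimal sketch of the idea, assuming a template task that renders the resolver config from variables with defaults. The variable names (`dns_nameservers`, `dns_search_domain`), the template file name, the playbook name, and all values are hypothetical and not taken from the repository.

```yaml
# Illustrative play: defaults are defined as play vars and can be
# overridden at runtime with extra vars, which take precedence, e.g.:
#   ansible-playbook playbook.yml -e dns_search_domain=example.internal
- name: Template DNS config (illustrative)
  hosts: all
  become: true
  vars:
    dns_nameservers:          # hypothetical default
      - 192.0.2.53
    dns_search_domain: example.org   # hypothetical default
  tasks:
    - name: Template DNS config
      ansible.builtin.template:
        # resolv.conf.j2 (hypothetical) could contain:
        #   search {{ dns_search_domain }}
        #   {% for ns in dns_nameservers %}
        #   nameserver {{ ns }}
        #   {% endfor %}
        src: resolv.conf.j2
        dest: /etc/resolv.conf
        owner: root
        group: root
        mode: "0644"
```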
-
Eesaan Atluri authored
-
Bo-Chun Chen authored
Mount dirs besides adding them to fstab. See merge request rc/hpc-factory!114
-
Eesaan Atluri authored
Otherwise, because we run this role at deploy time, the dirs won't be mounted, which causes unexpected application behavior.
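One way to get both effects in a single task is the `ansible.posix.mount` module with `state: mounted`. This is a sketch under assumptions: the mount point and NFS export below are hypothetical, and the role in the repository may do this differently.

```yaml
- name: Add the dir to /etc/fstab and mount it immediately
  ansible.posix.mount:
    path: /data                                  # hypothetical mount point
    src: nfs-server.example.org:/export/data     # hypothetical NFS export
    fstype: nfs
    opts: defaults
    # "present" only writes the fstab entry;
    # "mounted" writes the entry and also mounts the filesystem now.
    state: mounted
```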
-
- Nov 01, 2024
-
-
Eesaan Atluri authored
Create multi homed proxy. See merge request rc/hpc-factory!111
-
Bo-Chun Chen authored
Removed dev_key to move the var to a GitLab pipeline variable. See merge request rc/hpc-factory!113
-
Krish Moodbidri authored
-
- Oct 31, 2024
-
-
Eesaan Atluri authored
Avoids the error that the IP has already been assigned when attempting multiple proxy deploys.
-
Eesaan Atluri authored
Creating the instance with the internal port attached caused login issues because the default gateway was set to the internal network gateway. That allows login only via the headnode, not from the host. This commit fixes the issue by providing a way to define the gateway IP for the default route.
-
Eesaan Atluri authored
We wanted to create a multi-homed instance connected to proxy-net and internal-net because the server shares the NFS mounts over internal-net. However, the NIC on internal-net wasn’t getting an IP assigned automatically, which caused problems for the NFS server communicating with the clients. So we created a port and assigned it during instance creation.
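The port-then-instance approach can be sketched with the `openstack.cloud` Ansible collection. This is an illustration only: the actual pipeline may use the OpenStack CLI or another mechanism, and the cloud name, port name, IP address, image, flavor, and key name below are all hypothetical. Only the network names proxy-net and internal-net come from the commit message.

```yaml
- name: Create a port on internal-net with a known IP
  openstack.cloud.port:
    cloud: mycloud                  # hypothetical clouds.yaml entry
    state: present
    name: proxy-internal-port       # hypothetical port name
    network: internal-net
    fixed_ips:
      - ip_address: 192.0.2.10      # hypothetical address

- name: Create the multi-homed instance using that port
  openstack.cloud.server:
    cloud: mycloud
    state: present
    name: proxy-node                # hypothetical instance name
    image: proxy-image              # hypothetical image
    flavor: m1.medium               # hypothetical flavor
    key_name: deploy-key            # hypothetical keypair
    nics:
      - net-name: proxy-net                 # first NIC: attach by network
      - port-name: proxy-internal-port      # second NIC: the pre-created port
```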
-
Bo-Chun Chen authored
Change path to avoid ansible log deletion. See merge request rc/hpc-factory!109
-
- Oct 30, 2024
-
-
Eesaan Atluri authored
-
Bo-Chun Chen authored
feat: Run clusterhooks at deploy time for proxies. See merge request rc/hpc-factory!108
-
Eesaan Atluri authored
-
Eesaan Atluri authored
We provide the cluster hooks we need at deploy time rather than baking them in at build time. This lets us build once and deploy across multiple clusters. Otherwise, we would need to build a new image whenever we deploy to a different cluster.
-
- Oct 25, 2024
-
-
Eesaan Atluri authored
Feat ci proxy builds. See merge request rc/packer-openstack-hpc-image!103
-
Eesaan Atluri authored
-
Eesaan Atluri authored
-
Eesaan Atluri authored
-
Eesaan Atluri authored
-
Eesaan Atluri authored
Fix yum install failures by temporarily correcting the baseurl via shell provisioners until we upgrade the base OS.
-
- Oct 24, 2024
-
-
Eesaan Atluri authored
-
Eesaan Atluri authored
-
- Oct 23, 2024
-
-
Bo-Chun Chen authored
Modify roles in cheaha.yml playbook to enable a standalone proxy. See merge request rc/packer-openstack-hpc-image!90
-
Bo-Chun Chen authored
-
Bo-Chun Chen authored
-
Bo-Chun Chen authored
Add packer template for proxy build. See merge request rc/packer-openstack-hpc-image!95
-
Eesaan Atluri authored
-
- Oct 22, 2024
-
-
Eesaan Atluri authored
-
Eesaan Atluri authored
-
Eesaan Atluri authored
Enable TLS only if the value of the ldap_uri var contains the substring https.
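A substring check like this is commonly written as a `when` condition. The sketch below assumes `ldap_uri` is already defined for the host; the task itself is a placeholder standing in for whatever TLS-related tasks the role actually runs.

```yaml
- name: Run TLS-specific configuration only when ldap_uri contains "https"
  ansible.builtin.debug:            # placeholder for the real TLS task(s)
    msg: "TLS would be enabled for {{ ldap_uri }}"
  when: "'https' in ldap_uri"
```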
-
Eesaan Atluri authored
A proxy node need not be a slurm client, so we don't run this role.
-
Eesaan Atluri authored
We changed it to "all" because that's the convention we follow. When running the ansible playbook, you need to limit the hosts with -l so that it doesn't run on all hosts. Some background on why it was set to default in the first place: `hosts: default` is used as a placeholder during development to indicate where actual host groups will later be defined. You can define specific host groups in the inventory file, with names that are more relevant to your infrastructure, to replace the "default" placeholder.
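For illustration, a play header following this convention might look like the sketch below. The play name, role name, inventory path, and the `proxy` group used in the example command are placeholders, not taken from the repository.

```yaml
# Illustrative play header: the play targets every inventory host,
# so each run is narrowed with -l (--limit), for example:
#   ansible-playbook -i inventory cheaha.yml -l proxy
- name: Configure nodes
  hosts: all
  become: true
  roles:
    - example_role   # placeholder role name
```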
-
Eesaan Atluri authored
We parameterized a few tasks in cheaha.yml and put the vars in group_vars/all so that the existing knightly builds won't break. We added the values required for the CoD deploy in group_vars/proxy. However, if we put the proxy nodes in the proxy group, they inherit values meant for the CoD deploy, which is not desired in production. We later realized that the appropriate file is group_vars/knightly, which gives us the flexibility to put proxy nodes in the knightly group to override the vars meant for CoD in group_vars/all.
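The override relies on Ansible's group variable precedence: values in a child group's file take priority over those in group_vars/all. A minimal inventory sketch, assuming a YAML inventory and a hypothetical host name and variable, could look like this:

```yaml
# inventory.yml (hypothetical): the proxy node is a member of the
# knightly group, so group_vars/knightly overrides group_vars/all for it.
#
#   group_vars/all       ->  example_var: value-for-cod        (hypothetical)
#   group_vars/knightly  ->  example_var: value-for-knightly   (hypothetical)
all:
  children:
    knightly:
      hosts:
        proxy-node-1:   # hypothetical host name
```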
-
- Oct 21, 2024
-
-
Eesaan Atluri authored
Change the default timeout from 10 secs to 60 secs to accommodate the delay in the slurm pkg yum install tasks after running the ldap_config role. Refer to #125.
-