Commits on Source (100)
-
Bo-Chun Chen authoredd568a476
-
Bo-Chun Chen authorede72c5585
-
Bo-Chun Chen authored
Remove base compute and gpu build from pipeline Closes #127 See merge request !144
-
Bo-Chun Chen authoredacc62573
-
Bo-Chun Chen authoredbf72c6c6
-
Bo-Chun Chen authored0a084d03
-
Eesaan Atluri authored
Closes #175
c7a17a89 -
Eesaan Atluri authored
Add conditional to support both autofs, fstab mnts See merge request !160
ef7b46e3 -
Bo-Chun Chen authored
Feat login build and deploy See merge request !158
8e750e45 -
Eesaan Atluri authored
Closes #176
fadfb001 -
Eesaan Atluri authored2b70839e
-
Eesaan Atluri authored
feat: Download ansible files for cluster.yml See merge request !161
d862e711 -
Eesaan Atluri authored
Closes #173
96e0db22 -
Eesaan Atluri authored
The values used in prod will be defined at runtime so replacing them with default values
b2495d56 -
Eesaan Atluri authored
So that we can have same key names for both dicts used in autofs.yml and fstab.yml roles
96764f6a -
Eesaan Atluri authored
feat: Add params to define the autofs mountpoints See merge request rc/hpc-factory!159
518c89e5 -
Eesaan Atluri authored
Change the references to compute.yml in the packer template to use login.yml
180e8eab -
Eesaan Atluri authored
pam_slurm config denies ssh if there is no slurm job on the compute node A login node is not expected to run jobs so we removed this config from the login playbook
10f7c135 -
Eesaan Atluri authored
fix ssh failures due to plam_slurm in login VM See merge request !162
2e57b1b3 -
Bo-Chun Chen authored
Feat http proxy rewrite map See merge request rc/hpc-factory!155
e6691e23 -
Bo-Chun Chen authored
Follow the same approach in ssl_cert role, to avoid missing host key issue during deploy
0ec2bcda -
Bo-Chun Chen authored
fix: add when condition to ssh_host_keys role See merge request !163
77b42c12 -
Bo-Chun Chen authored6019778c
-
Bo-Chun Chen authored
fix: use owner instead of user in template ansible module See merge request rc/hpc-factory!164
c6251154 -
Eesaan Atluri authored68ac825f
-
Bo-Chun Chen authored
Add missing autofs service start See merge request rc/hpc-factory!165
a160a27a -
Eesaan Atluri authored
The home dir for slurm user is available as a mount from master so there is no need to create a local home dir for it.
ba397c1c -
Eesaan Atluri authored
fix: Skip creating home dir for slurm user See merge request !166
ac29100e -
Bo-Chun Chen authoredd5f0f8e8
-
Bo-Chun Chen authoredfd96d27a
-
Krish Moodbidri authored
We lookup the value of the placeholder (RewriteCond %{HTTP:REMOTE_USER} \’^(.+)$\’) in this case in the apache conf during build and replace it with the rewrite conditions during deploy phase.
ecbd2287 -
Krish Moodbidri authored
previously it reflected a placeholder but it actually is the value in file from build that we're looking for and replacing in deploy phase
b8f68f37 -
Krish Moodbidri authored4deca313
-
Krish Moodbidri authored
Update Apache configuration for OOD rewrite rules during deploy phase See merge request !170
7dcd7826 -
Bo-Chun Chen authorede1367527
-
Bo-Chun Chen authored
Restart httpd service after update See merge request rc/hpc-factory!172
074603f1 -
Eesaan Atluri authored17f0d7c5
-
Eesaan Atluri authored
to avoid error when a value is not provided for root_ssh_key
d78a2612 -
Eesaan Atluri authored
We moved cheaha.yml playbook to run during deploy time
6a74a89c -
Eesaan Atluri authored93b58dd8
-
Eesaan Atluri authoredffb9f238
-
Bo-Chun Chen authored056ba88e
-
Bo-Chun Chen authored
feat: update account app port See merge request !174
91c02c78 -
Eesaan Atluri authored
Download group_vars/knightly or group_vars/prod for knightly or prod environments respectively. Note: For all other environments like dev or staging just use the default values from group_vars/all
2a76eae3 -
Eesaan Atluri authored
Closes #203
0685dcb8 -
Krish Moodbidri authored
Created symlinks for `slurm.conf`, `gres.conf`, `slurmdbd.conf`, and `cgroup.conf` in `/etc/slurm`. To ensure consistency with the current production environment where these symlinks already exist.
822bb0f2 -
Bo-Chun Chen authored
Feat build extra vars See merge request rc/hpc-factory!168
7974f735 -
Krish Moodbidri authored637be235
-
Krish Moodbidri authored2d919ff8
-
Bo-Chun Chen authored
Add symbolic links for Slurm config files See merge request !177
b08acdc7 -
Eesaan Atluri authored28a835da
-
Eesaan Atluri authored
Adds fail2ban filter and jail configs
a9dcf73c -
Eesaan Atluri authored0c0cadb4
-
Eesaan Atluri authoredf8785f2f
-
Eesaan Atluri authored16ba256a
-
Eesaan Atluri authored
Install and configure fail2ban See merge request !178
bb581388 -
Bo-Chun Chen authored1b9577e9
-
Bo-Chun Chen authored
feat: disable fail2ban by default See merge request rc/hpc-factory!180
4c9faf4a -
Krish Moodbidri authored
Changed the 'enable_rsyslog_config' variable from false to true This should be the default behavior for all nodes unless explicitly turned off by the developer.
420230b4 -
Bo-Chun Chen authored6c105645
-
Bo-Chun Chen authored38b7fd3d
-
Bo-Chun Chen authored982be05a
-
Bo-Chun Chen authored
Add node exporter See merge request rc/hpc-factory!184
fd1f8799 -
Bo-Chun Chen authored
feat: Add a way to define multiple security groups See merge request !176
bc41900b -
Bo-Chun Chen authoreda96a7c84
-
Bo-Chun Chen authored
fix node exporter with firewalld enabled See merge request rc/hpc-factory!186
cb685673 -
Bo-Chun Chen authorede7ae10b8
-
Bo-Chun Chen authored
fix: restart firewalld after change See merge request !187
55cae529 -
Bo-Chun Chen authored815ced55
-
Bo-Chun Chen authored
fix: restart firewalld only when it was running See merge request !188
6e2052c5 -
Bo-Chun Chen authored
Enable rsyslog configuration by default See merge request rc/hpc-factory!183
cd35517c -
Eesaan Atluri authored
Closes rc/hpc-factory#181 Defining a network can be optional when a port is already defined when deploying a VM. This commit adds flexibility where you can either use --port or --network options to define a network during the deployment. Alternatively, you can define both but the port cannot be from the same network you used in the --network option. Otherwise, you will have two ips from the same network which will cause network reachability issues.
09717c19 -
Bo-Chun Chen authored
Resolve SSH proxy routing hang in prod by fixing dual IP assignment on same network. See merge request !175
980542a8 -
Bo-Chun Chen authoredb94d83eb
-
Bo-Chun Chen authored
We do not run merge request pipeline, so no need to check for it
c24c138c -
Bo-Chun Chen authored
# Conflicts: # openstack-ood/variables.pkr.hcl
86371b39 -
Bo-Chun Chen authored
ci: Add OOD image build job to gitlab CI See merge request rc/hpc-factory!171
bf57546d -
Bo-Chun Chen authored
Since build env is setup for building images, it makes more sense to have all build jobs in that environment.
97e1f7f5 -
Bo-Chun Chen authored
refactor: only run build pipeline in build environment See merge request rc/hpc-factory!189
d3c2ce4c -
Bo-Chun Chen authoredc6391491
-
Bo-Chun Chen authored2e966d7b
-
Bo-Chun Chen authored
fix: bash syntax error See merge request !190
14ab37ec -
Bo-Chun Chen authored16a9c164
-
Bo-Chun Chen authored
fix: use ENV variable in ansible var url See merge request rc/hpc-factory!191
d9e1c429 -
805aef56
-
Krish Moodbidri authoreddf944616
-
Bo-Chun Chen authored
Use BUILD_TAG for image build if set, else fallback to BUILD_DATE See merge request rc/hpc-factory!192
86f65c2a -
Bo-Chun Chen authored
This MR merges the feat-hpc-factory branch, which was focused on building and deploying proxy images for the COD cluster. Closes #181, #203, #173, #176, and #175 See merge request rc/hpc-factory!117
d7525ee4 -
Eesaan Atluri authored7d7acfce
-
Bo-Chun Chen authored
Feat deploy ood See merge request rc/hpc-factory!193
-
Eesaan Atluri authored190fe18e
-
Eesaan Atluri authored
Feat gl cicd var scripts Closes #232 See merge request !196
-
Eesaan Atluri authored1302836f
-
Eesaan Atluri authored
fix: openldap-servers pkg install failure Closes #237 See merge request rc/hpc-factory!199
-
Eesaan Atluri authored100dc7c8
-
Eesaan Atluri authored
Add missing pasword auth config for catchall in sshpiper Closes #247 See merge request rc/hpc-factory!202
-
Dylan Wheeler authored
feat: update group_vars to have variable for centos_repos and update the related ansible playbook to pull from that variable
cd4a3038 -
Bo-Chun Chen authored
FEAT: paramaterize centos repos See merge request !205
-
Eesaan Atluri authored
* Detach the floating ips to assign to new instance * Detach internal port to assign it to new instance * Install NetworkManager required during deploy NetworkManager is restarted during cloud-init final module but it failsbecause the package was not available. So install it during build to make it available during deploy
2c20aada -
Eesaan Atluri authored
Automate ood knightly build and deploy in a single pipeline Closes #231 See merge request !197
Showing
- .gitlab-ci.yml 131 additions, 12 deletions.gitlab-ci.yml
- ansible/cluster.yml 3 additions, 0 deletionsansible/cluster.yml
- ansible/group_vars/all 32 additions, 2 deletionsansible/group_vars/all
- ansible/login.yml 0 additions, 1 deletionansible/login.yml
- ansible/ood.yml 0 additions, 3 deletionsansible/ood.yml
- ansible/roles/fail2ban/tasks/main.yml 46 additions, 0 deletionsansible/roles/fail2ban/tasks/main.yml
- ansible/roles/fail2ban/templates/jail.local.j2 7 additions, 0 deletionsansible/roles/fail2ban/templates/jail.local.j2
- ansible/roles/fail2ban/templates/sshpiperd_filter.local.j2 22 additions, 0 deletionsansible/roles/fail2ban/templates/sshpiperd_filter.local.j2
- ansible/roles/fail2ban/templates/sshpiperd_jail.local.j2 9 additions, 0 deletionsansible/roles/fail2ban/templates/sshpiperd_jail.local.j2
- ansible/roles/fix_centos_repo/tasks/main.yaml 1 addition, 1 deletionansible/roles/fix_centos_repo/tasks/main.yaml
- ansible/roles/install_node_exporter/tasks/main.yaml 82 additions, 0 deletionsansible/roles/install_node_exporter/tasks/main.yaml
- ansible/roles/install_node_exporter/templates/node_exporter.service.j2 12 additions, 0 deletions.../install_node_exporter/templates/node_exporter.service.j2
- ansible/roles/ldap_config/tasks/main.yml 0 additions, 1 deletionansible/roles/ldap_config/tasks/main.yml
- ansible/roles/nfs_mounts/tasks/autofs.yml 15 additions, 20 deletionsansible/roles/nfs_mounts/tasks/autofs.yml
- ansible/roles/nfs_mounts/tasks/main.yml 1 addition, 1 deletionansible/roles/nfs_mounts/tasks/main.yml
- ansible/roles/rewrite_map/tasks/main.yaml 28 additions, 0 deletionsansible/roles/rewrite_map/tasks/main.yaml
- ansible/roles/rewrite_map/templates/rewrite_map_config_py.j2 11 additions, 0 deletionsansible/roles/rewrite_map/templates/rewrite_map_config_py.j2
- ansible/roles/slurm_client/tasks/main.yml 14 additions, 0 deletionsansible/roles/slurm_client/tasks/main.yml
- ansible/roles/ssh_host_keys/tasks/main.yml 2 additions, 0 deletionsansible/roles/ssh_host_keys/tasks/main.yml
- ansible/roles/ssh_proxy_config/tasks/main.yml 0 additions, 42 deletionsansible/roles/ssh_proxy_config/tasks/main.yml
ansible/roles/fail2ban/tasks/main.yml
0 → 100644
ansible/roles/rewrite_map/tasks/main.yaml
0 → 100644