FY 2026 Project Planning
Triaging and prioritizing RCS projects for FY26
MUST >> WANT >> NICE
- TOP priority
- MUST haves (requirements)
-
GITLAB: Migrate data to GPFS5 gitlab#15
-
GITLAB: Update GitLab OS gitlab#12
-
GITLAB: Update GitLab version #689
-
RCS: Software license renewal and upgrade (patch) planning and reminders/automation #686, Blocked by #689
- Researcher-Facing software (MATLAB, SAS, STATA, ...)
- Internal software (OS, Slurm, ...)
-
STORAGE: Update Ceph version to fix LTS-Globus bug, or work with Globus to read object metadata header for hash #608
-
STORAGE: Ceph Block/LTS/FS (not GPFS) shared quotas and accounting
-
CHEAHA: Slurm accounting model and priority queueing to support hardware buy-in
-
CHEAHA: Update Lmod (EasyBuild 5 will require Lmod >=8)
-
CHEAHA: Update Slurm
-
REDCAP: Install Redcap OpenStack extension
-
CHEAHA:
-ntasks-per-socketon OOD apps for correct multi-GPU usage -
CHEAHA: Node hardware info in slurm.conf (features) rc-data-science/metrics/rc-hardware#2 #230
-
CHEAHA: Update OOD
-
CHEAHA: Deprecate Anaconda3 module in favor of conda-forge (license compliance)
-
COMMUNICATION: Email sending automation (no more Outlook) #674
-
STORAGE: Scratch policy implement GPFS5 cheaha#34
-
SECURITY: SSO web service alignment everywhere for MFA and security compliance
-
SECURITY: Get information from XIAS for security
- (for person X, what is sponsor BlazerID? is X expired? what is X site(s)? is site expired? is URI in site?)
-
SECURITY: Security (POAM) milestones (docs, procedures, etc)
-
SECURITY: HIPAA attestation
-
SECURITY: NIST 800-171 attestation other than NIH (compliant for NIH grants, DUAs go thru OSP, limited to RCS only)
-
SECURITY: Automated account state management (this is very broad) #669
-
XNAT: Update XNAT
-
RCS: Resource entitlement and eligibility tracking #637
-
ADMIN: Onboarding procedure(s)
-
- WANT to haves (higher priority)
- LTS: Unify individual allocation labels to match
blazerid@uab.eduformat. - CHEAHA: Windfall "ondemand" partition #673 #471
- CHEAHA: SAS Enterprise Guide OOD app cluster-software#139
- CHEAHA: Install most recent SAS
- CHEAHA: Install most recent STATA
- CHEAHA: Cheaha software builds via CICD #679
- CHEAHA: Prevent/auto-revert Cheaha TLD permission changes
- CHEAHA: Refine experience for TMP and LOCAL dirs on Cheaha nodes #678
- CHEAHA: Cheaha Slurm QoS, priority calculation, etc, review rc-slurm#61 cheaha#44
- CHEAHA: Refine old modules - ties into rebuild CICD #554 (closed) #566 #480 #460 #432 (closed)
- DOCS: Migrate docs to GitLab
- NETWORK: Network upgrade beyond 10G link to campus, beyond 40G to scidmz
- METRICS: Cheaha job-specific/process-specific observability #661
- METRICS: Data/Storage observability #661
- RCS: Services status page #627
- STORAGE: Globus Azure Blob Connector #685
- STORAGE: Separate LTS from Core Ceph
- STORAGE: Shared, read-only research data allocation (e.g. alphafold) #680 #654
- STORAGE: Standardize /data/project/ #625 #633 #162 (closed)
- SECURITY: Cheaha observability for security #661
- SECURITY: RCS Account Statemachine and RabbitMQ refactoring rabbitmq_agents#163
- SECURITY: RCS Account Database (user reg db) enhancements #393 #400 #82 #410 #141 #642
- SECURITY: Restrict access to info about users, groups, slurm jobs, etc. cheaha#57
- TRAINING: Researcher training initiatives
- UX: RCS main page
- UX: Expose user reg app content (writing/theme) to facilitation via GitLab account-app#17 #556
- UX: Expose OOD content (writing/theme) to facilitation via GitLab rc-ood-message#1 #556
- UX: Expose OpenStack Horizon content (writing/theme) to facilitation via GitLab #624 #556
- LTS: Unify individual allocation labels to match
- NICE to haves (lower priority)
- CHEAHA: Update Singularity #546 cluster-software#115
- CHEAHA: Resolve poor researcher UX around pip and conda behaving unexpectedly #485
- CHEAHA: Internal routing SSH key dedicated storage location
- DOCS: Docs AI chatbot courtesy of UAB Copilot
- DOCS: Docs observability and analytics
- METRICS: Cheaha observability for buy-in model #626 #471
- METRICS: Cloud.rc observability " #661
- METRICS: Network observability " #661
- METRICS: Buffered node exporter data locally for robustness
- FUNDING: Cheaha compute buy-in model and implementation #626 #637
- FUNDING: Cloud.rc buy-in model and implementation #672 #637
- FUNDING: Storage buy-in model and implementation (TiB vs TB is 10%, this matters for dollar costs!) #637
- STORAGE: Offsite data center backup solution and buy-in model (MUST if funded)
- STORAGE: Globus Azure Blob connector for backup and archive solution and buy-in model (MUST if funded)
- STORAGE: Project-parallel scratch directories #653
- STORAGE: Improve LTS Globus UX to handle multiple keys (individual, lab shared, core shared, ...) #684
- SECURITY: CMMC compliant enclave
- PRODUCTIVITY:
pastebinitcommand and self-hosted pastebin equivalent
- INFOrmational
- SECURITY: IDM Federation (what is our portion?)
Completed
-
STORAGE: Globus configured for GPFS5 (no issue) #694 (closed) -
STORAGE: New accounts created on GPFS5 -
GLOBUS: Understand and implement shared administration group and duties in Globus.