Automate conversion of GPFS policy outputs to parquet without Jupyter
Compare changes
Files
2- Matthew K Defenderfer authored
File renamed with no changes. Show file contents
Created a set of scripts to parse our standard GPFS policy outputs and save them as a parquet dataset without needing a Jupyter notebook. Iterated off of parquet-list-policy-data.ipynb
.
tld
). Can parse from /data/user
, /data/user/home
, /data/project
, and /scratch
.
tld
as the index within each parquet file for faster aggregation laterlog_dir/parquet
but can be specified elsewheredaskdev/dask:2024.8.0-py3.12
) but is variable
run-convert-to-parquet.sh