Skip to content
Snippets Groups Projects
Commit 22776c4d authored by John-Paul Robinson's avatar John-Paul Robinson
Browse files

Notebook to convert policy run output to parquet data sets

This is intended to be run on URL encoded output lines from a
gpfs list policy run.  It creates panda structures that are
then saved as parquet format for ease of downstream processing.

Can be run in parallel across many inputs by wrapping with papermill
and have upstream split the input file.
parent b3a99478
No related branches found
No related tags found
No related merge requests found
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment