-`outdir`: specifies the directory the output log should be saved to. Defaults to `/data/rc/gpfs-policy/data`
...
@@ -125,6 +125,10 @@ All other options control the array job resources. Default values are as follows
The default resources can parse a 5-million-line file in approximately 3 minutes, so they should cover all common use cases.
For all policies run on filesets in `/data/user`, `/data/project`, `/home`, or `/scratch`, each file automatically has its "top-level directory" (`tld`) computed and added to the parquet output. The `tld` is defined as the directory immediately under whichever of those filesets contains the file. For example, a file with path `/data/project/datascienceteam/example.txt` will have `tld` set to `datascienceteam`.
Any files in a directory outside those specified filesets will have `tld` set to `None`.
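
The rule above can be illustrated with a minimal sketch. This is not the package's actual implementation: the function name `extract_tld` and the regular expression are hypothetical, chosen only to mirror the behavior described. It also assumes that the first path component under a fileset is always treated as the `tld`, which the text does not specify for files sitting directly in a fileset root.

```python
import re
from typing import Optional

# Filesets named in the text above. The regex is an illustrative assumption,
# not the pattern the package necessarily uses.
_TLD_PATTERN = re.compile(r"^/(?:data/user|data/project|home|scratch)/([^/]+)")


def extract_tld(path: str) -> Optional[str]:
    """Return the first path component under a known fileset, else None."""
    match = _TLD_PATTERN.match(path)
    return match.group(1) if match else None


print(extract_tld("/data/project/datascienceteam/example.txt"))  # datascienceteam
print(extract_tld("/tmp/example.txt"))  # None
```

Because the pattern requires a `/` after the fileset name, a path such as `/homework/notes.txt` is not mistaken for something under `/home`.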