Skip to content
Snippets Groups Projects
Commit 5cc47f5f authored by Matthew K Defenderfer's avatar Matthew K Defenderfer
Browse files

add info on tld

parent 51b788ba
No related branches found
No related tags found
1 merge request!15Update main to v0.1.1
...@@ -24,10 +24,10 @@ Note: The command is aligned to run on specific nodes by way of arguments to mma ...@@ -24,10 +24,10 @@ Note: The command is aligned to run on specific nodes by way of arguments to mma
A list policy can be executed using `run-submit-pol-job.py` using the following command: A list policy can be executed using `run-submit-pol-job.py` using the following command:
``` bash ``` bash
run-submit-pol-job.py [-h] [-o OUTDIR] [-f LOG_PREFIX] [--with-dirs] sudo run-submit-pol-job.py [-h] [-o OUTDIR] [-f LOG_PREFIX] [--with-dirs]
[-N NODES] [-c CORES] [-p PARTITION] [-t TIME] [-N NODES] [-c CORES] [-p PARTITION] [-t TIME]
[-m MEM_PER_CPU] [-m MEM_PER_CPU]
device device
``` ```
- `outdir`: specifies the directory the output log should be saved to. Defaults to `/data/rc/gpfs-policy/data` - `outdir`: specifies the directory the output log should be saved to. Defaults to `/data/rc/gpfs-policy/data`
...@@ -125,6 +125,10 @@ All other options control the array job resources. Default values are as follows ...@@ -125,6 +125,10 @@ All other options control the array job resources. Default values are as follows
The default resources can parse 5 million line files in approximately 3 minutes so should cover all common use cases. The default resources can parse 5 million line files in approximately 3 minutes so should cover all common use cases.
For all policies run on filesets in `/data/user`, `/data/project`, `/home`, or `/scratch` will automatically have their "top-level directory" (`tld`) computed and added to the parquet output. This is defined as the directory just under any of those specified filesets. For example, a file with path `/data/project/datascienceteam/example.txt` will have `tld` set to `datascienceteam`.
Any files in a directory outside those specified filesets will have `tld` set to `None`.
## Running reports ## Running reports
### Disk usage by top level directies ### Disk usage by top level directies
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment