Draft: Partition parquet dataset for sync with s5cmd
Compare changes
- Matthew K Defenderfer authored
+ 23
− 0
@@ -61,6 +61,29 @@ The ouput file is an unsorted list of files in uncompressed ASCII. Further proc
Some automated tools in this repository use a container for processing to simplify environment management. The container itself is built using the spec in ./Dockerfile and available at `gitlab.rc.uab.edu:4567/<group>/gpfs-policy`. The shell scripts will automatically download this container if it is not detected in the working directory.