Skip to content
Snippets Groups Projects
Select Git revision
  • 3-create-a-pickling-script-to-parse-raw-text-into-pickled-pandas-dataframes-2
  • 8-build-out-a-post-processing-pipeline-to-generate-parquet-dataset-from-policy-run
  • feat-clean-notebook-commits
  • feat-improve-example-jobs
  • main default protected
  • prod protected
  • v0.5.1
  • v0.5.0
  • v0.4.2
  • v0.4.1
  • v0.4.0
  • v0.3.3
  • v0.3.2
  • v0.3.1
  • v0.3.0
  • v0.2.0
  • v0.1.1-1
  • v0.1.1
  • v0.1.0
19 results
You can move around the graph by using the arrow keys.
Created with Raphaël 2.2.02May29Apr26252423157Mar17Feb7531Jan282217151310228Dec271714129622Oct430Sep2418161413121131Aug302821201514230Jul262522May30Jun3Dec229Aug255Apr14Mar28FebPatch v0.5.1v0.5.1 mainv0.5.1 mainUpdate python image version and poetry versionv0.5.0v0.5.0Improve hive conversion functionalityFix bash math in convert-to-parquet batch scriptBugfixes in convert-to-parquetAdd random delay to nested sbatch submissionsFix no-clobber in convert-flat-to-hiveUpdate no-clobber for convert-to-parquetAdd --no-clobber to hive conversion toolsAdd pyarrow to dependenciesRemoving deps.ymlConvert to polars for backend computeAdd polars and cudf-polars packages for transition to polars backend pipelines.Improve help message response for CLIAdd caller defined worklist and lazy array invocationfeat-improve-ex…feat-improve-example-jobsFix typo in commentRemove noop echo on array task actionAdd listcmd param to split scriptAdd run order to example script file namesAdd __version__ file with automatic version bumpingChange partitioning name to fparqUpdated package build version and pyproject.tomlAdd log partitioning similar to fpartv0.4.2v0.4.2Bugfix: Added numpy datetime as valid type for run_dateAdd install instructions for Jupyter packagesAdd colormaps packagev0.4.1v0.4.1Modify churn structure to account for storage affectedAdd policy comparison pipelinev0.4.0v0.4.0Clean up repo for Jupyter notebooksClean up repo for Jupyter notebooksfeat-clean-note…feat-clean-notebook-commitsAdd example aggregation grouping by file size instead of ageAdd example notebook using dask for ad-hoc analysisAdd example notebook for full analysisv0.3.3v0.3.3Fix how list types are specified in type hints for type checkingAdd conversion of flat parquet structure to hiveReduce CLI time to action and fix convert-to-parquet help messageAdd correct pip hash and add dask_cudfv0.3.2v0.3.2Downgrade to CUDA 12.4v0.3.1v0.3.1Add installation instructionsUpdate README for new CLI
Loading