Skip to content

Conversation

simleo
Copy link
Collaborator

@simleo simleo commented Jul 5, 2022

Adds support for workflow runs where Directory was used as an input or output parameter.

CWLProv does not keep track of directories as file system entries, but rather as collections of files; more specifically, as dictionaries where values are the file entities and keys are their IDs. This is only reflected in the provenance metadata, while actual files under data/ are all stored at the same level: data/XY, where XY are the first two characters of the file's SHA1 checksum in hex format. Since RO-Crate's Datasets map to file system directories, for each dir entry the converter creates an actual directory in the crate, with name equal to the checksum of the checksums of all files in the directory.

@simleo simleo merged commit c6d41cc into ResearchObject:main Jul 5, 2022
@simleo simleo deleted the dir_io branch July 5, 2022 12:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant