Replies: 3 comments 1 reply
-
|
Databases people would normally say that you don't need to "store" what can be calculated, but maybe the strongest argument here is that we need to provide as much transparency as possible. One possible compromise would be to store raw data, only for a specific period of time, and discard it afterwards. |
Beta Was this translation helpful? Give feedback.
-
|
For traceability and reproducibility we should find ways to store all raw data, but we can probably outsource e.g. to zenodo? |
Beta Was this translation helpful? Give feedback.
-
|
Combining your arguments I would suggest to use a self-describing container format like https://www.researchobject.org/ro-crate/ that containes the processed data and optionally the raw data or at least references (URLs) to it. Storing it on Zenodo beside archived and versioned source code repos would provide reproducibility and long-term availability at no/low costs. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
There are at least three levels:
Arguments for storing raw data:
Arguments against requirements to store raw data:
Beta Was this translation helpful? Give feedback.
All reactions