Datasets editing enhancements
Esha Joshi
Current Limitation
The Datasets platform currently lacks basic editing capabilities for samplesheets (TSV/CSV), requiring users to modify files externally when working with SRA Explorer imports or other data sources.
Requested Features
- Column Removal: Allow users to drop unwanted columns from datasets/samplesheets. For example, this is essential for cleaning up SRA Explorer imports and making them compatible with pipelines (e.g. nf-core/rnaseq).
- Header Name Editing: Enable editing of column header names. Ties into the earlier point, will make it much easier to adapt samplesheets to be compatible with pipelines.
- Manual Sample Entry: Add ability to manually enter data via free text. This is useful for adding individual SRA samples or IDs without having to generate a text file externally then import it (e.g. "SAMN1000000" as a sample ID used in nf-core/fetchngs).
These features will significantly reduce the need for external file editing and streamline the workflow preparation process, especially when working with SRA Explorer data. For better provenance, all edits could automatically create new versions in the dataset's history, maintaining a clear record of modifications and enabling users to revert changes if needed.