"raw_data" vs "clean_data" folders in project structure

https://phac-modelling-hub.github.io/dev-practices/projects/consistency.html

In general your advice mirrors the systems I've used. However, when structuring projects, for scientific endeavors I find it useful to have a folder for the raw / original data, and then a separate folder with cleaned data. I often have one or more scripts that programmatically modify or clean the data, and it's really helpful to have a clearly identifiable folder (for me `raw_data/`) for unaltered data files, and a separate one for the cleaned data files that are ready for analysis (for me `cleaned_data/`).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"raw_data" vs "clean_data" folders in project structure #9

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

"raw_data" vs "clean_data" folders in project structure #9

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions