Data Organization in Spreadsheets

Spreadsheets are widely used software tools for data entry, storage, analysis, and visualization. Focusing on the data entry and storage aspects, this article offers practical recommendations for organizing spreadsheet data to reduce errors and ease later analyses. The basic principles are: be consistent, write dates like YYYY-MM-DD, do not leave any cells empty, put just one thing in a cell, organize the data as a single rectangle (with subjects as rows and variables as columns, and with a single header row), create a data dictionary, do not include calculations in the raw data files, do not use font color or highlighting as data, choose good names for things, make backups, use data validation to avoid data entry errors, and save the data in plain text files.
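
As a rough illustration of a few of these principles (one rectangle with a single header row, dates written as YYYY-MM-DD, no empty cells, and plain text storage), the following minimal Python sketch checks some rows and writes them as CSV. The column names, the sample records, the use of "NA" for a missing value, and the helper names check_row and to_csv are all assumptions made for this example; they are not taken from the article.

```python
import csv
import io
import re
from datetime import date

# Hypothetical layout: one subject per row, one variable per column,
# and a single header row.
HEADER = ["subject_id", "visit_date", "weight_kg"]
ROWS = [
    ["S001", "2021-04-06", "71.5"],
    ["S002", "2021-04-07", "NA"],  # missing value written explicitly, not left blank
]

DATE_COLUMN = HEADER.index("visit_date")
ISO_DATE = re.compile(r"\d{4}-\d{2}-\d{2}")


def check_row(row, n_columns):
    """Return a list of problems found in one data row (empty if none)."""
    problems = []
    if len(row) != n_columns:
        problems.append("wrong number of cells")
        return problems
    if any(cell.strip() == "" for cell in row):
        problems.append("empty cell")
    value = row[DATE_COLUMN]
    try:
        # The pattern enforces the YYYY-MM-DD shape; fromisoformat
        # rejects impossible calendar dates such as 2021-02-30.
        if not ISO_DATE.fullmatch(value):
            raise ValueError(value)
        date.fromisoformat(value)
    except ValueError:
        problems.append("date is not YYYY-MM-DD")
    return problems


def to_csv(header, rows):
    """Serialize the rectangle as plain-text CSV and return it as a string."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()
```

Calling check_row on each row before calling to_csv gives a simple form of data validation at entry time; a row such as ["S003", "4/6/2021", ""] would be reported as having both an empty cell and a non-conforming date.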

Additional Info
Field         Value
Competence    Not Available
Country       United States of America (the) - USA
Creator       Woo, Kara H, orcid.org/0000-0002-5125-4188
Creator       Broman, Karl W, orcid.org/0000-0002-4914-6671
Domain        Discipline agnostic
Language      eng, English
Level         Basic
Skill         Integrate and analyze
Skill         Capture and process
Target        Data librarian or institutional level data steward
Target        Data steward
Target        Researcher
system:type   Guidelines
Management Info
Field         Value
Author        Oset Paula
Maintainer    Oset Paula
Version       1
Last Updated  6 April 2021, 11:52 (CEST)
Created       6 April 2021, 11:49 (CEST)