- make it a rectangle
- rows = observations, columns = variables
- one header row; avoid spaces
- one thing per cell
- fill in all cells
- consistent missing value codes
- care about dates (3 separate columns?)
- no calculations in raw data files
- save as CSV
- don't use font color or highlighting as data