dimension = 1467 x 2
Number of samples = 163
Our project is based on dataset GEOquery [GSE54514]
And this paper
The raw data:
dimension = 1467 x 2
Number of samples = 163
But a tidy data set should follow these rules:
Every column contains exactly one variable.
Every row represents exactly one experimental unit (sample).
Every cell contains exactly one measurement.
dimension = 2445 x 11
Separate characteristics into key/value pairs
Convert long format to wide format
Replace string "NA" with actual NA
Split group_day into two variables
Boxplots for expression per gene in survivors vs non-survivors
Difference in day 1 gene expression in survivors vs non-survivors