Because we’ve always done it this way!
Your job as a Bio Data Scientist:
Levering data driven decision making allows the company to gain a competitive edge and this is where you Bio Data Science skills are indispensable!
In your career, your task will be to create value
This is regardless of whether you plan to work in indstry or pursue a research career
What you do has to create value
Creating value requires skills
Skills need to be learned
So, why are you here?
“To understand computations in R, two slogans are helpful: Everything that exists is an object. Everything that happens is a function call.” – John Chambers (creator of the S language, of which R is an implementation).
You can approach R
as
Let’s say we have this vector
Now, we want to compute the mean, we can do:
Object Oriented Approach:
Vector <- R6::R6Class("Vector",
public = list(
data = NULL,
initialize = function(data) {
if (!is.numeric(data)) {
stop("Data should be numeric.")
}
self$data <- data
},
mean = function() {
return(sum(self$data) / length(self$data))
}
)
)
numbers <- Vector$new(my_vector)
print(numbers$mean())
[1] 38
You can approach R
as
You can approach R
as
The code on the right all performs the same task, but which do you think is:
In this course we will work with R
in its native form - a fully fledged functional programming language
Standard Deviation
Median
Permute
Bootstrap
…and tons more!
R
is Turing-complete:
R
can theoretically solve any computational problem. Foundational concept shared with e.g. Python, C++, Java, etc.R
comes with an ecosystem supporting reproducibility in production settings.tidyverse
R
is interpreted and can be slower, packages like data.table and Rcpp offer dramatic performance enhancements. Also, parallel computing is straightforwardClosing Thought: Every tool has its strengths. The key is to understand and leverage them effectively.
With SO many packages, there will inevitable be SO many opinions
The tidyverse is a unified opinionated collection of R packages designed for data science
All packages share an underlying design philosophy, grammar, and data structures
Today R has in essense become two dialects base
and tidyverse
Note: This course will focus solely on the tidyverse
dialect
We’ll spend a lot more time on going over the details of the Tidyverse!
From ~20 to ~150 students
This year materials have been revised to suit large class room teaching
The teaching team will do out best to support your learning, but it is important to emphasise, that you will have to take responsibility for following the course curriculum!
Tuesdays 08.00 - 12.00
Basically, show up, follow the curriculum and you will do fine!
R for Bio Data Science