Pre Course Questionnaire

Leon Eyrich Jessen

Programming Experience

Student Composition

Is there any specific area of bio-research you would like to see covered?

Data Analysis & Machine Learning

“I’m interested in more advanced statistics, mapping, and machine learning techniques applied to biological data, especially handling large datasets like RNA-seq and proteomics.”

Genetics and Genomics

“I would like to see topics related to gene-based disease discovery, genome sequencing, and CRISPR applications in bio-research.”

Immunology

“Immunology research, particularly related to autoimmune diseases and tumor immunology, would be valuable to explore.”

Multi-omics & Omics Data

“Multi-omics approaches, including proteomics, genomics, and the analysis of RNA-seq data, are areas I’m very interested in.”

Epidemiology & Clinical Data

“It would be helpful to cover epidemiology topics, including predictive modeling of disease outbreaks and the analysis of clinical datasets.”

Briefly, what are your general expectations to this course?

R Programming Proficiency

“I expect to become proficient in R programming, gaining confidence in writing, interpreting, and using R code in a variety of bio-research contexts. I aim to improve my R skills for future work in biological research and industry.”

Data Handling and Analysis

“My goal is to learn how to effectively handle and analyze large datasets, including cleaning, organizing, and applying statistical methods to biological data. I hope to gain the ability to manage big data and automate data analysis tasks in R.”

Practical Applications in Biology

“I want to apply the R programming skills learned in this course to real-world biological problems, such as genomic data analysis, bioinformatics, and pipeline development. Understanding how to use R in practical bio-research scenarios is a key expectation.”

Visualization & Data Presentation

“I hope to develop skills in visualizing and presenting biological data in a clear and effective way. Learning how to create plots and presentations that make large datasets more accessible is a critical aspect I expect to master.”

Confidence and Efficiency in Using R

“I aim to feel more confident and efficient in using R for bio-data analysis by the end of this course. I hope to reduce the time spent on coding, improve the readability of my code, and tackle intermediate challenges independently.”

Is there anything you would like to add? Comments, suggestions, anything?

Course Pace & Structure

“No rushing through the learning material would greatly benefit my understanding. A slower, more deliberate pace will help those of us who are new to programming.”

Learning Resources

“It would be really helpful to have additional learning resources, such as recommended books or websites, to support learning outside of class. Pointers for getting extra help would be appreciated.”

Industry Applications

“Including company talks with content related to protein engineering and data handling would be valuable. It would be great to see how skills from the course can be applied to real industry problems, particularly in the context of protein data.”

Community & Atmosphere

“Looking forward to the course and excited about the learning experience! There’s a general sense of anticipation and eagerness to start the class.”

Individual Programming Projects

“I’d love to focus more on coding custom functions and understanding how they can be applied in different contexts, beyond just the basics.”

Summary

Alignment of Expectations

  • Key criteria for succes is that we agree on the content of this course
  • Luckily, much of what is mentioned is part of this course
  • However, if we were to cover all of the mentioned aspects, that would basically be an entire bioinformatics education

This course

  • Practical Application: Equip students with hands-on bio data science skills, emphasizing on transforming messy datasets to clean, structured ones, and performing insightful data analyses.
  • Tools & Platforms: Utilize Tidyverse R, RStudio IDE, Quarto reporting, git/GitHub, and other specific tools such as dplyr, ggplot, and shinyapps.io for comprehensive bio data science tasks.
  • Reproducibility & Collaboration: Stress on the importance of reproducible data analysis and effective collaboration in bio data science projects using Tidyverse R and git/GitHub.
  • Broad Skill Set: Train students to perform basic statistical tests, create R packages, design Shiny apps, employ Large-Language-Model (LLM) technology like chatGPT, and more.
  • Project Analysis & Presentation: Guide students to assess and critique bio data science projects on method choices and data communication, and organize & present their own results in dynamic Quarto reports.

This course - In other words

  • Creates the foundation for you to explore the multitude of bioinformatics subjects
  • Gives you concrete tool skills to handle (almost) any kind of bio data and to do collaborative coding projects
  • Trains your general collaborative and communicative meta skills

Tag along and you’ll have fun and learn a ton!