HPBBM
There are only two kinds of programming languages: the ones people complain about and the ones nobody uses. —Bjarne Stroustrup
Wellcome & Disclaimer
This site contains the materials for the Coding tools for Biochemistry & Molecular Biology (in Spanish Herramientas de Programación para Bioquímica y Biología Molecular, a.k.a. HPBBM) course of fall 2024 in the Bachelor’s Degree in Biochemistry @UAM (Universidad Autónoma de Madrid, Spain). Source files are available at the repo, containing the source files and the data used in the examples and exercises. The icon on the top-right corner in all the site pages will allow you to download the repo data folder (link generated with DownGit). Note that this folder can be updated during the course. For reference, materials from 2023 and 2022 course editions are also available, although those are not updated or maintained anymore.
Course Syllabus can be checked at UAM site and a PDF version is available here. The course contains a first block of lessons on Python programming (Lessons 1-7) that is not included here. Detailed academic information about the course contents, dates, and assessment only can be found at the UAM Moodle site. In this site, the lessons pages have been generated as Quarto to obtain source files (.qmd) that contain formatted text and chunks of R code that are executed upon rendering (see lesson R7 for more info about Markdown and Quarto in RStudio).
These materials have been designed as an interactive and self-learning resource. As you go through the lessons, think about the code beforehand and then test it yourself with your computer. Then try out the exercises for yourself.
For further examples or additional help, you can consult the references and, of course, ask the lecturers in the meetings and lectures on site.
All the Lessons contain a final R Session info where you can see the used versions for R and all packages. Anyway, specially if you installed R in your computer for the first time some years ago, I kindly request to update R and RStudio before going forward.
Let’s go, go it now!
The lessons also contain embedded interactive Quick exercises (created with packages WebR
or webexercises
). Try them out before checking the proposed solution. We have also included a final section of Review exercises, that we will discuss in the classroom. The answers to the latter are not available online, but I am happy to provide them on request. Also, we often add additional exercises from third-party websites or blogs that you can work on to improve your skills and reinforce some key concepts.
All this material is open access and it is shared under CC BY-NC license.
This page is under construction and will be expanded and corrected in the coming weeks. Any feedback, help and suggestions are very welcome.
Also, it is likely that you will find some typos or even major errors in these course materials. I and the future students would appreciate it if you let me know about anything that could be corrected or improved. You can reach us by email.
Bibliography and Resources
Each lesson contains its own specific references, but we wanted to highlight here some curated general open access and open source resources to introduce yourself into R and R Studio.
Grolemund, G. Hands-on Programming in R.
Peng, R. D. R Programming for Data Science.
Coulatti, R. I. R Crash Course for Biologist.
Chang, W. R. R Graphics Cookbook.
Doyle, M., Chung, J. & Perreau V. Introduction to R for Biologists.
Higgins, P. D. R. Reproducible Medical Research with R.
Contents
Lesson R1: Variables, Operators and data structures in R
Lesson R2: Flow Control
Lesson R3: Data input and output in R
Lesson R4: Write your own functions
Lesson R5: Plots
Lesson R6: Data management
Lesson R7: R-projects and R-markdown for reproducible data analysis
Lesson R8: Advanced data plotting with ggplot
Lesson R9: Applications for Molecular Biology
Footnotes
See the code in the script r_sticker.R in the repo.↩︎