Bioinformatics course timetable
June 2022
Wed 15 |
R is one of the leading programming languages in Data Science. It is widely used to perform statistics, machine learning, visualisations and data analyses. It is an open source programming language so all the software we will use in the course is free. This course is an introduction to R designed for participants with no programming experience. We will start from scratch by introducing how to start programming in R and progress our way and learn how to read and write to files, manipulate data and visualise it by creating different plots - all the fundamental tasks you need to get you started analysing your data. During the course we will be working with one of the most popular packages in R; tidyverse that will allow you to manipulate your data effectively and visualise it to a publication level standard. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Thu 16 |
R is one of the leading programming languages in Data Science. It is widely used to perform statistics, machine learning, visualisations and data analyses. It is an open source programming language so all the software we will use in the course is free. This course is an introduction to R designed for participants with no programming experience. We will start from scratch by introducing how to start programming in R and progress our way and learn how to read and write to files, manipulate data and visualise it by creating different plots - all the fundamental tasks you need to get you started analysing your data. During the course we will be working with one of the most popular packages in R; tidyverse that will allow you to manipulate your data effectively and visualise it to a publication level standard. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Fri 17 |
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing (scRNA-seq). Even though scRNA-seq makes it possible to address problems that are intractable with bulk RNA-seq data, analysing scRNA-seq is also more challenging. In this course we will be surveying the existing problems as well as the available computational and statistical frameworks available for the analysis of scRNA-seq. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Mon 20 |
This course introduces concepts about reproducibility that can be used when you are programming in R. We will explore how to create notebooks - a way to integrate your R analyses into reports using Rmarkdown. The course also introduces the concept of version control. We will learn how to create a repository on GitHub and how to work together on the same project collaboratively without creating conflicting versions of files. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
This course provides an introduction to the basic theory and concepts of network analysis. Attendees will learn how to construct protein-protein interaction networks and subsequently use these to overlay large-scale data such as that obtained through RNA-Seq or mass-spec proteomics. The course will focus on giving attendees hands-on experience in the use of one of the most commonly used open source Network Visualisation Platforms, Cytoscape. The course will also access and analyse the data through Cytoscape apps, including IntAct app. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
|
Tue 21 |
This course provides an introduction to the basic theory and concepts of network analysis. Attendees will learn how to construct protein-protein interaction networks and subsequently use these to overlay large-scale data such as that obtained through RNA-Seq or mass-spec proteomics. The course will focus on giving attendees hands-on experience in the use of one of the most commonly used open source Network Visualisation Platforms, Cytoscape. The course will also access and analyse the data through Cytoscape apps, including IntAct app. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Wed 22 |
R is one of the leading programming languages in Data Science. It is widely used to perform statistics, machine learning, visualisations and data analyses. It is an open source programming language so all the software we will use in the course is free. This course is an introduction to R designed for participants with no programming experience. We will start from scratch by introducing how to start programming in R and progress our way and learn how to read and write to files, manipulate data and visualise it by creating different plots - all the fundamental tasks you need to get you started analysing your data. During the course we will be working with one of the most popular packages in R; tidyverse that will allow you to manipulate your data effectively and visualise it to a publication level standard. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Thu 23 |
R is one of the leading programming languages in Data Science. It is widely used to perform statistics, machine learning, visualisations and data analyses. It is an open source programming language so all the software we will use in the course is free. This course is an introduction to R designed for participants with no programming experience. We will start from scratch by introducing how to start programming in R and progress our way and learn how to read and write to files, manipulate data and visualise it by creating different plots - all the fundamental tasks you need to get you started analysing your data. During the course we will be working with one of the most popular packages in R; tidyverse that will allow you to manipulate your data effectively and visualise it to a publication level standard. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Fri 24 |
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing (scRNA-seq). Even though scRNA-seq makes it possible to address problems that are intractable with bulk RNA-seq data, analysing scRNA-seq is also more challenging. In this course we will be surveying the existing problems as well as the available computational and statistical frameworks available for the analysis of scRNA-seq. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Wed 29 |
This award winning virtually delivered course is intended to provide a strong foundation in practical statistics and data analysis using the R software environment. The underlying philosophy of the course is to treat statistics as a practical skill rather than as a theoretical subject and as such the course focuses on methods for addressing real-life issues in the biological sciences. There are three core goals for this course:
R is an open source programming language so all of the software we will use in the course is free. In this course, we explore classical statistical analysis techniques starting with simple hypothesis testing and building up to linear models and power analyses. The focus of the course is on practical implementation of these techniques and developing robust statistical analysis skills rather than on the underlying statistical theory. After the course you should feel confident to be able to select and implement common statistical techniques using R and moreover know when, and when not, to apply these techniques. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Thu 30 |
This award winning virtually delivered course is intended to provide a strong foundation in practical statistics and data analysis using the R software environment. The underlying philosophy of the course is to treat statistics as a practical skill rather than as a theoretical subject and as such the course focuses on methods for addressing real-life issues in the biological sciences. There are three core goals for this course:
R is an open source programming language so all of the software we will use in the course is free. In this course, we explore classical statistical analysis techniques starting with simple hypothesis testing and building up to linear models and power analyses. The focus of the course is on practical implementation of these techniques and developing robust statistical analysis skills rather than on the underlying statistical theory. After the course you should feel confident to be able to select and implement common statistical techniques using R and moreover know when, and when not, to apply these techniques. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
July 2022
Fri 1 |
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing (scRNA-seq). Even though scRNA-seq makes it possible to address problems that are intractable with bulk RNA-seq data, analysing scRNA-seq is also more challenging. In this course we will be surveying the existing problems as well as the available computational and statistical frameworks available for the analysis of scRNA-seq. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Mon 4 |
Machine learning gives computers the ability to learn without being explicitly programmed. It encompasses a broad range of approaches to data analysis with applicability across the biological sciences. Lectures will introduce commonly used algorithms and provide insight into their theoretical underpinnings. In the practicals students will apply these algorithms to real biological data-sets using the R language and environment. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Tue 5 |
Machine learning gives computers the ability to learn without being explicitly programmed. It encompasses a broad range of approaches to data analysis with applicability across the biological sciences. Lectures will introduce commonly used algorithms and provide insight into their theoretical underpinnings. In the practicals students will apply these algorithms to real biological data-sets using the R language and environment. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Wed 6 |
This award winning virtually delivered course is intended to provide a strong foundation in practical statistics and data analysis using the R software environment. The underlying philosophy of the course is to treat statistics as a practical skill rather than as a theoretical subject and as such the course focuses on methods for addressing real-life issues in the biological sciences. There are three core goals for this course:
R is an open source programming language so all of the software we will use in the course is free. In this course, we explore classical statistical analysis techniques starting with simple hypothesis testing and building up to linear models and power analyses. The focus of the course is on practical implementation of these techniques and developing robust statistical analysis skills rather than on the underlying statistical theory. After the course you should feel confident to be able to select and implement common statistical techniques using R and moreover know when, and when not, to apply these techniques. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Using the Linux operating system and the bash command line interface, we will demonstrate the basic structure of the UNIX operating system and how we can interact with it using a basic set of commands. Applying this, we will learn how to navigate the filesystem, manipulate text-based data and structure simple pipelines out of these commands. Building on the techniques learnt so far, we will then construct bash scripts combining the commands and structures already learnt into more complex, reusable tools. We will look at how we can apply these scripts to common problems faced in UNIX environments such as: communicating with remote servers; managing custom software installations and integrating these tools into our simple pipelines. This course is targeted at participants with no prior experience working with UNIX-like systems (OSX, Linux) or command line interfaces. Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here. |
|
Thu 7 |
This award winning virtually delivered course is intended to provide a strong foundation in practical statistics and data analysis using the R software environment. The underlying philosophy of the course is to treat statistics as a practical skill rather than as a theoretical subject and as such the course focuses on methods for addressing real-life issues in the biological sciences. There are three core goals for this course:
R is an open source programming language so all of the software we will use in the course is free. In this course, we explore classical statistical analysis techniques starting with simple hypothesis testing and building up to linear models and power analyses. The focus of the course is on practical implementation of these techniques and developing robust statistical analysis skills rather than on the underlying statistical theory. After the course you should feel confident to be able to select and implement common statistical techniques using R and moreover know when, and when not, to apply these techniques. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Using the Linux operating system and the bash command line interface, we will demonstrate the basic structure of the UNIX operating system and how we can interact with it using a basic set of commands. Applying this, we will learn how to navigate the filesystem, manipulate text-based data and structure simple pipelines out of these commands. Building on the techniques learnt so far, we will then construct bash scripts combining the commands and structures already learnt into more complex, reusable tools. We will look at how we can apply these scripts to common problems faced in UNIX environments such as: communicating with remote servers; managing custom software installations and integrating these tools into our simple pipelines. This course is targeted at participants with no prior experience working with UNIX-like systems (OSX, Linux) or command line interfaces. Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here. |
|
Tue 12 |
Machine learning gives computers the ability to learn without being explicitly programmed. It encompasses a broad range of approaches to data analysis with applicability across the biological sciences. Lectures will introduce commonly used algorithms and provide insight into their theoretical underpinnings. In the practicals students will apply these algorithms to real biological data-sets using the R language and environment. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Wed 13 |
This award winning virtually delivered course is intended to provide a strong foundation in practical statistics and data analysis using the R software environment. The underlying philosophy of the course is to treat statistics as a practical skill rather than as a theoretical subject and as such the course focuses on methods for addressing real-life issues in the biological sciences. There are three core goals for this course:
R is an open source programming language so all of the software we will use in the course is free. In this course, we explore classical statistical analysis techniques starting with simple hypothesis testing and building up to linear models and power analyses. The focus of the course is on practical implementation of these techniques and developing robust statistical analysis skills rather than on the underlying statistical theory. After the course you should feel confident to be able to select and implement common statistical techniques using R and moreover know when, and when not, to apply these techniques. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Have you heard about High Performance Computing, but are not sure what it is or whether it is relevant for your work? Would you like to use a HPC, but are not sure where to start? Are you using your personal computer to run computationally demanding tasks, which take long and slow down your work? Do you need to use software that runs on Linux, but don't have access to a Linux computer? If any of these questions apply to you, then this course might be for you! Knowing how to work on a High Performance Computing system is an essential skill for applications such as bioinformatics, big-data analysis, image processing, machine learning, parallelising tasks, and other high-throughput applications. In this course we will cover the basics of High Performance Computing, what it is and how you can use it in practice. This is a hands-on workshop, which should be accessible to researchers from a range of backgrounds and offering several opportunities to practice the skills we learn along the way. As an optional session for those interested, we will also introduce the (free) HPC facilities available at Cambridge University (the course is not otherwise Cambridge-specific). Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
|
Thu 14 |
This award winning virtually delivered course is intended to provide a strong foundation in practical statistics and data analysis using the R software environment. The underlying philosophy of the course is to treat statistics as a practical skill rather than as a theoretical subject and as such the course focuses on methods for addressing real-life issues in the biological sciences. There are three core goals for this course:
R is an open source programming language so all of the software we will use in the course is free. In this course, we explore classical statistical analysis techniques starting with simple hypothesis testing and building up to linear models and power analyses. The focus of the course is on practical implementation of these techniques and developing robust statistical analysis skills rather than on the underlying statistical theory. After the course you should feel confident to be able to select and implement common statistical techniques using R and moreover know when, and when not, to apply these techniques. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
Have you heard about High Performance Computing, but are not sure what it is or whether it is relevant for your work? Would you like to use a HPC, but are not sure where to start? Are you using your personal computer to run computationally demanding tasks, which take long and slow down your work? Do you need to use software that runs on Linux, but don't have access to a Linux computer? If any of these questions apply to you, then this course might be for you! Knowing how to work on a High Performance Computing system is an essential skill for applications such as bioinformatics, big-data analysis, image processing, machine learning, parallelising tasks, and other high-throughput applications. In this course we will cover the basics of High Performance Computing, what it is and how you can use it in practice. This is a hands-on workshop, which should be accessible to researchers from a range of backgrounds and offering several opportunities to practice the skills we learn along the way. As an optional session for those interested, we will also introduce the (free) HPC facilities available at Cambridge University (the course is not otherwise Cambridge-specific). Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
|
Wed 27 |
This course will be run as an in-person event only! Generalised linear models are the kind of models we would use if we had to deal with non-continuous response variables. For example, this happens if you have count data or a binary outcome. This course aims to introduce generalised linear models, using the R software environment. Similar to Core statistics using R this course addresses the practical aspects of using these models, so you can explore real-life issues in the biological sciences. The Generalised linear models using R course builds heavily on the knowledge gained in the core statistics sessions, which means that the Core statistics using R course is a firm prerequisite for joining. There are several aims to this course: 1. Be able to distinguish between linear models and generalised linear models 2. Analyse binary outcome and count data using R 3. Critically assess model fit R is an open-source programming language so all of the software we will use in the course is free. We will be using the R Studio interface throughout the course. Most of the code will be focussed around the tidyverse and tidymodels packages, so a basic understanding of the tidyverse syntax is essential.
|
September 2022
Mon 12 |
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing (scRNA-seq). Even though scRNA-seq makes it possible to address problems that are intractable with bulk RNA-seq data, analysing scRNA-seq is also more challenging. In this course we will be surveying the existing problems as well as the available computational and statistical frameworks available for the analysis of scRNA-seq. The training room is located on the first floor and there is currently no wheelchair or level access available to this level. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |
This course provides a practical introduction to the writing of Python programs for the complete novice. Participants are lead through the core concepts of Python including Python syntax, data structures and reading/writing files. These are illustrated by a series of example programs. Upon completion of the course, participants will be able to write simple Python programs. Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here. |