Bioinformatics Training - Bioinformatics courses

Analysing your own data: a workshop on statistics and experimental design (IN-PERSON)

Mon 15 Apr 2024 09:30 Finished

This week-long course is aimed at people with little or no experience using statistical analyses in research. It introduces participants to core concepts in statistics and experimental design, aimed at ensuring that the resulting data is able to address the research question using appropriate statistical methods.

The interactive course gives participants a hands-on, applied foundation in statistical data analysis and experimental design. Group exercises and discussions are combined with short lectures that introduce key theoretical concepts. Computational methods are used throughout the course, using the R programming language. Formative assessment exercises allow participants to test their understanding throughout the course and encourage questions and critical thinking.

By the end of the course participants will be able to critically evaluate and design effective research questions, linking experimental design concepts to subsequent statistical analyses. It will allow participants to make informed decisions on which statistical tests are most appropriate to their research questions. The course will provide a solid grounding for further development of applied statistical competencies.

As a follow-up of this course, we run an extra optional session on 25 April. This is an applied, hands-on session where you can bring your own data and we provide direct support to your analysis. This is exclusively available to participants on this course.

If you do not have a University of Cambridge Raven account please book or register your interest here.

Analysis of ChIP-seq data (ONLINE LIVE TRAINING) Thu 20 Jul 2023 09:30 Finished

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a method used to identify binding sites for transcription factors, histone modifications and other DNA-binding proteins across the genome. In this course, we will cover the fundamentals of ChIP-seq data analysis, from raw data to downstream applications.

We will start with an introduction to ChIP-seq methods, including important considerations when designing your experiments. We will cover the bioinformatic steps in a standard ChIP-seq analysis workflow, covering raw data quality control, trimming/filtering, mapping, duplicate removal, post-mapping quality control, peak calling and peak annotation. We will discuss metrics used for quality assessment of the called peaks when multiple replicates are available, as well as the analysis of differential binding across sample groups. Throughout the course we will also cover tools and packages that can be used for visualising and exploring your results.

If you do not have a University of Cambridge Raven account please book or register your interest here.

Additional information

Our courses are only free for registered University of Cambridge students. All other participants will be charged according to our charging policy.
Attendance will be taken on all courses and a charge is applied for non-attendance, including for University of Cambridge students. After you have booked a place, if you are unable to attend any of the live sessions, please email the Bioinfo Team.
Further details regarding eligibility criteria are available here.

Analysis of ChIP-seq Data with SeqMonk (IN-PERSON)

Fri 5 Jul 2024 09:30 Finished

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a method used to identify binding sites for transcription factors, histone modifications and other DNA-binding proteins across the genome. In this course, we will cover the fundamentals of ChIP-seq data analysis, from raw data to downstream applications.

We will start with an introduction to ChIP-seq methods and cover the bioinformatic steps in processing ChIP-seq data. We will then introduce the use of the graphical program SeqMonk to explore and visualise your data. Finally, you will perform peak calling and perform differential enrichment analysis.

If you do not have a University of Cambridge Raven account please book or register your interest here.

Additional information

♿ The training room is located on the first floor and there is currently no wheelchair or level access.
Our courses are only free for registered University of Cambridge students. All other participants will be charged according to our charging policy.
Attendance will be taken on all courses and a charge is applied for non-attendance, including for University of Cambridge students. After you have booked a place, if you are unable to attend any of the live sessions, please email the Bioinfo Team.
Further details regarding eligibility criteria are available here.
Guidance on visiting Cambridge and finding accommodation is available here.

Analysis of DNA Methylation using Sequencing (IN-PERSON) Tue 27 Feb 2024 09:30 Finished

This course will cover all aspects of the analysis of DNA methylation using sequencing, including primary analysis, mapping and quality control of BS-Seq data, common pitfalls and complications.

It will also include exploratory analysis of methylation, looking at different methods of quantitation, and a variety of ways of looking more widely at the distribution of methylation over the genome. Finally the course will look at statistical methods to predict differential methylation.

The course comprises of a mixture of theoretical lectures and practicals covering a range of different software packages.

If you do not have a University of Cambridge Raven account please book or register your interest here.

Additional information

♿ The training room is located on the first floor and there is currently no wheelchair or level access.
Our courses are only free for registered University of Cambridge students. All other participants will be charged according to our charging policy.
Attendance will be taken on all courses and a charge is applied for non-attendance, including for University of Cambridge students. After you have booked a place, if you are unable to attend any of the live sessions, please email the Bioinfo Team.
Further details regarding eligibility criteria are available here.
Guidance on visiting Cambridge and finding accommodation is available here.

Analysis of gene regulatory sequencing data: ChIP-seq, ATAC-seq and Hi-C

Thu 24 Nov 2016 09:30 Finished

This advanced course will cover high-throughput sequencing data processing, ChIP-seq data analysis (including alignment, peak calling), differences in analyses methods for transcription factors (TF) binding and epigenomic datasets, a range of downstream analysis methods for extracting meaningful biology from ChIP-seq data and will provide an introduction to the analysis of open chromatin with ATAC-seq and long-distance interactions with chromosomal conformation capture based Hi-C datasets.

Materials for this course can be found here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

Analysis of high-throughput sequencing data with Bioconductor Wed 27 Apr 2016 09:30 Finished

This course provides an introduction to the tools available through the Bioconductor project for manipulating and analysing high-throughput sequencing (HTS) data. We will present workflows for the analysis of ChIP-Seq and RNA-seq data starting from aligned reads in bam format. We will also describe the various resources available through Bioconductor to annotate and visualize HTS data, which can be applied to any type of sequencing experiment.

The course timetable is available here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

Analysis of mapped NGS data with SeqMonk (ONLINE LIVE TRAINING) Fri 30 Apr 2021 09:30 Finished

PLEASE NOTE that until further notice, due to the evolving situation with Coronavirus no courses will be offered as classroom based at the Training Facility. The Bioinformatics Team will be teaching the course live online in conjunction with the presenters.

SeqMonk is a graphical program for the visualisation and analysis of large mapped sequencing datasets such as ChIP-Seq, RNA-Seq, and BS-Seq.

The program allows you to view your reads against an annotated genome and to quantitate and filter your data to let you identify regions of interest. It is a friendly way to explore and analysis very large datasets.

This course provides an introduction to the main features of SeqMonk and will run through the analysis of a couple of different datasets to show what sort of analysis options it provides.

Further information is available here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

Analysis of RNA-seq data with Bioconductor Wed 28 Mar 2018 09:30 Finished

This course provides an introduction to the tools available through the Bioconductor project for manipulating and analysing bulk RNA-seq data. We will present a workflow for the analysis RNA-seq data starting from aligned reads in bam format and producing a list of differentially-expressed genes. We will also describe the various resources available through Bioconductor to annotate, visualise and gain biological insight from the differential expression results.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

Analysis of small RNA-seq data

Tue 2 May 2017 09:30 Finished

This course focuses on methods for the analysis of small non-coding RNA data obtained from high-throughput sequencing (HTS) applications (small RNA-seq). During the course, approaches to the investigation of all classes of small non-coding RNAs will be presented, in all organisms.

Day 1 will focus on the analysis of microRNAs and day 2 will cover the analysis of other types of small RNAs, including Piwi-interacting (piRNA), small interfering (siRNA), small nucleolar (snoRNA) and tRNA-derived (tsRNA).

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

An Introduction to Biological Networks & their Visualization (Webinar) Fri 10 Jul 2020 10:00 Finished

This webinar is an Introduction to Biological Networks, their types, and applications. It will include two of the most commonly used open source Network Visualisation Platforms (R-igraph and Cytoscape) with step-wise protocols for creating and visualising your own data as a network. It will present some of the major layout algorithms, visual styles and tips for effective visualisation, with examples from biology revealing how these can improve analysis and provide insights.

The webinar will be presented in the form of a lecture as well as a tutorial with step-wise screenshots that enable listeners to emulate simple Network creation and analysis. Please note that this is a webinar and not a coding exercise. Links to publicly available resources and hands-on tutorials will be shared with you for further reading and practice.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

An Introduction to Data Exploration, Experimental Design, and Biomarker Expression Analysis using JMP Software Tools Thu 10 Oct 2019 13:00 Finished

Through the use of real world examples and the JMP, JMP Pro, and JMP Genomics software, we will cover best practices used in both industry and academia today to visually explore data, plan biological experiments, detect differential expression patterns, find signals in next-generation sequencing data and easily discover statistically appropriate biomarker profiles and patterns.

The training room is located on the first floor and there is currently no wheelchair or level access available to this level.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

An introduction to long-read sequencing

Thu 13 Feb 2020 09:30 Finished

Analysis of whole genome data unearths a multitude of variants of different classes, which need to be filtered, annotated and validated to arrive at a causative variant for a disease. The current short length sequences, whilst being excellent at identifying single nucleotide variants and short insertions/deletions, struggle to correctly map structural variants (SVs). Long-read sequencing technologies offer improvements in the characterisation of genetic variation and regions that are difficult to assess with short-read sequences.

The aim of this course is to familiarise participants with long read sequencing technologies, their applications and the bioinformatics tools used to assemble this kind of data. Lectures will introduce this technology and provide insight into methods for the analysis of genomic data, while the hands-on sessions will allow participants to run analysis pipelines, focusing on data generated by the Oxford Nanopore Technologies (ONT) platform.

The training room is located on the first floor and there is currently no wheelchair or level access available to this level.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

An Introduction to Machine Learning (ONLINE LIVE TRAINING) Mon 17 Jul 2023 09:30 Finished

THIS COURSE IS NOT RETURNING IN ITS CURRENT FORM. PLEASE CHECK OUR WEBSITE FOR MORE INFORMATION.

Machine learning gives computers the ability to learn without being explicitly programmed. It encompasses a broad range of approaches to data analysis with applicability across the biological sciences. Lectures will introduce commonly used algorithms and provide insight into their theoretical underpinnings. In the practicals students will apply these algorithms to real biological data-sets using the R language and environment.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

An Introduction to MATLAB for biologists (ONLINE LIVE TRAINING) Mon 12 Jul 2021 09:30 Finished

PLEASE NOTE The Bioinformatics Team are presently teaching as many courses live online, with tutors available to help you work through the course material on a personal copy of the course environment. We aim to simulate the classroom experience as closely as possible, with opportunities for one-to-one discussion with tutors and a focus on interactivity throughout.

This course aims to give you an introduction to the basics of Matlab. During the two day course we will use a practical based approach to give you the confidence to start using Matlab in your own work. In particular we will show you how to write your own scripts and functions and how to use pre-written functions. We will also explore the many ways in which help is available to Matlab users. In addition we will cover basic computer programming in Matlab to enable you to write more efficient scripts.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here.

An Introduction to R: Software For Statistical Analysis Mon 26 Oct 2015 13:30 Finished

The aim of this course is to introduce participants to the basics of statistical analysis and the open source statistical software R, a free software environment for statistical computing and graphics.

Participants will actively use R throughout the course, during which they will be introduced to principles of statistical thinking and interpretation by example, exercises and discussion about a range of problems. The examples will be used to present a variety of statistical concepts and techniques, with no focus on any specific discipline.

Important information: We have 12 configured laptops for use at the workshop. After these laptops have been allocated, participants will either need to share, or bring their own. These laptops will be allocated to the first individuals to express an interest in using them. When booking, please indicate under "Special requirements" if you wish to use one of the 12 laptops or bring your own. Participants bringing their own laptop will be given instructions on what software to install.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register interest by linking here.

An Introduction to Solving Biological Problems with PERL Mon 4 Dec 2017 09:30 Finished

This course is aimed at those new to programming and provides an introduction to programming using Perl.

During this course you will learn the basics of the Perl programming language, including how to store data in Perl’s standard data structures such as arrays and hashes, and how to process data using loops, functions, and many of Perl’s built in operators. You will learn how to write and run your own Perl scripts and how to pass options and files to them. The course also covers sorting, regular expressions, references and multi-dimensional data structures.

The course will be taught using the online Learning Perl materials created by Sofia Robb of the University of California Riverside.

The course website providing links to the course materials is here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

An Introduction to Solving Biological Problems with R Tue 11 Jun 2019 09:30 Finished

Please note that this course has been discontinued and has been replaced by the Introduction to R for biologists.

R is a highly-regarded, free, software environment for statistical analysis, with many useful features that promote and facilitate reproducible research.

In this course, we give an introduction to the R environment and explain how it can be used to import, manipulate and analyse tabular data. After the course you should feel confident to start exploring your own dataset using the materials and references provided.

The course website providing links to the course materials is here.

Please note that although we will demonstrate how to perform statistical analysis in R, we will not cover the theory of statistical analysis in this course. Those seeking an in-depth explanation of how to perform and interpret statistical tests are advised to see the list of Related courses. Moreover, those with some programming experience in other languages (e.g. Python, Perl) might wish to attend the follow-on Data Analysis and Visualisation in R course.

This event is supported by the BBSRC Strategic Training Awards for Research Skills (STARS) grant (BB/P022766/1).

The training room is located on the first floor and there is currently no wheelchair or level access available to this level.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

Bacterial Genome Assembly and Annotation in Galaxy

Thu 8 Jun 2017 09:30 Finished

The workshop will cover the basics of de novo genome assembly using a small genome example. This includes project planning steps, selecting fragment sizes, initial assembly of reads into fully covered contigs, and then assembling those contigs into larger scaffolds that may include gaps. The end result will be a set of contigs and scaffolds with sufficient average length to perform further analysis on, including genome annotation (link to that nomination). This workshop will use tools and methods targeted at small genomes. The basics of assembly and scaffolding presented here will be useful for building larger genomes, but the specific tools and much of the project planning will be different.

This workshop will also introduce genome annotation in the context of small genomes. We’ll begin with genome annotation concepts, and then introduce resources and tools for automatically annotating small genomes. The workshop will finish with a review of options for further automatic and manual tuning of the annotation, and for maintaining it as new assemblies or information becomes available.

This session will include an introduction to the Galaxy platform.

This event is co-organized with EMBL-ABR and the Genomics Virtual Lab. Course materials can be found here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

Basic statistics and data handling Wed 28 Feb 2018 09:30 Finished

This three day course is intended to open doors to applying statistics - whether directly increasing skills and personally undertaking analyses, or by expanding knowledge towards identifying collaborators. The end goal is to drive confident engagement with data analysis and further training - increasing the quality and reliability of interpretation, and putting that interpretation and subsequent presentation into the hands of the researcher. Each day of the course will deliver a mixture of lectures, workshops and hands-on practicals – and will focus on the following specific elements.

Day 1 focuses on basic approaches and the computer skills required to do downstream analysis. Covering: Basic skills for data manipulation in R. How to prepare your data effectively. Principles of experimental design and how this influences analysis.

On day 2, participants will explore the core concepts of statistics – so that they can begin to see how they can be applied to their own work, and to also help with better critical evaluation of the work of others. Covering: Basic statistics concepts and practice: power, variability, false discovery, t-test, effect size, simulations to understand what a p-value means.

On day 3 we will continue to explore core concepts of statistics, focusing on linear regression and multiple testing correction.

Course materials are available here.

This event is supported by the BBSRC Strategic Training Awards for Research Skills (STARS) grant (BB/P022766/1).

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

Big Data and Cloud Computing

Fri 1 Jun 2018 09:30 Finished

Recent advances in genomics, proteomics, imaging and other technologies, have resulted in data being generated at a faster rate than they can be meaningfully analysed. In this course we will show you how cloud computing can be used to meet the challenges of storage, management and analysis of big data. The first half of the course will introduce cloud infrastructure technologies. The second half will cover tools for collaborative working, resource management, and creation of workflows. The instructors will demonstrate how they are using cloud computing in their own research.

N.B. If you sign up for this course, you will be automatically registered for an AWS educate account, which will provide you with sufficient AWS credits to complete the course exercises. If you decide to continue using cloud computing after the course, you will need to either purchase more credits or apply for a grant from programs like: AWS Cloud Credits for Research, Microsoft Azure for Research or Google Cloud Platform Education Grants.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

BIIT web-tools for high-throughput data analysis from ELIXIR-Estonia

Thu 12 May 2016 09:30 Finished

In this course we will introduce web-based, open source tools to analyse and interpret high-throughput biological data.

The main focus will be g:Profiler - a toolset for finding most significant functional groups for a given gene or protein list; MEM - a query engine allowing to mine hundreds of public gene expression datasets to find most co-expressed genes based on a query gene; and ClustVis - a web tool for visualizing clustering of multivariate data using Principal Component Analysis (PCA) plot and heatmap.

MEM and g:Profiler are ELIXIR-Estonia node services.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book by linking here.

Bioinformatics for Principal Investigators Mon 16 Sep 2019 09:30 Finished

The aim of this workshop is to provide principal investigators with an introduction to the challenges of working with biological data and to the best practices, and tools, needed to perform bioinformatics research effectively and reproducibly.

On day 1, we will cover the importance of experimental design, discuss the challenges associated with (i) the analysis of high-throughput sequencing data (utilising RNA-seq as a working example) and (ii) the application of machine learning algorithms, as well as issues relating to reusability and reproducibility.

On day 2, we will put into practice concepts from day 1, running a RNA-seq data analysis pipeline, going from raw reads through differential expression analysis and the interpretation of downstream analysis results. Challenges encountered at each step of the analytical pipeline will be discussed. Please note that day 2 is optional.

The training room is located on the first floor and there is currently no wheelchair or level access available to this level.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

Biological data analysis using the InterMine API (Online)

Tue 17 Nov 2020 13:00 Finished

PLEASE NOTE The Bioinformatics Team are presently teaching as many courses live online. We aim to simulate the classroom experience as closely as possible, with opportunities for one-to-one discussion with tutors and a focus on interactivity throughout.

InterMine is a freely available open-source data warehouse built specifically for the integration and analysis of complex biological data sets.

InterMine-based data analysis platforms are available for many organisms including mouse, rat, budding yeast, plants (over 87 plant genomes), nematodes, fly, zebrafish, Hymenoptera, Planaria, and more recently human.

Genomic and proteomic data within InterMine databases includes pathways, gene expression, interactions, sequence variants, GWAS, regulatory data and protein expression. InterMine provides sophisticated query and visualisation tools both through a web interface and a powerful web service API, with multiple language bindings including Python and R.

This course will focus on programmatic access to InterMine through the API and InterMine searches will be done using Python and R scripts. The exercises will mainly use the fly, human and mouse databases, but the course is applicable to anyone working with data for which an InterMine database is available (a comprehensive list of InterMine databases is available here.

This event is organised alongside a half day course on Biological data analysis using the InterMine User Interface. More information on this event are available here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here.

Biological data analysis using the InterMine User Interface (Online) Mon 16 Nov 2020 13:00 Finished

PLEASE NOTE The Bioinformatics Team are presently teaching as many courses live online. We aim to simulate the classroom experience as closely as possible, with opportunities for one-to-one discussion with tutors and a focus on interactivity throughout.

InterMine is a freely available open-source data warehouse built specifically for the integration and analysis of complex biological data.

InterMine-based data analysis platforms are available for many organisms including mouse, rat, budding yeast, plants (over 87 plant genomes), nematodes, fly, zebrafishHymenoptera, Planaria, and more recently human.

Genomic and proteomic data within InterMine databases includes pathways, gene expression, interactions, sequence variants, GWAS, regulatory data and protein expression. InterMine provides sophisticated query and visualisation tools both through a web interface and a powerful web service API, with multiple language bindings including Python and R.

This course will focus on the InterMine web interface and will introduce participants to all aspects of the user interface, starting with some simple exercises and building up to more complex analysis encompassing several analysis tools and comparative analysis across organisms. The exercises will mainly use the fly, human and mouse databases, but the course is applicable to anyone working with data for which an InterMine database is available (a comprehensive list of InterMine databases is available here.)

This event is organised alongside a half day course on Biological data analysis using the InterMine API. More information on this event is available here.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here.

Biological Imaging Data Management for Facility Managers

Wed 13 Jul 2016 09:30 Finished

The Open Microscopy Environment (OME) is an open-source software project that develops tools that enable access, analysis, visualization, sharing and publication of biological image data.

OME has three components:

OME-TIFF, standardised file format and data model;
Bio-Formats, a software library for reading proprietary image file formats; and
OMERO, a software platform for image data management and analysis.

In this one day course, we will present the OMERO platform, and show how Facility Managers can use it to manage users, groups, and their microscopy, HCS and digital pathology data.

Help pages on 'Using OMERO for Facility Managers' can be found here.

This course is organized alongside a one day course on Biological Imaging Data Management for Life Scientists. More information on this event are available here.

This course will be delivered by members of the OMERO team. The OME project is supported by BBSRC and Wellcome Trust.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to book or register your interest by linking here.

All Bioinformatics courses

Please note that this course has been discontinued and has been replaced by the Introduction to R for biologists.

Contact training provider

Privacy policy
Cookie policy

Study at Cambridge

About the University

Research at Cambridge

All Bioinformatics courses

Please note that this course has been discontinued and has been replaced by the Introduction to R for biologists.

Contact training provider

Privacy policy Cookie policy

Study at Cambridge

About the University

Research at Cambridge

Privacy policy
Cookie policy