Cambridge Digital Humanities course timetable
June 2024
Mon 3 |
It is often said we live in a society saturated with data. Visualisation methods can play a crucial role in helping to cut through the information overload. Badly designed charts, graphs and diagrams, on the other hand, can confuse or deceive. This session will introduce and contextualise graphical communication practices historically and culturally, helping you to think more critically about your own work and that of others. We will focus on graphical display as an interpretative and persuasive practice which requires as much attention to detail as writing. A hands-on collaborative exercise using historical data will give you the chance to put your visualisation skills to work. Coding skills are not required for this workshop but a basic familiarity with creating graphs and charts will be helpful. If you need to refresh your skills before the session, please use this open access workbook: https://docs.google.com/document/d/1wtwqIsWVGcsQ6rvXAzdpfNL5f7mUqV5k_Rr7ZrQCnbo/edit. (if the link doesn't work from UTBS, please copy and paste into your browser). |
October 2024
Thu 17 |
This CDH Basics session explores the lifecycle of a digital research project across the stages of design, data capture, transformation, and analysis, presentation and preservation. It introduces tactics for embedding ethical research principles and practices at each stage of the research process.
This can be attended as a standalone session or as part of the DH Basics course. |
Thu 24 |
This session provides a brief introduction to different methods for capturing bulk data from online sources or via agreement with data collection holders, including Application Programme Interfaces (APIs). We will address issues of data provenance, exceptions to copyright for text and data-mining, and discuss good practice in managing and working with data that others have created.
|
Thu 31 |
CDH Basics: Transforming your data
![]() Data which you have captured rather than created yourself is likely to need cleaning up before you can use it effectively. This short session will introduce you to the basic principles of creating structured datasets and walk you through some case studies in data cleaning with OpenRefine, a powerful open source tool for working with messy data.
This can be attended as a standalone session or as part of the DH Basics course |
November 2024
Mon 4 |
Convenor: Liz Stevenson, CDH Methods Fellow 2024/25 Are you a humanities researcher or scholar with no coding experience who would like to begin using digital analysis tools productively, manageably, and in a way that meets your needs? Come and join one of the limited places to create a toolbox of basic text mining skills and methods that you can apply to your own humanities research and a simple but clear understanding of online resources with which you can do this, such as
and coding languages and workspaces like
We will cover the underlying basic theory and philosophy behind text mining, and then equip you with the commands you need to perform tasks such as authorship attribution and statistical analyses of literary materials. This will include the basic creation of topic models, and execution of word frequency analyses, along with other similar methods of investigation. You will leave with the coding tools to create simple but attractive visualisations and graphs of your results. About the convenor: Liz is a fourth-year PhD candidate in Renaissance English at Darwin College, having studied English at Stanford University, USA, where she also completed a BA (2016) and MA (2017). Liz’s work revolves around the relationship between digital analysis and subjective understandings of meaning and physicality in Renaissance literature. She is currently completing her dissertation using R-based topic modelling, MFW & MDW analyses, and language field volume analyses to argue for the categorisation of Shakespeare’s plays according to linguistic fields on the basis of the plays’ atypical generic behaviour compared to broader Elizabeth and Jacobean stage works of literature. This workshop is part of our Methods Fellowship programme, which develops and delivers innovative teaching in digital methods. You can read more about the programme here and view the complete series of workshops here. |
Wed 6 |
Convenor: Dr Anthony (Tony) Harris, Clare Hall Workshop 1 – Wednesday 6 November This workshop will cover:
About the convenor: Anthony Harris returned to higher education in 2007 after founding and exiting from an international software business via a management buy-out. The company was the recipient of four Queen’s awards for industry and numerous other industry accolades. He subsequently read English at Oxford as a mature student, where he became interested in medieval literature and the application of the sciences to improve our understanding of early texts. At Clare Hall Anthony is conducting research on the use of artificial intelligence (AI) in the humanities, further work on early mathematics and astronomy, and enhancing his publication record. He is extremely involved in the digital humanities and is a technical research officer for the Kemble Anglo-Saxon charters website on behalf of the British Academy & Royal Historical Society, is a technical research advisor for the Revised Regesta regni Hierosolymitani Database Project and has presented several conference papers in digital humanities related fields. For academic years 2024/25 and 2025/26 he will be a British Academy Neil Ker Memorial Fund Award Holder progressing a digital humanities manuscript research project at the Bodleian Library, Oxford. In Q1 2025 he is a Fulbright all-disciplines scholar at Harvard University progressing another manuscript related digital humanities project. This workshop is part of the Cambridge Digital Humanities events programme. View the complete programme of events here. |
Thu 7 |
The impact of well-crafted data visualisations has been well-documented historically. Florence Nightingale famously used charts to make her case for hospital hygiene in the Crimean War, while Dr John Snow’s bar charts of cholera deaths in London helped convince the authorities of the water-borne nature of the disease. However, as information designer Alberto Cairo notes, charts can also lie. This introductory Basics session presents the basic principles of data visualisation for researchers who are new to working with quantitative data.
|
Mon 11 |
Methods Fellows Series | From Disk to Digital: Techniques for Imaging and Analysing Floppy Disks
![]() Convenor: Leontien Talboom, CDH Methods Fellow 2024/25 This session is designed for staff interested in imaging obsolete media and students or researchers looking to use disk images in their work. By attending this session, participants will gain a comprehensive understanding of both the technical process of creating disk images and the analytical techniques for interpreting them. This dual focus ensures that both researchers and archivists supporting researchers can derive significant value, whether they are concerned with preserving data or uncovering historical insights. The session will be divided into two parts. Part 1 will provide an overview of the different types of floppy disks, their development, and their historical context. It will also cover the process of creating disk images, which are bit-for-bit copies of the content on a floppy disk. Attendees will learn about the equipment needed to create these images, including both hardware and software tools. Part 2 will explore how disk images can be used in research. With more memory institutions offering digital collections, including disk images, understanding how to access and analyse these images is crucial. The session will cover emulators as the primary tool for accessing disk images, as well as other methods like analysing raw flux streams and interpreting hex values to identify the correct system for reading the image. During the session disk drives and floppy controllers will be brought to demonstrate the transferring process. HxC floppy disk emulator and a HxD Hex Editor will be used to demonstrate how to analyse disk images. About the convenor: Leontien has extensive experience in the field of digital preservation. Beginning as a digital archivist at the Archaeology Data Service, she then pursued a collaborative PhD with University College London and The National Archives UK, focusing on access to born-digital materials. Her research includes co-authoring the Computational Access Guide for the Digital Preservation Coalition and exploring the UK Government Web Archive using Jupyter Notebooks. As a web archivist on the Archive of Tomorrow project, she worked to enhance accessibility to online health discourse. Currently, she works as a technical analyst at Cambridge University Libraries where she looks after the Transfer Service and works with obsolete media across the collections. Recently she has been awarded a British Academy Grant to further her research interest in safeguarding floppy disk knowledge for future practitioners and researchers. This workshop is part of our Methods Fellowship programme, which develops and delivers innovative teaching in digital methods. You can read more about the programme here and view the complete series of workshops here. |
Tue 12 |
This Methods Workshop will introduce advanced techniques used for the digitisation and preservation of archival material. The first workshop will introduce the following topics:
Completing the workshop will give participants a good understanding of archival photography best practices. You will gain a strong professional vocabulary to discuss imaging and a toolkit to assess image quality. A second session, bookable separately, will focus on how to adopt those principles to the projects chosen by the participants. This will cover learning a practical approach to taking images fit for purpose in any conditions with available resources. It may also address any more advanced imaging topics such as image stitching, Optical Character Recognition, Multispectral Imaging, or photogrammetry if these are in the interest of the participants. It will also be an opportunity to visit the Digital Content Unit at Cambridge University Library. |
Wed 13 |
Convenor: Dr Anthony (Tony) Harris, Clare Hall Workshop 2 – Wednesday 13 November This workshop will cover:
About the convenor: Anthony Harris returned to higher education in 2007 after founding and exiting from an international software business via a management buy-out. The company was the recipient of four Queen’s awards for industry and numerous other industry accolades. He subsequently read English at Oxford as a mature student, where he became interested in medieval literature and the application of the sciences to improve our understanding of early texts. At Clare Hall Anthony is conducting research on the use of artificial intelligence (AI) in the humanities, further work on early mathematics and astronomy, and enhancing his publication record. He is extremely involved in the digital humanities and is a technical research officer for the Kemble Anglo-Saxon charters website on behalf of the British Academy & Royal Historical Society, is a technical research advisor for the Revised Regesta regni Hierosolymitani Database Project and has presented several conference papers in digital humanities related fields. For academic years 2024/25 and 2025/26 he will be a British Academy Neil Ker Memorial Fund Award Holder progressing a digital humanities manuscript research project at the Bodleian Library, Oxford. In Q1 2025 he is a Fulbright all-disciplines scholar at Harvard University progressing another manuscript related digital humanities project. This workshop is part of the Cambridge Digital Humanities events programme. View the complete programme of events here. |
Thu 14 |
CDH Basics: Sustaining your data
![]() Ensuring long-term access to digital data is often a difficult task: both hardware and code decay much more rapidly than many other means of information storage. Digital data created in the 1980s is frequently unreadable, whereas books and manuscripts written in the 980s are still legible. This session explores good practice in data preservation and software sustainability and looks at what you need to do to ensure that the data you don’t want to keep is destroyed.
|
Thu 21 |
The Programming Historian publishes novice-friendly, peer-reviewed tutorials that help humanists learn a wide range of digital tools, techniques, and workflows to facilitate research and teaching. Programming Historian in English, en español, en français, and em português are four multilingual, Diamond Open Access journals of article-length lessons on digital techniques and workflows. Our lessons support students and academics to learn effective research methods, and also help educators to teach tomorrow’s researchers. This workshop will introduce you to the journals, explaining how our lessons can empower your next steps in learning the practical skills you’ll need to work with data. We'll highlight lessons that could help you practice the skills you've learned during previous sessions in the Basics series, discuss troubleshooting strategies for overcoming obstacles, and think together about the value of peer-to-peer support and building a community around you as you work. This can be attended as a standalone session or as part of the DH Basics course. |
Mon 25 |
Convenor: Ali Abbas, RSE Methods Fellow This workshop introduces students to open research practices, reproducibility using Git, and code and data publishing via GitHub. Students learn best practices for organising projects, writing clean, well-documented code, and creating reproducible analyses. Examples will be given in R and Python, but the techniques are generalisable to other languages. The importance of data transparency and sharing is highlighted, encouraging students to adopt an open research mindset. Git will be introduced as a powerful tool for version control and collaboration. Students will learn to track changes in their code, create branches for experimentation, and merge updates. Git's role in ensuring reproducibility will be emphasised, showing how it allows researchers to maintain a complete history of their work. Throughout the course, reproducibility is a key focus. Students learn to create self-contained project structures, use relative file paths, and document their analysis process. Using an open-source publishing system Quarto, participants will create dynamic reports that combine code, results, and narrative. The course concludes with strategies for publishing and sharing research projects. Students learn to create GitHub Pages to showcase their work, use Zenodo to create DOIs for their code, and explore options for deploying interactive applications using tools such as Shiny. This workshop will be most suitable for those with a bit of programming experience and who have some code and data they would like to organise and publish. About the convenor: Ali is a Senior Researcher at the Public Health Modelling group, which is part of the MRC Epidemiology Unit at the University. He holds a doctorate in Computer Science, focusing on modelling. He has been engaged in data-driven research and development for over two decades. This encompasses exploratory data analysis (EDA), statistical modelling (or machine learning), and interactive visualisation, with a particular emphasis on collaborative, transparent, and reproducible research. His research interests include the development of environmentally friendly and sustainable transport systems that also have a positive impact on public health. Additionally, he assumes responsibility for project management, as well as the management, training, and guidance of junior staff within his unit. Moreover, he advocates for open and reproducible research, the results of which can be accessed on his GitHub page linked below. This workshop is part of our Methods Fellowship programme, which develops and delivers innovative teaching in digital methods. You can read more about the programme here and view the complete series of workshops here. |
Tue 26 |
Following the introductory Methods Workshops, held on 12 November 2024, this session will focus on how to adopt the principles to the projects chosen by the participants. This will cover learning a practical approach to taking images fit for purpose in any conditions with available resources. It may also address any more advanced imaging topics such as image stitching, Optical Character Recognition, Multispectral Imaging, or photogrammetry if these are in the participants' interest. Visiting the Cultural Heritage Image Laboratory at Cambridge University Library will also be an opportunity. |
December 2024
Mon 2 |
This workshop, organised in collaboration with Dr Ann Borda (Alan Turing Institute) and Semeli Hadjiloizou (Alan Turing Institute), will provide an accessible, non-technical introduction to AI systems for working with images (such as image classification, analysis and generation) and discuss sources of bias and problems of interpretation. Through discussion and hands-on exercises, we will demonstrate some of the ways in which image-generation models reproduce bias and stereotypes. We will use a data justice lens to:
|
Wed 11 |
Convenor: Dr Estara Arrant (Cambridge University Library) This session is aimed at researchers who have never done any coding before. We will explore basic principles and approaches to navigating and working with code, using the popular programming language Python. Participants will use the Jupyter Notebooks platform to learn how to analyse texts. This will provide participants with a working foundation in the fundamentals of coding in Humanities research. The software we will use is free to download and compatible with most computers, and we will provide support in installation and setup before the class. |
March 2025
Wed 12 |
In the first part of this workshop, Dr Mia Ridge will introduce the British Library and discuss examples of digital curatorship projects, experiments and training from the British Library’s Digital Research team. She will look at their work publishing library ‘collections as (open) data’, then discuss work with machine learning / AI and collections across the British Library.The second part will involve some practical activities, with time for participants to report back and discuss their experiences About the convenor: Dr Mia Ridge is the British Library’s Digital Curator for Western Heritage Collections. Part of the Digital Research team, she provides advice and training on computational research, AI / machine learning and crowdsourcing. A Co-Investigator on Living with Machines (2018-23), she co-curated the Living with Machines exhibition with Leeds Museums and Galleries (2022-23). This session is part of the workshop series 'Digital Methods for the Digital Humanist'. This series is integrated in the Postgraduate Researcher Development Programme in Digital Humanities organised by Cambridge Digital Humanities and generously co-funded by the Enhanced Funding Scheme. View the complete programme of CDH events here. |
Wed 19 |
Data-Driven History: New Approaches to Source Criticism and Experimental Research 'at Scale'
![]() In this session we consider the practice of history in an age of large digitised collections. The discipline of history has long encompassed both quantitative and qualitative approaches; we ask: what are the opportunities and the risks of a working with sources as data? How can historians adapt their craft in the era of data science, and which questions should they be asking? The first part of this session focuses on one of the most widely used types of historical source: newspapers; also one of the most misused in their digital form. We explore new methods for making sense of large newspaper collections, using Python. The second part asks what we can learn by converging large historical datasets in new combinations. Focusing on Ordnance Survey maps and the Census, we explore how machine learning makes possible new questions about nineteenth-century Britain. About the convenor: Daniel Wilson, Research Fellow at The Alan Turing Institute Daniel Wilson is a historian of science and technology working on the politics and provenance of data and machines in the nineteenth, twentieth and twenty-first centuries. His work combines traditional close-reading and archival study with computational techniques. Current projects include using language models and other critical methods to explore historical text datasets, including the internationally important collections of the British Library. This session is part of the workshop series 'Digital Methods for the Digital Humanist'. This series is integrated in the Postgraduate Researcher Development Programme in Digital Humanities organised by Cambridge Digital Humanities and generously co-funded by the Enhanced Funding Scheme. View the complete programme of CDH events here. |