skip to navigation skip to content

All Cambridge Digital Humanities courses

Show:
Show only:

35 matching courses
Courses per page: 10 | 25 | 50 | 100


Analysing and Visualising Social Media Data (Workshop) new Mon 11 Feb 2019   14:00 Finished

This session introduces a variety of analytical strategies, with a focus on Social Network Analysis, the most widely used and abused method for analysing and visualising digital and social media data. At the end of this session, you will be familiar with the basic concepts, techniques and measures of social network analysis.

Archival Photography: An Introduction new Wed 12 Jun 2019   11:00 Finished

This session focusses on providing photography skills for those undertaking archival research. Dr Oliver Dunn has experience spanning a decade filming documents for major academic research projects. He will go over practical approaches to finding and ordering materials in the archive, methods of handling and filming them, digital file storage, and transcription strategies. The focus is very much on low-tech approaches and small budgets. We’ll consider best uses of smartphones, digital cameras and tripods. The session is held at the Digital Content Unit at the University Library.

Find out how to use blogging in your research. The first of two sessions on research blogging will explore the benefits and limitations of blogging for public engagement.

The second of two sessions on research blogging will explore how social media can enable public engagement with your blog, learn how to set up a Twitter chat and explore other methods to get people talking about your research.

Creating Databases from Historical Sources (Workshop) Mon 25 Feb 2019   11:00 Finished

This workshop will examine strategies for transforming a variety of sources into structured digital data, ranging from crumbling manuscripts to printed documents and books.

Data Presentation and Preservation new Tue 28 Jan 2020   11:30 [Places]

The afterlife of your research data forms a vitally important part of your research project. Research funders and academic journal publishers are often strongly committed to the re-use of data and are reluctant to fund or publish research where datasets are not accessible for the purposes of peer review or further use. Yet the push for open data exists in tension with the expectations of data protection law which requires transparency from researchers about how long they will retain personal data. This session will explore good practice in data sharing and archiving as well as introducing sources of further information and advice within the University on this topic.

Data Wrangling (Workshop) new Mon 4 Feb 2019   14:00 Finished

Garbage in, garbage out! Your output is as good or as bad as your input. Data collected from online sources is often dirty and messy. Discover how to clean and organise your data. After transforming raw data into a structured dataset, you will be ready to perform data analysis.

Digital Data Collection and Wrangling new Tue 14 Jan 2020   11:30 [Places]

This session addresses the technical and ethical aspects of digital data collection and wrangling – two fundamental stages in the lifecycle of a digital research project. Participants will be introduced to online data sources and practices of internet-mediated data collection, including retrieving data from social media platforms. As data collected from online sources is often dirty and messy, we will also provide a short practical introduction to the process of transforming raw data into a clean and structured dataset using free and open-source software.

Digital Data Collection (Workshop) new Mon 28 Jan 2019   14:00 Finished

This session is a primer on digital data collection. The goal is to become familiar with online data sources and practices of internet-mediated data collection, including retrieving data from social media platforms.

The shelf-life of your dataset dictates the longevity of your findings. Sharing your data and assuring its integrity is a fundamental part of a digital research project. In this session we will discuss the principles of open data, channels for data dissemination and the fundamentals of data preservation.

Digital Mapping for Historians new Wed 26 Jun 2019   09:30 Finished

This intensive workshop will provide an overview of a range of applications of digital mapping in historical research projects and introduce GIS tools and software.

Digital Research Design and Data Ethics new Tue 7 Jan 2020   11:30 [Places]

This session explores the lifecycle of a digital research project, across the stages of design, data capture, transformation, analysis, presentation and preservation, and introduces tactics for embedding ethical research principles and practices at each stage of the research process. We discuss how to assess the impact of relevant legal frameworks, including data protection, intellectual property and media law, on your digital research project and consider what approach researchers should take to the terms of service of third-party digital platforms.

Please note: this session covers the same content as The Ethics of Data Collection and Use in the Social Sciences Research Methods Programme. Students eligible to attend SSRMP are therefore requested not to book this session (though they are welcome to attend other sessions in this course), and instead to book here https://www.training.cam.ac.uk/jsss/course/ssrmc-ethicdaus

Digital Research Design, Methods and Ethics (Workshop) new Mon 21 Jan 2019   14:00 Finished

Find out how to shape a digital research project from scratch. This session will introduce the building blocks of online research design, from the several methodologies available to conduct the research to the ethical guidelines that should underpin our projects.

Dr Nathan Crilly and Chih-Chun Chen explore the challenges of communicating complex ideas to diverse audiences through a variety of digital media formats. Three case studies will be reported from an EPSRC-funded research project which sought to clarify and communicate the nature of complex system design and its relationship to emerging technologies. For example, the project studied the way in which technologists working in Synthetic Biology and Swarm Robotics conceptualise and address the complexity of the systems they are designing. Outputs from the project include: • A 35-page ‘primer’ on the subject of complexity (now with over 6000 downloads) • A three-minute animated movie discussing the subjectivity of complexity (now with 2500 views) • An interactive website (implemented by Dr Chen since she has programming skills) that generates annotated bibliographies for complexity resources tailored to a user’s interests (launched in March 2019) Dr Crilly and Dr Chih-Chun will discuss the process of engaging with media partners, including working with science communication agencies, animators and film-makers, reflect on what they learned from the process and what they would do differently in future.

Film-making for Beginners new Sat 1 Dec 2018   09:30 Finished

Learn to think visually and communicate using sound and film: participants will be introduced to the language of film, shot types, camera movements, framing, basic rules of camera use, how to tell a story, and editing in the Phoenix Training Suite.

Film-making for Beginners (Level 2) new Mon 24 Jun 2019   09:30 Finished

Learn to think visually and to communicate using sound and film. Participants will be introduced to the language of film, shot types, camera movements, framing, basic rules of camera use, how to tell a story, and editing. Some prior knowledge of filming is required. Please see the CDH website for more details (www.cdh.cam.ac.uk).

From Blog to Book new Thu 10 Oct 2019   14:00 Finished

Blogging as a digital means of research communication seems so simple: with free, easy-to-use platforms we’re all just a few clicks away from setting one up. But having set a blog up, the difficult work begins. Who are you talking to? What are you trying to achieve? How will you generate your content? How will the people you want to talk to find it? How are you going to keep it going alongside your research and teaching commitments? Will it make any difference to anything? And will you ever be able to transform any of this work into a scholarly publication that ‘counts’?

This session will be an interactive conversation between Julie Blake, Cambridge Digital Humanities Methods Fellow and Connie Ruzich, University Professor of English at Robert Morris University, Pittsburgh, USA. Connie’s Behind Their Lines blog started in 2014 during a Fulbright Scholarship at Exeter University to research First World War poetry in the context of the Centenary Commemorations. She became interested in the lost and neglected poetry of the First World War and began blogging about her ‘finds’. Five years later, she has had almost 400,000 visits to her blog, she maintains a lively dialogue with public and academic audiences including via Twitter and she is in the final stages of completing a monograph about this material with Bloomsbury Academic.

We’ll discuss the highs and lows of Connie’s research blogging experience, the surprises, the pitfalls and the lessons learned by hard won experience. We’ll try to answer all the questions listed above, and participants will be invited to join in with their own questions.

Game Design Workshop new Mon 2 Dec 2019   09:30 CANCELLED

This two-day intensive workshop will introduce participants to the practice of game design. It will explore the different ways that digital and analogue games are designed, particularly how you can design with intent to communicate a mood, theme or message. Participants will learn game design skills - such as boxing-in, design documents and prototyping – alongside opportunities to test them out by creating their own short games.

The sessions focus on game design, how to shape mechanics and play experiences, so no technical skills are needed. Participants will create their short games using both non-digital tools and simple, free software that will be taught in the session.

The course participants will be selected via an application process, once a provisional place is booked a call for application form will be issued for completion and return by 1 November 2019. Once the applications are reviewed, places will be confirmed directly in the week beginning 18 November 2019.

Generative Adversarial Networks Experimentation Lab new Tue 11 Dec 2018   11:30 Finished

This workshop will discuss prospective methods and approaches for critically engaging with the images of people created through Generative Adversarial Networks, using design experiments as provocations to expand debate about notions of ‘realism’ and ‘authenticity’ in an era where human and machine vision are ever more systematically intertwined.

Introduction to Text-Mining with Python 1 new Tue 30 Apr 2019   11:00 Finished

This session will introduce basic methods for reading and processing text files in Python. We will walk through an example that reads in a large text corpus, splits it into tokens (words) and sentences, removes unwanted words (stopwords), counts the words (frequency analysis), and visualises results. We will talk about the 5 steps of text mining and what resources to use when learning text mining for your research in your own time. No prior knowledge of Python is required, and no installations will be needed. We will use web services available in your browser to follow along.

Introduction to Text-Mining with Python 2 new Tue 7 May 2019   11:00 Finished

This session will introduce topic modelling. Topic modelling is looking for clusters of words that summarise the meaning of documents. We will talk about how to choose what sort of text mining you might want for your research. Some knowledge of Python is required, as gained from 'Introduction to Text-Mining with Python 1', or equivalent. No installations will be needed; we will use web services available in your browser to follow along with the examples.

Network Analysis for Humanities Scholars new Mon 27 Jan 2020   12:30 [Places]

This workshop is a very basic introduction to network analysis for humanities scholars. It will introduce the concepts of networks, nodes, edges, directed and weighted networks, bi- and multi-partite networks. It will give an overview of the kinds of things that can be thought about through a network framework, as well as some things that can’t. And it will introduce key theories, including weak ties, and small worlds. There will be an activity where participants will build their own test data set that they can then visualise. In the second half of the workshop we will cover some networks metrics including various centrality measures, clustering coefficient, community detection algorithms. It will include an activity introducing one basic web-based tool that allows you to run some of these algorithms and will provide suggestions for routes forward with other tools and coding libraries that allows quantitative analysis.

Attendees should bring their own laptops.

Ruth Ahnert is Professor of Literary History & Digital Humanities at Queen Mary University of London, and is currently leading two large AHRC-funded projects: Living with Machines, and Networking Archives. She is author of The Rise of Prison Literature in the Sixteenth Century (2013), and co-author of Tudor Networks of Power, and The Network Turn (both forthcoming).

Optical Character Recognition is a term used to describe techniques for converting images containing printed or handwritten text into a format that can be searched and analysed computationally. This workshop will introduce several such tools along with some practical techniques for using them, and will also highlight OCR and related services offered by the Digital Content Unit at the Cambridge University Library.

Podcasting: An Introduction new Fri 12 Oct 2018   11:00 Finished

An introduction to audio recording and editing aimed at students and staff interested in learning how podcasting can help disseminate research.

Qualitative Research in Online Environments new Tue 21 Jan 2020   11:30 [Standby]

What happens to the practice of qualitative research when interactions between researcher and research subject are largely mediated. This session will explore a wide range of topics including the challenge of consent, researcher presence and ‘lurking’ in mediated settings, how to engage with digital gatekeepers, information security for researchers, and understanding the impact of digital platform architecture on qualitative research design.

Social Network Analysis with Digital Data new Tue 4 Feb 2020   11:00 [Places]

This course will provide a hands-on introduction to the field of Social Network Analysis, giving participants the opportunity to “learn by doing” the process of network data collection and analysis. After being introduced to the basic concepts, the participants will have the opportunity to explore all stages of a social network analysis project, including research design, essential measures, data collection and data analysis. The focus will be on the retrieval of electronic archival data (e.g. websites, digital archives and social media platforms) for non-programmers and on the production of network analysis with specialised software (e.g. Gephi). At the end, the participants will be equipped with the basic tools to perform meaningful visualisations and analyses of network data.

Sources to Data (Workshop) Wed 5 Jun 2019   11:00 Finished

This workshop will examine database creation from historical documents. Extracting data from these can be hard work and involves quite unusual skill combinations. You may need to digitise and transcribe from primary sources, and then design and build a database from scratch with the information. Other sources you use could already be digitised but may be arranged or filed in an unsuitable way for your project and therefore need conversion. We will look at techniques used when employing crumbling manuscripts, printed documents, books, or text searchable images, to harvest historical data. Techniques include manual data-entry, scanning and OCR, and handwritten text recognition systems.

Letters have been for centuries the main form of communication between scientists. Correspondence collections are a unique window into the social networks of prominent historical figures. What can digital social sciences and humanities reveal about the correspondence networks of 19th century scientists? This two-session intensive workshop will give participants the opportunity to explore possible answers to this question.

With the digitisation and encoding of personal letters, researchers have at their disposal a wealth of relational data, which we propose to study through social network analysis (SNA). The workshop will be divided in two sessions during which participants will “learn by doing” how to apply SNA to personal correspondence datasets. Following a guided project framework, participants will work on the correspondence collections of John Herschel and Charles Darwin. After a contextual introduction to the datasets, the sessions will focus on the basic concepts of SNA, data transformation and preparation, data visualisation and data analysis, with particular emphasis on “ego network” measures.

The two demonstration datasets used during the workshop will be provided by the Epsilon project, a research consortium between Cambridge Digital Library, The Royal Institution and The Royal Society of London aimed at building a collaborative digital framework for 19th century letters of science. The first dataset, the “Calendar of the Correspondence of Sir John Hershel Database at the Adler Planetarium”, is a collection of the personal correspondence of John Frederick William Herschel (1792-1871), a polymath celebrated for his contributions to the field of astronomy. Its curation process started in the 50s at the Royal Society and currently comprises 14.815 digitised letters encoded in extensible markup language (.xml) format. The second dataset, the “Darwin Correspondence Project” has been locating, researching, editing and publishing Charles Darwin’s letters since 1974. In addition to a 30-volume print edition, the project has also made letters available in .xml format.

The workshop will provide a step-by-step guide to analysing correspondence networks from these collections, which will cover:

- Explanation of the encoding procedures and rationale following the Text Encoding Initiative guidelines; - Preparation and transformation of .xml files for analysis with an open source data wrangler; - Rendering of network visualisations using an open source SNA tool; - Analysis of the Ego Networks of John Herschel and Charles Darwin (requires UCINET)

About the speakers and course facilitators:

Anne Alexander is Director of Learning at Cambridge Digital Humanities

Hugo Leal is Methods Fellow at Cambridge Digital Humanities and Co-ordinator of the Cambridge Data School

Louisiane Ferlier is Digital Resources Manager at the Centre for the History of Science at the Royal Society. In her current role she facilitates research collaborations with the Royal Society collections, curates digital and physical exhibitions, as well as augmenting its portfolio of digital assets. A historian of ideas by training, her research investigates the material and intellectual circulation of ideas in the 17th and 18th centuries.

Elizabeth Smith is the Associate Editor for Digital Development at the Darwin Correspondence Project, where she contributed to the conversion of the Project’s work into TEI several years ago, and has since been collaborating with the technical director in enhancing the Darwin Project’s data. She is one of the co-ordinators of Epsilon, a TEI-based portal for nineteenth-century science letters.

No knowledge of prior knowledge of programming is required, instructions on software to install will be sent out before the workshop. Some exercises and preparation for the second session will be set during the first and participants should allow 2-3 hours for this. Please note, priority will be given to staff and students at the University of Cambridge for booking onto this workshop.

CDH Learning gratefully acknowledges the support of the Isaac Newton Trust and the Faculty of History for this workshop.

The Library as Data new Mon 15 Oct 2018   13:30 Finished

Discover the rich digital collections of Cambridge University Library and explore the methods and tools that researchers are using to analyse and visualise data.

The Library as Data: An overview new Wed 16 Oct 2019   11:00 Finished

Is the "digital library" more than a virtual rendering of the bookshelf or filing cabinet? Does the transformation of books into bytes and manuscripts into pixels change the way we create and share knowledge? This session introduces a conceptual toolkit for understanding the library collection in the digital age, and provides a guide to key methods for accessing, transforming and analysing the contents as data. Using the rich collections of Cambridge University Library as a starting point, we will explore:

  • Relations between digital and material texts and artefacts
  • Definitions of data and metadata
  • Methods for accessing data in bulk from digital collections
  • Understanding file formats and standards

The session will also provide an overview of the content in the rest of the term’s Library as Data programme, and introduce our annual call for applications to the Machine Reading the Archive Projects mentoring scheme.

The Library as Data: Digital Text Markup and TEI new Wed 23 Oct 2019   11:00 Finished

Text encoding, or the addition of semantic meaning to text, is a core activity in digital humanities, covering everything from linguistic analysis of novels to quantitative research on manuscript collections. In this session we will take a look at the fundamentals of text encoding – why we might want to do it, and why we need to think carefully about our approaches. We will also introduce the TEI (Text Encoding Initiative), the most commonly used standard for markup in the digital humanities, and look at some common research applications through examples.

Recent advances in machine learning are allowing computer vision and humanities researchers to develop new tools and methods for exploring digital image collections. Neural network models are now able to match, differentiate and classify images at scale in ways which would have been impossible a few years ago. This session introduces the IIIF image data framework, which has been developed by a consortium of the world’s leading research libraries and image repositories, and demonstrates a range of different machine learning- based methods for exploring digital image collections. We will also discuss some of the ethical challenges of applying computer vision algorithms to cultural and historical image collections. Topics covered will include:

  • Unlocking image collections with the IIIF image data framework
  • Machine Learning: a very short introduction
  • Working with images at scale: ethical and methodological challenges
  • Applying computer vision methods to digital collections

This session focusses on providing photography skills for those undertaking archival research. Dr Oliver Dunn has experience spanning a decade filming documents for major academic research projects. He will go over practical approaches to finding and ordering materials in the archive, methods of handling and filming them, digital file storage, and transcription strategies. The focus is very much on low-tech approaches and small budgets. We’ll consider best uses of smartphones, digital cameras and tripods. The session is held in the IT training room at the University Library.

Correspondence collections are a unique window into the social networks of prominent historical figures. With the digitisation and encoding of personal letters, researchers have at their disposal a wealth of relational data, which can be studied using social network analysis.

This session will introduce and demonstrate foundational concepts, methods and tools in social network analysis using datasets prepared from the Darwin Correspondence collection. Topics covered will include

  • Explanation of the encoding procedures and rationale following the Text Encoding Initiative guidelines
  • Preparation and transformation of .xml files for analysis with an open source data wrangler
  • Rendering of network visualisations using an open source SNA tool

No knowledge of prior knowledge of programming is required, instructions on software to install will be sent out before the session

Image big data are increasingly being used to understand the built and natural environment and to observe behaviours within it. Data sources include satellite and airborne imagery, 360 street views, and fixed video or time lapse traffic and CCTV cameras. While some of these sources are newer than others what has been changing are the quality of the images, the geographical coverage, and the potential for assessing changes over time. At the same time improvements in machine learning have made it possible to turn images into quantitative data at scale.

In this workshop we will explore the challenges that researchers face when using images at scale to understand environments and behaviours, building on work at Cambridge to estimate cycling levels, using satellite data to estimate motor vehicle volume, and planned data collection in Kenya using 360 cameras.

[Back to top]