Instructor-led course

Provided by: Cambridge Digital Humanities


This course is not scheduled to run.





Register your interest - if you would be interested in additional dates being scheduled.


Events available

Sources to Data


Description

We are currently reformatting our Learning programme for remote teaching. This will require some rescheduling; bookings will reopen and new sessions will be created for online courses as soon as possible. In the interim, we encourage you to register your interest so that you are notified of the new schedule. We hope to run many of our courses online, but this depends on staff availability and resources, so we may have to postpone or cancel some sessions.

Archives typically hold records containing enormous quantities of data presented in a variety of scribal and print formats. Extracting this information has traditionally involved long hours of expensive manual data-entry work. Much of this work can now be automated, which could soon open up archives and make unprecedentedly large structured data sets available to curators, researchers, and the public alike. This workshop will examine new methods for collecting historical data from manuscript and printed documents. We will look at archival photography, optical character recognition (OCR), page structure recognition, and new handwritten text recognition systems. Cutting-edge Cambridge research in this field will be demonstrated.

Target audience

Post-graduate researchers and staff at the University of Cambridge

Format

Presentation and group discussion

Theme
Machine Reading the Archive
