- Issue
- Journal of Siberian Federal University. Humanities & Social Sciences. 2016 9 (4)
- Authors
- Gregory, Ian; Atkinson, Paul; Hardie, Andrew; Joulain-Jay, Amelia; Kershaw, Daniel; Porter, Catherine; Rayson, Paul; Rupp, CJ
- Contact information
- Gregory, Ian:Lancaster University Lancaster LA1 4YT United Kingdom; Atkinson, Paul:Lancaster University Lancaster LA1 4YT United Kingdom; Hardie, Andrew:Lancaster University Lancaster LA1 4YT United Kingdom; Joulain-Jay, Amelia:Lancaster University Lancaster LA1 4YT United Kingdom; Kershaw, Daniel:Lancaster University Lancaster LA1 4YT United Kingdom; Porter, Catherine:Lancaster University Lancaster LA1 4YT United Kingdom; Rayson, Paul:Lancaster University Lancaster LA1 4YT United Kingdom; Rupp, CJ:Lancaster University Lancaster LA1 4YT United Kingdom;E-mail:
- Keywords
- Corpora; GIS; Resource enhancement; Research Methods; OCR quality; GIS
- Abstract
It is increasingly acknowledged that the Digital Humanities have placed too much emphasis on data creation and that the major priority should be turning digital sources into contributions to knowledge. While this sounds relatively simple, doing it involves intermediate stages of research that enhance digital sources, develop new methodologies and explore their potential to generate new knowledge from the source. While these stages are familiar in the social sciences they are less so in the humanities. In this paper we explore these stages based on research on the British Library’s Nineteenth Century Newspaper Collection, a corpus of many billion words that has much to offer to our understanding of the nineteenth century but whose size and complexity makes it difficult to work with
- Pages
- 994-1006
- Paper at repository of SibFU
- https://elib.sfu-kras.ru/handle/2311/20211
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).