Tuesday 24 February – Tracking the Emergence of New Words across Time and Space

The IHR Seminar in Digital History would like to welcome you to its first seminar of 2015.

Presenters:  Jack Grieve (Aston)

Title:  Tracking the Emergence of New Words across Time and Space

Date:  24 February 2015

Time:  5:15 PM (GMT)

Venue:  John S Cohen Room 203, 2nd floor, IHR, North block, Senate House or live online via the Digital History Seminar blog.

Live Stream

Slide Show (Coming soon)

Abstract: Very little is known about how new words spread in language. New words are regularly identified by lexicographers, linguists, and the news media, but until recently we have not had access to sufficiently large geo-coded and time-stamped datasets that would allow for the detailed analysis of the geographical diffusion of lexical items in real time. However, with the rise of social media and smart phones, it is now possible to compile very large corpora that meet these requirements, allowing new words to be identified and mapped across time and space for the first time. In this presentation, I identify numerous newly emerging words based on a multi-billion word corpus of American tweets from 2013-2014 and map their geographical spread across the United States.

Seminars are normally streamed live online on this blog and on YouTube. To keep in touch, follow us on Twitter (@IHRDigHist) or at the hashtag #dhist.

Posted in Events | Leave a comment

Citizen history and its discontents: Postscript

By Matt Phillpott

There are an increasing number of crowdsourcing projects making claims about being ‘citizen history’. Old Weather, one of the more successful crowdsourcing projects of recent years, has started to use the term, and Zooniverse (the company behind it) has taken the same infrastructure this year for a World War One project called Operation War Diary. Then there is the project, Children of the Lodz Ghetto, in which volunteers undertake actual research tasks, helping to track down the names and lives of school children who fell victim to the Holocaust. By its nature this research is often complex, as names vary and change, and sources come in a variety of languages.

Citizen history is the current ‘buzz-word’, and its use is a claim to be moving beyond crowdsourcing by also offering an opportunity to learn and master, collaboratively and co-operatively, the skills of a historian.

In the third talk of this year’s Digital History seminar, Mia Ridge from the Open University shared her research into crowdsourcing and citizen history projects and asked whether they are really helping people to become historians or if they are, in actuality, overstating their contribution. As Mia herself put it, ‘can citizen history projects succeed without communities of experts and peers to nurture sparks of historical curiosity and support novice historians in learning the skills of the discipline?’

The role of the ‘expert’?

Mia was very careful to stress that the importance of ‘expert’ historians being involved at the beginning, and throughout the project, is not to suggest that the grassroots community that these projects hope to build cannot, and do not, manage to deal with complex historical data and interpretation on their own.

When citizen history projects work well, the forums, wikis and other online spaces become a hive of co-operative discussion and collaborative learning and training. However, these communities are built upon learning about sources and their interpretation in a collaborative environment, and there are times when professional historians can offer advice where the sources are difficult or no other answer is forthcoming, or can pick up and highlight uncovered details that are of wider historical significance. Generally, people who take to citizen history projects are there to discover the past and learn how to use the sources, and the input of professional historians is valued as part of that process.

Often, however, the role of the professional or ‘expert’ historian is largely hidden away. Mia noted that professional historians often take an active role in the forums near the beginning of a project to help get things started, but later on, whilst they continue to check the forums, their input reduces as teaching, research, and funding applications, by necessity, take precedence. Ideally this shouldn’t happen, but there are very real obstacles that limit the time and effort professional historians can give to citizen history projects. How we overcome this difficulty is not an easy question to answer.

What makes citizen history a success?

For a citizen history project to become successful not just in developing a resource of research materials through crowdsourcing, but also in enabling the development of historians, it is essential to build a critical mass of discussion and usage, and to expose people to historical materials that are potentially interesting. It is also important to include expert input, as this can transform the process.

Essentially, some citizen history projects are really crowdsourcing and are perhaps misusing the term, whilst others fail to reach their goals for one reason or another. Others are highly successful. Yet there is a risk in these projects that citizen historians will come to be seen as faux historians, with limited skills and abilities, whereas in reality there is a range of citizen historians, from those just beginning the process to those who have built up the skills and knowledge required of any other historian.

Mia ended her talk with a call for crowdsourcing and citizen history project organisers to be more careful with the terminology they use. Signing up to a project and doing a bit of transcription work does not make that person a historian, but this can become the end result. Projects need to be clear about what it is they are offering and asking, and what exactly is required to become a citizen historian rather than, perhaps, a citizen transcriber.

Posted in Postscript | 1 Comment

Digital Humanities Project, ‘Mapping Eighteenth-Century Tourism in the English Lakes’

On Wednesday 26 November 2014, the Digital History seminar is co-hosting a seminar with the British History in the Long-Eighteenth Century seminar. Here are the details:

Title: Mapping Eighteenth-Century Tourism in the English Lakes

Speakers: Ian Gregory and Chris Donaldson (Lancaster)

Location: Wolfson Room NB01, Basement IHR, North Block, Senate House

Time: Wednesday 26 November 2014, 5.15pm

Posted in Events | Leave a comment

Tuesday 18 November – Citizen History and its discontents

The IHR Seminar in Digital History would like to welcome you to its third seminar of the 2014 autumn term.

Presenters:  Mia Ridge (Open University)

Title:  Citizen History and its discontents

Date:  18 November, 2014

Time:  5:15 PM (GMT)

Venue:  John S Cohen Room 203, 2nd floor, IHR, North block, Senate House or live online via the Digital History Seminar blog.

Live Stream

Slide Show

Abstract: An increasing number of crowdsourcing projects are making claims about ‘citizen history’ – but are they really helping people become historians, or are they overstating their contribution? Can citizen history projects succeed without communities of experts and peers to nurture sparks of historical curiosity and support novice historians in learning the skills of the discipline? Through a series of case studies this paper offers a critical examination of claims around citizen history.

Seminars are normally streamed live online on this blog and on YouTube. To keep in touch, follow us on Twitter (@IHRDigHist) or at the hashtag #dhist.

Posted in Events | Leave a comment

Interrogating the Archived UK Web – postscript

By Adam Crymble

The second talk of our 2014 Autumn programme took on the challenge of a new type of source for historians: the Internet. Not online sources and databases, but the Internet itself. The first archived copies of the UK web have started to find their way into scholarly hands. Historians now have the ability to look at webpages as sources in themselves, just as we have previously read manuscripts as a window into the past. The web is a corpus rich in details about what we were like and what we thought was important, not that long ago. For a cultural or social historian, it’s a dream.

Peter Webster introduced the UK Web Archive, which is hosted by the British Library and contains snapshots of the UK web (.uk sites) dating back to the 1990s. A team of historians has been given access to see what they can make of this new (and huge) resource. I want to emphasise the experimental aspect of this project, because in many respects I think we learned more about what these scholars couldn’t achieve than what they did achieve.


That’s not a failing in the quality of the scholars themselves. They managed to do exactly what we could hope from them: to test the limits of the historian’s method on a large, messy, digital archive. They’ve done us a great service in finding some of those limits. The question now ahead of us is what we’re going to do about it.


Two of the scholars were on hand to share their experiences. The first was Gareth Millward, whose project explored hyperlinking behaviour towards the website of the Royal National Institute of the Blind (RNIB) in those early days of the web, and tried to uncover why people were creating those hyperlinks.

The second was Richard Deswarte, who used the archive to explore manifestations of Europhobia online, looking particularly for indicators that people in Britain were using the web to express dissatisfaction with the country’s continued role in the EU.


The projects themselves took on interesting questions, which were appropriate, given the type of source. Most interesting for me – and a significant part of both presentations – was the discussion of where they had problems using the corpus. Both scholars complained of noise that made it difficult to identify unique or meaningful mentions. In Millward’s case the noise came in the form of an advertisement in the Guardian for a talking watch that was endorsed by the RNIB. The ad appeared on hundreds of pages, though it really only represented a single match for Millward’s purposes. Deswarte too had trouble with a rotating banner on a newspaper website that dramatically overemphasized the number of meaningful links to an article about Europhobia.
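A minimal sketch of how this kind of noise might be reduced, assuming (hypothetically) that each hit is available as a URL plus the snippet of text around the match: collapse hits whose snippets are identical once case and whitespace are normalised. This is an illustration only, not the approach either scholar took, and the function names, data format and example data are invented.

```python
import hashlib
import re


def normalise(snippet: str) -> str:
    """Lower-case the snippet and collapse runs of whitespace."""
    return re.sub(r"\s+", " ", snippet).strip().lower()


def collapse_duplicates(hits):
    """Group hits by identical normalised snippet text.

    `hits` is an iterable of (url, snippet) pairs (a hypothetical format).
    Returns a dict mapping a snippet fingerprint to the list of URLs
    on which that same snippet appeared.
    """
    groups = {}
    for url, snippet in hits:
        key = hashlib.sha1(normalise(snippet).encode("utf-8")).hexdigest()
        groups.setdefault(key, []).append(url)
    return groups


# Invented example data: the same advertisement text on two pages,
# plus one distinct mention.
example_hits = [
    ("http://example.org/news/1", "Talking watch, endorsed by the RNIB"),
    ("http://example.org/news/2", "talking watch,  endorsed by the RNIB"),
    ("http://example.org/blog/3", "The RNIB site has a useful guide"),
]

groups = collapse_duplicates(example_hits)
print(len(example_hits), "raw hits;", len(groups), "distinct snippets")
```

Exact matching of this kind would only catch verbatim repeats such as a syndicated advertisement; a rotating banner whose text varies slightly would need a fuzzier comparison.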

Both also noted the sheer number of hits they were getting, and Millward in particular emphasized his attempts to get the list down to a size where he could conduct a close reading. He had failed to do so, and is still left with a collection of 39,000 hits. However, both he and Deswarte reflected on that failure, and invoked the language of social scientists and their ideas about representative sampling, which they felt would have been appropriate if they were given the opportunity to tackle this challenge again. That reflection is significant, because it shows both Millward and Deswarte recognized the limits of the historian’s skillset for a project such as this.
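The sampling idea can be illustrated with a simple random sample: a fixed-size subset drawn so that every hit is equally likely to be chosen, which could then be read closely. The sketch below assumes the hits are just a list of identifiers; the sample size of 400 and the seed are arbitrary choices for illustration, not figures from the talk.

```python
import random


def draw_sample(hits, size, seed=0):
    """Return a simple random sample of `size` hits without replacement.

    A fixed seed makes the sample reproducible, so that others can
    re-draw and check the same subset.
    """
    rng = random.Random(seed)
    return rng.sample(hits, size)


# Stand-in identifiers for a collection of 39,000 hits.
all_hits = list(range(39_000))
sample = draw_sample(all_hits, size=400)
print(len(sample), "hits selected for close reading")
```

A stratified design (for example, sampling separately within each year of the archive) might be more appropriate where hits are unevenly spread, but that would depend on the research question.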

However, I think we can push those limits further. The very notion that we would do a close reading of the Internet is one that I think only historians would suggest. It shows how deeply the value of close reading is held in the profession, even if it proves entirely inappropriate. We need to move on from that belief: that you can only know something if you’ve read it carefully. If we hold on to this mentality we’re going to lose our chance to discover anything at scale. We’ll be unable to pursue the longue durée that Guldi advocated for in our previous seminar.

Sitting in the audience I couldn’t help but think that the solution wasn’t in sampling and close reading. It was in corpus linguistics, data manipulation, clustering algorithms, and distant reading: skills that are so rarely taught in our history programmes, but that this experiment made clear need to become part of our disciplinary toolkit. And if not our toolkit, then we need to ingrain the value of collaboration. If you can’t do it, find someone who can and who wants to work with you.

The days of the lone scholar intent on close reading are numbered. The UK Web Archive has shown us that. So what are we going to do about it?

Adam Crymble is a convenor of the Digital History seminar at the IHR and a lecturer of digital history at the University of Hertfordshire. The UK Web Archive is available to search now. In addition there are a variety of related research projects such as the Big UK Domain Data for the Arts and Humanities (BUDDAH) Project. Analysis into the sustainability of the dataset can be found on the website for the Analytical Access to the Domain Dark Archive (AADDA), and examination of the potential value of the UK Web Domain dataset can be found on the Big Data: Demonstrating the Value of the UK Web Domain Dataset for Social Science Research website.

Posted in Postscript | Leave a comment

Tuesday 4 November – Interrogating the archived UK web: Historians and Social Scientists Research Experiences

The IHR Seminar in Digital History would like to welcome you to its second seminar of the 2014 autumn term.

Presenters:  Dr Gareth Millward (London School of Hygiene and Tropical Medicine), Dr Peter Webster (British Library Web Archiving Team), & Richard Deswarte (UEA).

Title:  ‘Interrogating the archived UK web: Historians and Social Scientists Research Experiences’

Date:  4 November, 2014

Time:  5:15 PM (GMT)

Venue:  John S Cohen Room 203, 2nd floor, IHR, North block, Senate House or live online via the Digital History Seminar blog.

Live Stream: 

Slides: Peter Webster     Richard Deswarte     Gareth Millward (opens in new windows)

Abstract:  The emergence of the WWW has been one of the most profound and influential phenomena of the last twenty years.  One of the dominant features of the WWW is its changing nature both in terms of content and its technological underpinnings.  The content of the WWW is an immense resource full of potential for academic researchers both in its current state and in its previous constantly changing forms.  Over the last decade, in particular, archives of WWW materials have been emerging.  These archives are still very much in a nascent form but are beginning to be made available and to be utilised by a range of scholars.  The UK Web Archive hosted by the British Library is at the forefront of trawling and making available for researchers archived versions of the UK WWW dating back to the 1990s.  It is currently engaged jointly with the Institute of Historical Research (IHR) and the Oxford Internet Institute (OII) in the ‘Big UK Domain Data for the Arts and Humanities Project’ (BUDDAH) where a new research interface is being developed in conjunction with a number of humanities scholars who are at the same time exploring the UK Web Archive to identify its strengths and weaknesses for academic research.  Peter Webster will introduce Web Archiving, the BUDDAH project and the new research interface, while Gareth Millward and Richard Deswarte will relate their experiences in using the resource to research respectively the history of disabled people and accessibility on the WWW, and Euroscepticism.


Dr Gareth Millward is currently a Research Fellow at the Centre for History in Public Health at the London School of Hygiene and Tropical Medicine.  He has research interests in disability and government policy, and more recently notions of the ‘public’ in British vaccination programmes.  For the BUDDAH project he is researching disabled people and the Web.

Richard Deswarte is a Lecturer in Modern European History at UEA with research interests in the European idea and integration, as well as Digital Humanities.  On the BUDDAH project he is examining the presence and rise of Euroscepticism.

Dr Peter Webster is currently the British Library lead on the BUDDAH project and Web Archiving Engagement and Liaison Officer at the BL.  Alongside scholarly interests in Web Archiving and Digital Humanities, Peter researches the history of religion, the Anglican Church and the relation between church, law and state in 19th and 20th century Britain.

Seminars are normally streamed live online on this blog and on YouTube. To keep in touch, follow us on Twitter (@IHRDigHist) or at the hashtag #dhist.


Posted in Uncategorized | Leave a comment

Introducing Paper Machines – postscript

In the welcome surroundings of the refurbished Institute of Historical Research, Jo Guldi (Brown University) kicked off the 2014 Autumn Term programme of the IHR Digital History Seminar. Guldi was in town to discuss The History Manifesto, her new open access book co-authored with David Armitage, and her talk ranged from the public role of historians, the Digital Humanities and new models of publishing to impending environmental catastrophe, the need for deep history, and data processing tools that can help citizens and scholars alike overcome the problems of modern bureaucracy. To see how Guldi wove all these threads together, you’ll need to watch the video below. Here I just want to tease out, in no particular order, a few of the threads that stuck in my mind, threads that pertain to most, if not all, digital history projects that pass through the seminar.

Tools as provocations: Paper Machines is a research tool. But it is also a provocation, an experiment with using large swathes of information to inform historical research in the longue durée, a vantage point – the tool’s makers argue – that historians do not take often enough. The tool, in short, is the argument.

What we need now: As we sit on the precipice of environmental catastrophe, does it not behove us to think about what digital projects we need? Do we want digital projects that analyse art for art’s sake, that recapitulate old research paradigms and do not address problems of a wider, public relevance?

Hypothesis generation: At the heart of Paper Machines is hypothesis generation. It allows the scholar to take a vast paper archive and facet that archive, make visualisations, select where to read closely. How that macro to micro scaling changes the history that is written, how scholarly debates mature to integrate the inevitable discrepancies between interpretations made at these scales is the challenge historians must re-engage with.

Being bold about method: Works that change the focus of disciplines usually open their accounts by stating ‘you missed this because your method was wrong’. Digital history can and should do the same; it can and should be bold about how it comes to the conclusions it does rather than hide the methods, ways, and means that underpin its particular take on historical phenomena.

My partial, incomplete, CC BY notes on the seminar are available on GitHub Gist.

The next Digital History seminar, ‘Interrogating the archived UK web: Historians and Social Scientists Research Experiences’, will take place on 4 November and a full listing of Autumn Term seminars is available on the IHR Website.

James Baker (Curator, Digital Research, British Library)

Creative Commons License
This post is licensed under a Creative Commons Attribution 4.0 International License.

Posted in Postscript | Leave a comment

Tuesday 7 October – Introducing Paper Machines

The IHR Seminar in Digital History is back for another year. We will be announcing our full programme for 2014-15 soon and details about the live stream, but in the meantime we would like to welcome you to our first session of our new programme. Please keep the date free!

Watch the Live Stream here:

 Slide Show

If you would like to take part in the online chat click through to watch the video on YouTube.

Date:  7 October 2014

Time:  5:15 PM (BST=GMT+1)

Venue:  Room 208, Senate House

Speaker: Jo Guldi (Brown University)

Abstract: Historians of the twentieth century have to contend with a technological problem: archives too large to process by traditional methods.  While textual encoding, tagging, and n-grams can reveal certain patterns in digital archives, topic modeling and topic frequency, applied to hand-tailored archives, can help the historian make informed decisions about where in an archive to start looking.  Digital methods, in this way, are driving historians to longer and longer time scales, making it possible for even younger scholars to perform a ‘distant reading’ on big questions that range over nations and centuries.  The talk will follow parts of the argument of The History Manifesto (2014), comparing how a historian’s search for periodization, agency, and causality in the data compares with the use and abuse of digital data in other digital fields.

Speaker Biography: Jo Guldi is author of Roads to Power (2012), What is the Spatial Turn? (2012), The History Manifesto (2014), and the digital toolkit Paper Machines (2012).  She is Hans Rothfels Assistant Professor of Modern Britain and its Empire at Brown University.  Her next project, The Long Land War, examines a century and a half of movements for land and water around the globe.

Posted in Uncategorized | Leave a comment

Next seminar – Tuesday 17 June – Mapping the Medieval Countryside

The IHR Seminar in Digital History would like to welcome you to its final seminar of the 2014 summer term.

Speaker: Dr Matthew Holford, University of Winchester

Title: Mapping the Medieval Countryside: Places, People and Properties in the Inquisitions Post Mortem

Date:  17 June, 2014

Time:  5:15 PM (BST=GMT+1)

Venue:  Athlone Room, 102, Senate House, South Block, First floor, or live online at http://www.livestream.com/historyspot

Abstract: Mapping the Medieval Countryside is a major research project dedicated to the online publication of medieval English inquisitions post mortem (IPMs).

These inquisitions, which recorded the lands held at their deaths by tenants of the crown, comprise the most extensive and important body of source material for landholding in medieval England. They describe the lands held by thousands of families, from nobles to peasants, and are a key source for the history of almost every settlement in England (and of many in Wales).  They are indispensable to local and family historians as well as to academic specialists in areas as diverse as agrarian history and political society.

The project will publish a searchable English translation of the IPMs covering the periods 1236 to 1447 and 1485 to 1509. From 1399 to 1447 the text will be enhanced to enable sophisticated analysis and mapping of the inquisitions’ contents. The online texts will be accompanied by a wealth of commentary and interpretation to enable all potential users to exploit this source easily and effectively.

Speaker biography: Matthew Holford gained his first degree at the University of Cambridge, and an MA and DPhil in Medieval Studies from the University of York. He subsequently worked for the Oxford English Dictionary and held research posts at the Universities of Durham and Cambridge. He is currently Research Officer on the AHRC-funded Inquisitions Post Mortem Project.

Posted in Uncategorized | Leave a comment

Next seminar – Tuesday 3 June 2014 – Digitising the First World War: opportunities and challenges

The IHR Seminar in Digital History would like to welcome you to its first seminar of the 2014 summer term.

Speaker:  Professor Sir Deian Hopkin (President of the National Library of Wales)

Title:  Digitising the First World War: Opportunities and Challenges

Date:  3 June, 2014

Time:  5:15 PM (BST=GMT+1)

Venue:  Athlone Room, 102, Senate House, South Block, First floor, or live online at HistorySpot

Abstract:  One of the most important legacies of the commemoration of the First World War will be an extensive range of new digital archives.  The Imperial War Museum is leading a partnership of many hundreds of organisations, many of whom are involved in capturing records, visual artefacts, memoirs and much else.  The National Archives now offers a wide variety of resources, from war diaries and nurses’ records to interviews with prisoners of war and records of military service appeal tribunals and has launched a crowd-sourcing site to identify data contained within war diaries.  The National Library of Wales hosts the People’s Collection, also a crowd-sourcing platform, which enables individuals and organisations to upload diaries, letters, photographs and other artefacts, and a dedicated website provides searchable access to Welsh newspapers during the war, part of a much larger collection of Welsh Newspapers Online.  And there is much else, on the same lines, taking place in libraries, record offices and among informal groups across the country.

In his acclaimed book, Capital in the Twenty-First Century, Thomas Piketty pays a particular debt to improvements in the technology of research, most specifically computers, which enabled him to process data on a huge scale and offer a new synthesis; indeed he claims his work to be as much about history as economics.  Twenty years ago, there was a rush of enthusiasm for the use of computing technology by historians.  Since then, despite huge technical advances and a communications revolution, there is a sense that most historians have remained aloof from these new developments.  Some of the tools available in the 1980s and 1990s have not evolved and there is much less written nowadays about techniques and methodology; indeed there appears to be little provision for historians to develop the particular skills needed to exploit rich digital archives, especially structured data.

While the new resources appear to offer exciting prospects, are we any nearer being able to exploit them?  This presentation will discuss the opportunities that are now available and the challenges that still remain.

Speaker:  Professor Sir Deian Hopkin spent 43 years in higher education, retiring as Vice Chancellor of London South Bank University in 2009. He was a co-founder of the Association of History and Computing and active in the CTI, the History Data Archive and other initiatives in the 1980s and 1990s. He is currently President of the National Library of Wales, a trustee of the IHR Development Trust and Chair of the Wales Programme Committee for the First World War Centenary.

Seminars are streamed live online at HistorySpot. To keep in touch, follow us on Twitter (@IHRDigHist) or at the hashtag #dhist.

Posted in Uncategorized | Leave a comment