Category Archives: Databases

The present and future of digitisation projects: an interview with George Williams and Seth Denbo

I was very lucky to have the chance to talk to two of the leading voices on digital humanities when they very kindly agreed to take part in a filmed discussion at ASECS annual meeting, in San Antonio, March 2012. George Williams is an associate professor of English (specialising in the 18thC) at the University of South Carolina and will be familiar to many from the ProfHacker pieces in The Chronicle of Higher Education; Seth Denbo is a historian of eighteenth-century England and involved with MITH, Project Bamboo, the IHR Seminar in Digital History and is on the faculty of the Maryland Institue for Technology in the Humanities. (Using iMovie to film the discussion in my hotel room was a bit of an experiment – which is by way of an apology for any impairment in sound and /or visual quality. The interview is split into two parts).

[youtube http://www.youtube.com/watch?v=lapMmaa2PmY&w=560&h=315]

[youtube http://www.youtube.com/watch?v=eWWeYIxIBVM&w=560&h=315]

Improving ECCO part 2

Part of the excitement is the further option to create – and be credited as editor of – an entire text from your corrected OCR text. Gale’s release of the texts though 18thConnect to be corrected by TypeWright aims to have those texts re-imported in Gale’s database. But it seems Gale is also offering the chance for those corrected texts to be published either (possibly via 18thConnect or at least peer-reviewed by them) as digital editions or via Gale as a print text.

Now this is the odd point – what does Gale get out of releasing into the wilds of the open-access world its texts? ECCO isn’t cheap and a number of universities have spent a considerable amount of money for it; even JISC’s one-stop interface for both EEBO and ECCO isn’t much cheaper. Gale’s income would presumably suffer. One might be tempted to think that both of those moves to wider access suggest Gale’s anxiety over the continuing authority of ECCO (with its old OCR software, its reliance on microfilmed texts and small images) and the sustainability of this kind of database publishing model. One need only look at databases such as London Lives, or the William Godwin’s Diaries or the Digital Miscellenies Index to see where digital resources are going. It looks as if Gale is trying to maintain ECCO’s relevance by opening it up to wider access, paradoxically undermining potential income. Perhaps they figure that the market for ECCO is saturated and that there is nothing more to loose: they would reap the kudos from keeping up with the general thrust of more recent digital resources towards open access (there’s probably a buzzier-sounding phrase than that, I’m sure). As for those texts that would be released for publication outside of ECCO, they might figure that this would amount to only selected areas or authors and that the vast majority of texts on ECCO (non-canonical and found only through specialist searching) would be unaffected and so would continue to be the USP of ECCO.

Interesting times.