2008-02-22

deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
2008-02-22 02:07 pm

real preservation

I've been getting increasingly concerned about what I see as a too-shallow view of sustainability in digital preservation. There's been a lot of lip service paid over the last few years to preservation, and I have certainly heard talks by grant-funding agencies in which they explained that they are now only funding grants which have sustainability written into the grant structure. Yet time and time again, I see soft money being awarded to projects for which the project administrators clearly have only the vaguest idea of what sustainability really means in a software environment.

I don't see this as anyone's fault, mind you. Software developers and IT folks aren't used to thinking of software projects in terms of Permanence. In the traditional software world, the only way something is going to be around forever is if it's going to be used all that time -- for example, a financial application which is in constant use needs to be constantly up. But archival digital preservation has a very different sense of permanence. For us, permanence might mean that you build a digital archival collection once, don't touch its content again for 10 years, but can still discover all of its preserved content at the end of those 10 years.

Meanwhile, in Internet time, a project which has been around for two years is clearly well past its prime and ready to be retired.

Repository managers are putting all of this great work into the repository layer* of preservation: handles and DOIs, PRESERV and PRONOM, JHOVE and audit trails and the RLG checklist. But meanwhile, all of these collections of digital objects -- many of them funded by limited-duration soft money -- are running on operating systems which will need to be upgraded and patched as time passes, on hardware which will need to be upgraded and repaired as time passes, on networks which require maintenance. Software requires sustenance and maintenance, and no project which doesn't take into account that such maintenance requires skilled technical people in perpetuity can succeed as perpetual preservation. Real sustainability means commitment from and communication with the programmers and sysadmins. It requires that the techies understand an archivist's notion of "permanence", and that the librarians and archivists (and grant agencies) understand that a computer needs more than electricity to keep running -- it needs regular care and feeding.

(This, by the way, is one of the reasons I'm so excited by the OTW Archive of One's Own and the Transformative Works and Cultures journal. The individuals responsible for the archive and the journal *do* have a real understanding of and commitment to permanence down to the hardware and network provider level. Admittedly, it's a volunteer-run, donation-supported organization, so its sustainability is an open question. But it's a question the OTW Board is wholeheartedly investigating, because they understand its importance.)

*I'm somewhat tempted to make an archival model of preservation that follows the layered structure of the OSI model of network communication. Collection Policy layer, Accession layer, Content layer, Descriptive Metadata layer, Preservation Metadata layer, Application layer, Operating System layer, Hardware layer. Then you could make sure any new preservation project has all of those checkboxes ticked. Sort of an uber-simplification of the RLG Checklist, in a nice, nerd-friendly format.
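
For the nerds in the audience, here is a minimal sketch of what that checkbox list might look like in Python. The layer names come straight from the footnote above; everything else (the `PreservationPlan` class, the `assign` and `unticked` methods, the idea that "ticked" means "somebody is named as responsible") is made up for illustration and is not part of the RLG Checklist or any existing tool.

```python
from dataclasses import dataclass, field

# Layer names from the footnote, ordered from policy (top) to hardware (bottom).
LAYERS = (
    "Collection Policy",
    "Accession",
    "Content",
    "Descriptive Metadata",
    "Preservation Metadata",
    "Application",
    "Operating System",
    "Hardware",
)


@dataclass
class PreservationPlan:
    """Hypothetical record of who is responsible for each layer of a project."""

    project: str
    owners: dict = field(default_factory=dict)  # layer name -> person or team

    def assign(self, layer, owner):
        """Tick a layer's checkbox by naming who maintains it."""
        if layer not in LAYERS:
            raise ValueError(f"unknown layer: {layer}")
        self.owners[layer] = owner

    def unticked(self):
        """Return the layers nobody is responsible for, top to bottom."""
        return [layer for layer in LAYERS if not self.owners.get(layer)]


# A project that has thought about content and metadata but not the machines.
plan = PreservationPlan("example digital collection")
plan.assign("Content", "archivist")
plan.assign("Descriptive Metadata", "cataloger")
print(plan.unticked())
```

A project like the one in this example would still have six boxes unticked, including the operating system and hardware layers that soft-money projects tend to forget.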
2008-02-22 02:40 pm

many links

The only way to get all these tabs out of my browser is to actually post some links.

This is one where I've been saying for a while, "somebody has got to be working on this." Omeka is creating a free platform to help people create curated digital exhibits. The next thing that needs to happen is a hosted service -- not a CONTENTdm-style hosted service, but a real hosted curation service including preservation planning.

Republicans utterly refuse to compromise on telecom immunity, while the president insists that anyone who doesn't grant immunity to the telecommunications companies wants the terrorists to win.

Why students want simplicity and why it fails them when it comes to research is a good introduction to the idea that the skills learned in googling for facts are not actually going to serve a student who needs to learn how to do complex research. Sometimes we need to adapt to user-perceived needs, but sometimes, as academic or school librarians, our job is to teach our patrons. The trick lies in choosing the right balance.

It doesn't do us much good to have an independent, bipartisan Privacy and Civil Liberties Oversight Board if the President can make it vanish simply by not appointing any members.

The MPAA's numbers about the effect of campus music piracy were vastly overblown. Only about 15% of their losses were due to campus downloading, and only about 3% probably came from on-campus networks, but the record companies and Congress are bullying the universities to police their networks anyway.

These pictures are very beautiful and very, very sad. "It will rise from ashes" is a blog post and accompanying Flickr set of images from an abandoned Detroit school system book depository. Trees growing from the soil created by burned then rained upon books; it's a kind of renewal, but renewal not from the typical post-apocalyptic vision of a rich industrial culture, but renewal from... well, I don't want to be too horribly melodramatic and say shattered potentials, so I don't know how to finish the sentence.