deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
[personal profile] deborah
I'm currently trying to normalize and shift into comma separated values files the disambiguated name lists created by four different students who don't work for my department, with whom I'm not allowed to communicate, and for whom I'm not allowed to create standard documentation. (Don't ask.) After title casing everything, my current (incomplete) Vim regular expression is: (screenreader users be warned you should skip!)

:%s#\(<\([^>]*\)>\( \)\)*\(\(\((\)*\([^)]*\)\()\)*\) \([^{]*\)\)#\2,\7,\9,,,,,,,,,\2\3\7 \9;,MS165.001.010.00001


Yes, this is what happens when the people dealing with metadata that need to be normalized are not being managed by professionals.

(I'm doing this in Vim instead of in Perl because each file is a little bit different, so every time I open one I'm doing some hand manipulation of the data and massaging the regular expression slightly to accommodate the fact that each of the students copes with variant names, titles, and unknown personal or surnames differently.)

This is why we can't have nice things.

Date: 2010-07-28 10:20 pm (UTC)
libskrat: (Default)
From: [personal profile] libskrat
They gave me a "keywords" field which had (variously) something title-ish that wasn't in any way demarcated from a description of varying length (from zero to...), as well as EVERY FREAKING THING THAT WAS IN THE OTHER FIELDS.

SHOOT. ME. NOW.

Date: 2010-08-04 09:10 am (UTC)
jeshyr: Pile of thick books labelled "Geek" (Geek)
From: [personal profile] jeshyr
Let me get this right, you people chose this profession?

Do they hide the details until you graduate and are hired, or were you both on the Really Good Drugs?

Ricky
(who might, under duress, admit that the vim regular expression ... or the fact that other people do stuff like that ... was really hot, in a geeky way)

Custom Text

Gnomic Utterances. These are traditional, and are set at the head of each section of the Guidebook. The reason for them is lost in the mists of History. They are culled by the Management from a mighty collection of wise sayings probably compiled by a SAGE—probably called Ka’a Orto’o—some centuries before the Tour begins. The Rule is that no Utterance has anything whatsoever to do with the section it precedes. Nor, of course, has it anything to do with Gnomes.

Expand Cut Tags

No cut tags
Page generated Jan. 5th, 2026 10:50 pm
Powered by Dreamwidth Studios