deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
At WWDC (the annual Apple developers' conference), Apple announced something which might be full command-and-control speech recognition for the Mac at last, for the first time.[1] None of the regular tech journalists are asking the questions I desperately want to know, however.

Most of my questions boil down to this:

How much did the Apple developers and designers of this product work with users of Dragon NaturallySpeaking for Windows (DNS), DragonDictate for Mac (DD), and Windows Speech Recognition (WSR)?

How much did they learn about what the speech recognition community already expects as a minimal baseline, as well as what speech recognition users have been lacking in our current tools?


Because how Apple answers that first question will inform the answers to all these details:


  1. Will this allow complete hands-free command and control? In other words, will users be able to control their computer without a mouse, a keyboard, a virtual keyboard, a switch, or mouse emulation?

  2. Will it give access to the menus, graphical icons, or any other aspects of the standard OS X desktop chrome, as long as the code is written using Apple standards?

  3. How will it work with tools that are not natively enabled to use it? For example, if I install an application that runs in a virtual machine (eg. Eclipse or Slack), what aspects of this speech recognition will be available and what won't?

  4. Will it require the cloud or network access to work?

  5. Will it have a trainable voice model?

  6. Will it have a configurable vocabulary?

  7. Will it be programmable, either with simple macros or with complex third-party tools?

  8. In what languages will it be available?

  9. Will the mobile version require a physical trigger to access, as with the built in microphone-icon-to-dictate currently available on iOS? Can it be left on all the time?

  10. How will the privacy be guaranteed for any always-listening aspects?

  11. Does it integrate with Apple VoiceOver?


For context, the answers to these questions for DNS and WSR )

What other questions do people have?

Endnotes )
deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
I'm so honored to be on the ALSC/Booklist/YALSA Odyssey award for excellence in audiobook production selection committee for 2013. I won't be able to talk about it, but rest assured I am listening to some amazing audiobooks.

Andrea Horbinski's "Madge, in Thy Orisons…", over at the Transformative Works and Cultures Symposium blog, starts with the fascinating ways in which Madonna's Super Bowl halftime performance was clearly drawing upon Luminosity's 300 vid, and comes to some thought-provoking conclusions about the limitations of transformative works:
My point here is not so much that all of this is anything new (it’s not), but rather that viewing the vid and the halftime show together provides a textbook example of the ways in which fandom (and any pop culture critique based in pop culture itself), and vidding in particular, is limited by its working, in some senses, with found objects.


The Free Government Information Blog, in light of the shutdown of Scroogle, talks about privacy-protecting search engines. I've personally come to love Duck Duck Go -- I came for the privacy and stayed for the simplicity and usability.
deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
Some library, book, archives, records, baseball fandom, and government information musings and links just so I can clear the tabs out of my browser again: Cut to save your screen real estate )

many links

Feb. 22nd, 2008 02:40 pm
deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
The only way to get all these tabs out of my browser is to actually post some links.

This is one I've been saying for awhile "somebody has got to be working on this". Omeka is creating a free platform to help people create curated digital exhibits. The next thing that needs to happen is a hosted service -- not CONTENTdm style hosted service, but a real hosted curation service including preservation planning.

Republicans utterly refuse to compromise on telecom immunity, while the president insists that anyone who doesn't grant immunity to the telecommunications companies want the terrorists to win.

Why students want simplicity and why it fails them when it comes to research is a good introduction to the idea that the skills learned in googling for facts are not actually going to serve a student who needs to learn how to do complex research. Sometimes we need to adapt to user-perceived needs, but sometimes, as academic or school librarians, our job is to teach our patrons. The trick lies in choosing the right balance.

It doesn't do us much good to have an independent, bipartisan Privacy and Civil Liberties Oversight Board if the President can make it vanish simply by not appointing any members.

The MPAA's numbers about the effect of campus music piracy were vastly overblown. Only about 15% of their losses were due to campus downloading, and only about 3% probably came from on campus networks, but the record companies and Congress are bullying the universities to police anyway.

These pictures are very beautiful and very, very sad. "It will rise from ashes" is a blog post and accompanying Flickr set of images from an abandoned Detroit school system book depository. Trees growing from the soil created by burned then rained upon books; it's a kind of renewal, but renewal not from the typical post-apocalyptic vision of a rich industrial culture, but renewal from... well, I don't want to be too horribly melodramatic and say shattered potentials, so I don't know how to finish the sentence.
deborah: the Library of Congress cataloging numbers for children's literature, technology, and library science (Default)
Just a quick note: I have a big girly crush on Brewster Kahle, and he's not even here.

Opening Plenary on getting books online )

Interoperability panel )

Jonathan Zittrain on privacy )

Joanne Kaczmarek on the RLG Audit checklist )

This isn't every presentation that I liked, but most of the others I enjoyed were displays of clever software products, hardware display, or metadata tools (though I am fasincated by the project of "Exploring Erotics in Emily Dickinson’s Correspondence with Text Mining and Visual Interfaces") and I'm not sure how much there is to blog on them.

Oh, also, to my fellow presenters. If you are going to do a demo, get some capture software and make a video of yourself doing the demo. You're all either computer or library professionals, and should know better than to trust internet connections, computers, and A/V systems to work on demand. The demos that were pre-recorded went smoothly, and for many of the live demos we lost any real understanding of the software because you gut hung up on the failing demo.

Syndicate

RSS Atom

Custom Text

Gnomic Utterances. These are traditional, and are set at the head of each section of the Guidebook. The reason for them is lost in the mists of History. They are culled by the Management from a mighty collection of wise sayings probably compiled by a SAGE—probably called Ka’a Orto’o—some centuries before the Tour begins. The Rule is that no Utterance has anything whatsoever to do with the section it precedes. Nor, of course, has it anything to do with Gnomes.

Expand Cut Tags

No cut tags
Page generated Jul. 4th, 2025 04:18 pm
Powered by Dreamwidth Studios