<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dw="https://www.dreamwidth.org">
  <id>tag:dreamwidth.org,2009-04-11:37793</id>
  <title>Ramblings on Librarianship, Technology, and Academia</title>
  <subtitle>The Australasian Journal of Me</subtitle>
  <author>
    <name>deborah</name>
  </author>
  <link rel="alternate" type="text/html" href="https://deborah.dreamwidth.org/"/>
  <link rel="self" type="text/xml" href="https://deborah.dreamwidth.org/data/atom"/>
  <updated>2019-06-04T00:47:51Z</updated>
  <dw:journal username="deborah" type="personal"/>
  <entry>
    <id>tag:dreamwidth.org,2009-04-11:37793:85773</id>
    <link rel="alternate" type="text/html" href="https://deborah.dreamwidth.org/85773.html"/>
    <link rel="self" type="text/xml" href="https://deborah.dreamwidth.org/data/atom/?itemid=85773"/>
    <title>speech recognition at last? I have so many questions.</title>
    <published>2019-06-03T22:14:17Z</published>
    <updated>2019-06-04T00:47:51Z</updated>
    <category term="user interfaces"/>
    <category term="windows"/>
    <category term="accessibility"/>
    <category term="mobile devices"/>
    <category term="macos"/>
    <category term="privacy"/>
    <dw:security>public</dw:security>
    <dw:reply-count>2</dw:reply-count>
    <content type="html">At WWDC (the annual Apple developers' conference), Apple announced something which &lt;i&gt;might&lt;/i&gt; be full command-and-control speech recognition for the Mac at last, for the first time.&lt;a href="#note1" aria-label="Footnote 1" role="doc-noteref"&gt;[1]&lt;/a&gt;&lt;a name="ref1"&gt;&lt;/a&gt; None of the regular tech journalists are asking the questions I desperately want to know, however. &lt;br /&gt;&lt;br /&gt;Most of my questions boil down to this:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;How much did the Apple developers and designers of this product work with users of Dragon NaturallySpeaking for Windows (DNS), DragonDictate for Mac (DD), and Windows Speech Recognition (WSR)? &lt;br /&gt;&lt;br /&gt;How much did they learn about what the speech recognition community already expects as a minimal baseline, as well as what speech recognition users have been lacking in our current tools?&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;Because how Apple answers that first question will inform the answers to all these details:&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;br /&gt; &lt;li&gt;Will this allow complete hands-free command and control? In other words, will users be able to control their computer without a mouse, a keyboard, a virtual keyboard, a switch, or mouse emulation?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it give access to the menus, graphical icons, or any other aspects of the standard OS X desktop chrome, as long as the code is written using Apple standards?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;How will it work with tools that are not natively enabled to use it? For example, if I install an application that runs in a virtual machine (eg. Eclipse or Slack), what aspects of this speech recognition will be available and what won't?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it require the cloud or network access to work?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it have a trainable voice model?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it have a configurable vocabulary?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it be programmable, either with simple macros or with complex third-party tools?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;In what languages will it be available?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will the mobile version require a physical trigger to access, as with the built in microphone-icon-to-dictate currently available on iOS? Can it be left on all the time?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;How will the privacy be guaranteed for any always-listening aspects?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Does it integrate with Apple VoiceOver?&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="cut-wrapper"&gt;&lt;span style="display: none;" id="span-cuttag___1" class="cuttag"&gt;&lt;/span&gt;&lt;b class="cut-open"&gt;(&amp;nbsp;&lt;/b&gt;&lt;b class="cut-text"&gt;&lt;a href="https://deborah.dreamwidth.org/85773.html#cutid1"&gt;For context, the answers to these questions for DNS and WSR&lt;/a&gt;&lt;/b&gt;&lt;b class="cut-close"&gt;&amp;nbsp;)&lt;/b&gt;&lt;/span&gt;&lt;div style="display: none;" id="div-cuttag___1" aria-live="assertive"&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;What other questions do people have?&lt;br /&gt;&lt;br /&gt;&lt;span class="cut-wrapper"&gt;&lt;span style="display: none;" id="span-cuttag___2" class="cuttag"&gt;&lt;/span&gt;&lt;b class="cut-open"&gt;(&amp;nbsp;&lt;/b&gt;&lt;b class="cut-text"&gt;&lt;a href="https://deborah.dreamwidth.org/85773.html#cutid2"&gt;Endnotes&lt;/a&gt;&lt;/b&gt;&lt;b class="cut-close"&gt;&amp;nbsp;)&lt;/b&gt;&lt;/span&gt;&lt;div style="display: none;" id="div-cuttag___2" aria-live="assertive"&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=deborah&amp;ditemid=85773" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
</feed>
