<?xml version='1.0' encoding='utf-8' ?>

<rss version='2.0' xmlns:lj='http://www.livejournal.org/rss/lj/1.0/' xmlns:atom10='http://www.w3.org/2005/Atom'>
<channel>
  <title>Ramblings on Librarianship, Technology, and Academia</title>
  <link>https://deborah.dreamwidth.org/</link>
  <description>Ramblings on Librarianship, Technology, and Academia - Dreamwidth Studios</description>
  <lastBuildDate>Mon, 03 Jun 2019 22:14:17 GMT</lastBuildDate>
  <generator>LiveJournal / Dreamwidth Studios</generator>
  <lj:journal>deborah</lj:journal>
  <lj:journaltype>personal</lj:journaltype>
  <image>
    <url>https://v2.dreamwidth.org/15770/37793</url>
    <title>Ramblings on Librarianship, Technology, and Academia</title>
    <link>https://deborah.dreamwidth.org/</link>
    <width>100</width>
    <height>100</height>
  </image>

<item>
  <guid isPermaLink='true'>https://deborah.dreamwidth.org/85773.html</guid>
  <pubDate>Mon, 03 Jun 2019 22:14:17 GMT</pubDate>
  <title>speech recognition at last? I have so many questions.</title>
  <link>https://deborah.dreamwidth.org/85773.html</link>
  <description>At WWDC (the annual Apple developers&apos; conference), Apple announced something which &lt;i&gt;might&lt;/i&gt; be full command-and-control speech recognition for the Mac at last, for the first time.&lt;a href=&quot;#note1&quot; aria-label=&quot;Footnote 1&quot; role=&quot;doc-noteref&quot;&gt;[1]&lt;/a&gt;&lt;a name=&quot;ref1&quot;&gt;&lt;/a&gt; None of the regular tech journalists are asking the questions I desperately want to know, however. &lt;br /&gt;&lt;br /&gt;Most of my questions boil down to this:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;How much did the Apple developers and designers of this product work with users of Dragon NaturallySpeaking for Windows (DNS), DragonDictate for Mac (DD), and Windows Speech Recognition (WSR)? &lt;br /&gt;&lt;br /&gt;How much did they learn about what the speech recognition community already expects as a minimal baseline, as well as what speech recognition users have been lacking in our current tools?&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;Because how Apple answers that first question will inform the answers to all these details:&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;br /&gt; &lt;li&gt;Will this allow complete hands-free command and control? In other words, will users be able to control their computer without a mouse, a keyboard, a virtual keyboard, a switch, or mouse emulation?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it give access to the menus, graphical icons, or any other aspects of the standard OS X desktop chrome, as long as the code is written using Apple standards?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;How will it work with tools that are not natively enabled to use it? For example, if I install an application that runs in a virtual machine (eg. Eclipse or Slack), what aspects of this speech recognition will be available and what won&apos;t?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it require the cloud or network access to work?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it have a trainable voice model?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it have a configurable vocabulary?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will it be programmable, either with simple macros or with complex third-party tools?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;In what languages will it be available?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Will the mobile version require a physical trigger to access, as with the built in microphone-icon-to-dictate currently available on iOS? Can it be left on all the time?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;How will the privacy be guaranteed for any always-listening aspects?&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Does it integrate with Apple VoiceOver?&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class=&quot;cut-wrapper&quot;&gt;&lt;span style=&quot;display: none;&quot; id=&quot;span-cuttag___1&quot; class=&quot;cuttag&quot;&gt;&lt;/span&gt;&lt;b class=&quot;cut-open&quot;&gt;(&amp;nbsp;&lt;/b&gt;&lt;b class=&quot;cut-text&quot;&gt;&lt;a href=&quot;https://deborah.dreamwidth.org/85773.html#cutid1&quot;&gt;For context, the answers to these questions for DNS and WSR&lt;/a&gt;&lt;/b&gt;&lt;b class=&quot;cut-close&quot;&gt;&amp;nbsp;)&lt;/b&gt;&lt;/span&gt;&lt;div style=&quot;display: none;&quot; id=&quot;div-cuttag___1&quot; aria-live=&quot;assertive&quot;&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;What other questions do people have?&lt;br /&gt;&lt;br /&gt;&lt;span class=&quot;cut-wrapper&quot;&gt;&lt;span style=&quot;display: none;&quot; id=&quot;span-cuttag___2&quot; class=&quot;cuttag&quot;&gt;&lt;/span&gt;&lt;b class=&quot;cut-open&quot;&gt;(&amp;nbsp;&lt;/b&gt;&lt;b class=&quot;cut-text&quot;&gt;&lt;a href=&quot;https://deborah.dreamwidth.org/85773.html#cutid2&quot;&gt;Endnotes&lt;/a&gt;&lt;/b&gt;&lt;b class=&quot;cut-close&quot;&gt;&amp;nbsp;)&lt;/b&gt;&lt;/span&gt;&lt;div style=&quot;display: none;&quot; id=&quot;div-cuttag___2&quot; aria-live=&quot;assertive&quot;&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src=&quot;https://www.dreamwidth.org/tools/commentcount?user=deborah&amp;ditemid=85773&quot; width=&quot;30&quot; height=&quot;12&quot; alt=&quot;comment count unavailable&quot; style=&quot;vertical-align: middle;&quot;/&gt; comments</description>
  <comments>https://deborah.dreamwidth.org/85773.html</comments>
  <category>windows</category>
  <category>macos</category>
  <category>user interfaces</category>
  <category>privacy</category>
  <category>mobile devices</category>
  <category>accessibility</category>
  <lj:security>public</lj:security>
  <lj:reply-count>2</lj:reply-count>
</item>
</channel>
</rss>
