<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Introspection &#187; CCXML</title>
	<atom:link href="http://blog.jeffhaynie.us/category/ccxml/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.jeffhaynie.us</link>
	<description>Jeff Haynie on business and technology in Silicon Valley</description>
	<lastBuildDate>Mon, 01 Feb 2010 18:32:07 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How will iPhone change the speech industry?</title>
		<link>http://blog.jeffhaynie.us/how-will-iphone-change-the-speech-industry.html</link>
		<comments>http://blog.jeffhaynie.us/how-will-iphone-change-the-speech-industry.html#comments</comments>
		<pubDate>Mon, 15 Jan 2007 18:17:27 +0000</pubDate>
		<dc:creator>Jeff Haynie</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[CCXML]]></category>
		<category><![CDATA[Speech]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[VoIP]]></category>
		<category><![CDATA[VoiceXML]]></category>

		<guid isPermaLink="false">http://blog.jeffhaynie.us/?p=71</guid>
		<description><![CDATA[This is a question I have been pondering lately.  I&#8217;m not as active in the speech industry these days &#8211; for several reasons.  I won&#8217;t entirely list them here, I&#8217;ll save that for another post one day.  However, one reason is because I believe the speech industry is stagnated a bit and [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>This is a question I have been pondering lately.  I&#8217;m not as active in the speech industry these days &#8211; for several reasons.  I won&#8217;t entirely list them here, I&#8217;ll save that for another post one day.  However, one reason is because I believe the speech industry is stagnated a bit and isn&#8217;t enjoying the growth we&#8217;re seeing in other technology sectors.  I am however still watching the industry and still involved loosely with a few projects.</p>
<p>Today, I received my normal email solicitation from <a href="http://www.tmaa.com/meisel.htm">Bill Meisel</a>.  Bill is a legend in the Speech Recognition world if you don&#8217;t already know him.  He&#8217;s currently an analyst and runs a firm called <a href="http://www.tmaa.com">TMA Associates</a> and publishes a newsletter titled &#8220;Speech Strategy News&#8221;.  Today&#8217;s brief email said the following:</p>
<blockquote>
<p style="text-indent: 0.25in">Apple&#8217;s iPhone is a well-publicized example of the changes in telephony. On the device side, the wireless phone is migrating into all all-purpose portable assistant and entertainment center. Communications infrastructure is also getting a makeover with the integration of the telephone and other communications modalities using Web and network standards. Ad-supported services, including directory assistance and information portals, are making telephony even more analogous to the developments in the Web world.</p>
<p style="text-indent: 0.25in">But the Web analogy can be taken too far. The Graphical User Interface that served the PC and Web world so well has obvious limitations on mobile devices with small screens and clumsy text-input modes. Any phone can call a network-based service and immediately have a Voice User Interface using speech recognition, and that VUI is the <em>same on any phone</em><span style="font-style: normal">. This creates opportunities that some companies are already beginning to exploit for services to consumers and businesses. Call centers will soon find that free directory assistance and other trends increase the number of calls and change their nature. Company telephone systems are also evolving into Unified Communications systems, and speech technology makes the many features of such systems usable. Speech technology can also automate field forces using only cell phones. </span></p>
<p style="text-indent: 0.25in">Do speech recognition and other speech technologies work? Yes, despite a tendency to paint anything short of human abilities as a limitation. When implemented well and used in appropriate applications, the technology is extremely effective. And the many successful deployed applications prove it.</p>
</blockquote>
<p>Bill&#8217;s right about how much a device can be limited compared to the offering of an in-network speech recognition application.  Speech recognition applications can also enjoy a lot of the benefits of a web application, especially when built on a <a href="http://www.voicexml.org">VoiceXML</a> (and CCXML and other related web standards) architecture.</p>
<p>But what&#8217;s the hangup for the industry?  Why the slow down?</p>
<p>It seems, like usual, there are always a multiple of reasons &#8211; many of which are timing, technology, finance and just pure circumstantial &#8211; all of which are uncontrollable.  However, I believe there are a couple of reasons that are addressable by the industry as a whole. <span id="more-71"></span><br />
<strong>First part, the pricing doesn&#8217;t work.</strong></p>
<p>Lucky for <a href="http://www.nuance.com">Nuance</a> these days, they&#8217;re not only the 800 lb gorilla &#8211; they&#8217;re really the only significant player left in the marketplace as far as real speech recognition and text-to-speech product offerings go.  They&#8217;re the monopoly player in the speech space, and they&#8217;re really exploiting that advantage from a distribution and pricing standpoint.  They&#8217;re going to extract as much from that position in terms of power and pricing as they can.  Ultimately, I believe, to their detriment &#8211; or better said, to the reduction of their own marketplace.<br />
Speech has always been a &#8220;nice to have&#8221; &#8211; not because you can&#8217;t build compelling applications that reduce cost.  Not because you can&#8217;t craft a compelling ROI.  Mike Dickerson, my previous co-founder at <a href="http://www.vocalocity.com">Vocalocity</a> (which we are no longer at), always use to say that speech suffers from the &#8220;I&#8221; part of the ROI being &#8220;too expensive&#8221; to get to the &#8220;return on&#8221; part of ROI.  In other words, if the investment is so large that you have to risk the return, it&#8217;s no longer compelling and just increases the risk that you were hoping to mitigate in the business case.  <em>Speech is just too expensive</em>.   Speech, except for a few applications, is a nice-to-have, it&#8217;s a luxury item &#8211; it&#8217;s nice technology, but only for the chosen few.  It really helps and generally people like it &#8211; if you are willing to spend more to get it.  In every case I was involved in, once you got it deployed and working, they loved it.  You just had to justify the cost first, and build a business case that made sense.  That seemed to always be the hardest with a nice-to-have technology like speech recognition.</p>
<p>Unfortunately, as the web continues to be more pervasive and ubiquitous, it continues to be just as fast, far less risky and much less expensive than voice.  And Bill is right, &#8220;the web analogy can be taken too far&#8221;.  People should stop trying to make the voice web fit the visual web.  However, what I think we&#8217;re seeing, is the interactive web is making it more difficult to justify (at the current pricing) the voice web.  Of course, I&#8217;m not trying to say the voice web is necessarily going away &#8211; that would be silly.  However, with email, IM, SMS and web browsers built directly into these devices &#8211; and with the availability of wi-fi networks everywhere &#8211; it&#8217;s almost impossible to not have access to the Web, when needed.  Also, as more and more software is deployed over the web as services using a modern browser, you almost don&#8217;t even have to have your own computer to gain access to necessary data and applications anymore (Another early justification for the phone and VoiceXML, that&#8217;s losing ground).</p>
<p>So how do you change this?  Lowering pricing is certainly one way.  I think you&#8217;d start to see, over a period of time, the ability to make speech more dominant in phone applications.  However, you have to couple that with a healthy eco-system and partner base of developers to get innovative applications conceived, developed and deployed.  And that leads to the second issue&#8230;</p>
<p><strong>Second part, the ecosystem isn&#8217;t healthy.</strong></p>
<p>The voice ecosystem has been dying in the past 2-3 years.  The evidence of this is that many small to mid-size companies (in relative terms to the industry itself) have merged with larger companies.  SpeechWorks was bought by Scansoft and then Scansoft and Nuance merged into Nuance after many years of fierce competition.  <a href="http://www.voicegenie.com">VoiceGenie</a> was acquired by <a href="http://www.genesyslabs.com">Genesys Labs</a>, which was also the acquirer of Telera.  Numerous smaller companies are either getting collapsed by larger companies or they&#8217;re simply going away.  Several smaller companies such as <a href="http://www.voxeo.com">Voxeo</a> and <a href="http://www.angel.com">Angel.com</a> seem to really be making serious headway in their businesses (I will say because I believe they&#8217;ve embraced the web model directly) and are growing significantly.  However, the sign of so much merger mania really is the fall out of the fact that the &#8220;speech industry&#8221; isn&#8217;t there.  It&#8217;s not big enough, it&#8217;s not compelling enough, and hard enough to maintain on its own.   Speech really is just (or really should be) another technology feature of an interactive application.  And, to make matters worse, even the <a href="http://www.speechtek.com">industry conference</a> and <a href="http://www.speechtechmag.com">magazine</a> recently merged with a larger conference as John Kelly <a href="http://triad.bizjournals.com/louisville/stories/2006/06/19/daily11.html">sold</a> SpeechTek to <a href="http://www.infotoday.com/">Information Today</a>. Mergers can be good.  In this case, they&#8217;re the direct reaction to survival.  You must merge to stay alive and in most of these cases, that&#8217;s the underlying reason.  Heck, even <a href="http://www.intel.com">Intel</a> sold off its telecommunications unit, <a href="http://www.dialogic.com/">Dialogic</a>, to another smaller company, Eicon Networks &#8211; and Brooktrout (the Dialogic competitor) has merged with Excel Switching to become <a href="http://www.cantata.com">Cantata</a>.  While they&#8217;re not speech only companies, everyone in the industry recognizes them as major players in the industry.  (By the way, these are just a few mergers, there have been many, many more such as <a href="http://www.intervoice.com">Intervoice</a> and <a href="http://www.edify.com">Edify</a> or the shocking <a href="http://www.ipunity.com">IP Unity</a> and <a href="http://www.glenayre.com">Glenayre</a>).</p>
<p>If you&#8217;re in to &#8220;reading the tealeaves&#8221;, another interesting note is how many people are &#8220;leaving&#8221; the industry.  Or, maybe not really leaving, but relocating to bigger opportunities, whether that be directly or indirectly related to the &#8220;speech industry&#8221;.  Industry veteran and ex co-founder and CTO of Snowshore and CTO of Cantata just left this past week to join <a href="http://www.bea.com">BEA</a> as Deputy CTO.  Smart move in my opinion given the state of everything that&#8217;s going on there and in the industry.  Of course, numerous other people have joined <a href="http://www.google.com">Google</a> in the past 2 years from the industry.<br />
<strong>So, where is the industry headed?</strong></p>
<p>Gosh, if I knew that, I&#8217;d still be in the industry and probably be an analyst, or better yet, a billionaire.</p>
<p>There are a lot of changing dynamics in the &#8220;speech industry&#8221;.  The 800 lb gorilla is small in relative terms, they have a $2B market cap and did almost $400M in 2006.  But, they also have a fairly diverse portfolio of products such as scanners and dictation, of which speech recognition is only one part.</p>
<p>For many years, we&#8217;ve thought <a href="http://www.ibm.com">IBM</a> and <a href="http://www.microsoft.com">Microsoft</a> would get serious and get into the business.  They&#8217;ve made some decent steps forward, but nothing serious yet.</p>
<p>Like we&#8217;ve said in the past, maybe 2007 will be the year&#8230;  (But, don&#8217;t bet on it)</p>
<p><img alt="Technorati" id="image329" src="http://freehogg.wordpress.com/files/2006/04/technorati.gif" /> technorati tags: <a rel="tag" href="http://www.technorati.com/tags/speech">speech</a>, <a rel="tag" href="http://www.technorati.com/tags/voicexml">voicexml</a>, <a rel="tag" href="http://www.technorati.com/tags/ccxml">ccxml</a>, <a rel="tag" href="http://www.technorati.com/tags/nuance">nuance</a>, <a rel="tag" href="http://www.technorati.com/tags/cantata">cantata</a>, <a rel="tag" href="http://www.technorati.com/tags/bea">bea</a>, <a rel="tag" href="http://www.technorati.com/tags/asr">asr</a>, <a rel="tag" href="http://www.technorati.com/tags/tts">tts</a></p>
<img src="http://blog.jeffhaynie.us/?ak_action=api_record_view&id=71&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.jeffhaynie.us/how-will-iphone-change-the-speech-industry.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>ZiVva to acquire Vocalocity</title>
		<link>http://blog.jeffhaynie.us/zivva-to-acquire-vocalocity.html</link>
		<comments>http://blog.jeffhaynie.us/zivva-to-acquire-vocalocity.html#comments</comments>
		<pubDate>Wed, 28 Jun 2006 15:41:06 +0000</pubDate>
		<dc:creator>Jeff Haynie</dc:creator>
				<category><![CDATA[CCXML]]></category>
		<category><![CDATA[MRCP]]></category>
		<category><![CDATA[SIP]]></category>
		<category><![CDATA[Speech]]></category>
		<category><![CDATA[VoIP]]></category>
		<category><![CDATA[VoiceXML]]></category>

		<guid isPermaLink="false">http://blog.jeffhaynie.us/?p=27</guid>
		<description><![CDATA[If you have not already heard, today ZiVva annouced the acquisition of Vocalocity.  You can read the press release to get a little more information.
This does mark the end to my involvement with Vocalocity.  It is a very bitter sweet ending.  I would like to thank Jim White of Sutter Hill Ventures, [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>If you have not already heard, today <a href="http://www.zivva.com">ZiVva</a> annouced the acquisition of Vocalocity.  You can read the <a href="http://biz.yahoo.com/prnews/060628/clw037.html?.v=57">press release</a> to get a little more information.</p>
<p>This does mark the end to my involvement with Vocalocity.  It is a very bitter sweet ending.  I would like to thank Jim White of <a href="http://www.shv.com">Sutter Hill Ventures</a>, Mike von Grey of Revenue Med, Kelly Conway of <a href="http://www.eloyalty.com">eLoyalty</a>, Sig Mosely of Imlay Investments, Said Mohammadioun, Lou Santora and all the other close advisors of the company, employees and investors &#8211; and most of all, I would like to thank Mike Dickerson, my partner and loyal friend for 5 tough and rewarding years at Vocalocity.  I will forever be indebted to you and all the people that made Vocalocity a success.</p>
<p>At Vocalocity, we had the concept of taking complex and ever changing communications standards such as VoiceXML, CCXML and SIP, to name a few, and making software that completely followed the standard and that was available on an OEM basis to larger customers.  To name a few: Microsoft, IBM, Aspect and Siemens.  We also strongly believed that standards in the communications world would completely disrupt their proprietary cousins, over time.   Time being the operative word.  Standards have taken over, but at at rate that is hard to track.   SIP has become so mainstream that some of the largest carriers in the world are using it natively.  <a href="http://www.w3.org/Voice">VoiceXML</a> has completely taken over the IVR and next generation interactive voice application space.  MRCP is now the de-facto specification for interfacing with a speech server.</p>
<p>However, standards also have competition &#8211; not just from big companies like Microsoft who sometimes choose not to follow them (or better, extend and embrace them).  For example, <a href="http://www.skype.com">Skype</a> is the world&#8217;s largest VoIP network &#8211; and they don&#8217;t use SIP &#8211; at least not natively.  And you can&#8217;t say that&#8217;s because they have a legacy application problem &#8211; they&#8217;re somewhat a new company dispite their size.</p>
<p>Some of the frustration that I always felt as part of my involvement in the standards process is that they move too slow.  Slow on purpose, I understand. But sometimes, too slow for any good.  Technology is evolving at a breakneck pace &#8211; and standards need to evolve on a more practical timeframe, not a timeframe that is suitable for a large research organization.  The <a href="http://www.ietf.org">IETF</a> is quite dramatically different than the <a href="http://www.w3.org">W3C</a> in this regard.  While the W3C moves like a glacier, the IETF moves like a startup.  IETF drafts expire every 6 months, forcing momentum.  But causing lots of drafts that sometimes have not much change.  On the other hand, how many different SIP extension drafts exist?  A lot and that causes some fragmentation.</p>
<p>I don&#8217;t claim to have any answers or even plausible suggestions &#8211; even after 4 years of standards work.  But, it needs to be examined.</p>
<p>To all the former employees of Vocalocity, you&#8217;ll always be near and dear to my heart for your hard work, dedication and friendship.  I wish everyone the best of luck in the future and hope we can work together again.  Please keep in touch.</p>
<hr />
<p><b>Technorati Tags:</b> <a href="http://www.technorati.com/tags/vocalocity" rel="tag">vocalocity</a>, <a href="http://www.technorati.com/tags/zivva" rel="tag">zivva</a>, <a href="http://www.technorati.com/tags/voicexml" rel="tag">voicexml</a>, <a href="http://www.technorati.com/tags/ccxml" rel="tag">ccxml</a>, <a href="http://www.technorati.com/tags/sip" rel="tag">sip</a>, <a href="http://www.technorati.com/tags/ietf" rel="tag">ietf</a>, <a href="http://www.technorati.com/tags/w3c" rel="tag">w3c</a>, <a href="http://www.technorati.com/tags/mrcp" rel="tag">mrcp</a></p>
<img src="http://blog.jeffhaynie.us/?ak_action=api_record_view&id=27&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.jeffhaynie.us/zivva-to-acquire-vocalocity.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Managing State in complex applications</title>
		<link>http://blog.jeffhaynie.us/managing-state-in-complex-applications.html</link>
		<comments>http://blog.jeffhaynie.us/managing-state-in-complex-applications.html#comments</comments>
		<pubDate>Fri, 17 Feb 2006 02:43:28 +0000</pubDate>
		<dc:creator>Jeff Haynie</dc:creator>
				<category><![CDATA[CCXML]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[SIP]]></category>
		<category><![CDATA[Speech]]></category>
		<category><![CDATA[VoIP]]></category>
		<category><![CDATA[VoiceXML]]></category>

		<guid isPermaLink="false">http://blog.jeffhaynie.us/?p=16</guid>
		<description><![CDATA[Using state charts or modeling tools like UML aren&#8217;t really anything new &#8211; part of developing a VoiceXML, CCXML or SIP application these days is about managing and coordinate states.  It&#8217;s not always fun, can be largely redundant and causes wicked, hard to find bugs in application development.
The W3C Voice Browser Working Group is [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Using state charts or modeling tools like UML aren&#8217;t really anything new &#8211; part of developing a VoiceXML, CCXML or SIP application these days is about managing and coordinate states.  It&#8217;s not always fun, can be largely redundant and causes wicked, hard to find bugs in application development.</p>
<p>The <a href="http://www.w3.org">W3C</a> <a href="http://www.w3.org/Voice">Voice Browser Working Group</a> is working on a new programming language called <a href="http://www.w3.org/TR/2006/WD-scxml-20060124/">State Machine Notation for Control Abstraction</a> (SCXML) which is modeled after Harel State Charts.  Jim Barnett of <a href="http://www.aspect.com">Aspect Software</a> is the lead Editor and an extremely bright individual.</p>
<p>Lots of the core language of the VoiceXML FIA and the CCXML EHIA is largely driven by state. CCXML itself is really a state machine application. The VoiceXML application you typically would author is really no more than a finite state machine, at least at this point in the evolution of modern dialog systems.  So, why SCXML and how is it affecting VXML and CCXML?  SCXML is an attempt to refactor part of the common capabilities and requirements of the language into another language that is common and specific to this cause.  A good reason to refactor.  Also, VoiceXML 3.0 will have additional requirements and having an ability to mix CCXML and VoiceXML applications using a unified state representation allows better applications, more modular applications and applications which can focus on the domain rather than the form.</p>
<p>Enter a new (yet old)  player in the area: AT&#038;T and Call Advantage.  They have years of experience working in the real world building a very complex SIP based system which is not your grandfather&#8217;s phone system.  They have recently introduced the results of their research in a new project called <a href="http://echarts.org/">echarts</a>.  What&#8217;s interesting here is that there is a lot to learn from their experience building highly state driven, very dynamic features in SIP.  There system has allowed them to layer sophisticated feature interaction and do it in a way which scales in a large environment with many subscribers.</p>
<p>We should look at how we can leverage their work as part of SCXML.</p>
<img src="http://blog.jeffhaynie.us/?ak_action=api_record_view&id=16&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.jeffhaynie.us/managing-state-in-complex-applications.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
