Market Outlook for Text Analytics
by Seth Grimes
Originally published April 23, 2009
2008 Market SizingLast year I estimated a worldwide 2007 text analytics market (software and support and vendor-provided services) of $250 million with continuing 25% year-on-year growth, at that time over twice the rate of growth of the business intelligence (BI) software market. Higher growth rates reflected an immature market, a smaller growth baseline and rapid uptake of a relatively new technology. Despite the economic downturn, I believe the value of the worldwide text analytics market passed $350 million in 2008. And I believe growth will continue apace, at a rate exceeding 25%, through 2009.
To explain my methodology…
Several text analytics players release revenue and growth figures. Some of them are public companies, Nstein for example. Some privately held European companies will provide figures as a matter of mandate or practice. Expert System and Temis are in this category. In one case, revenue figures came out because of an acquisition, 2008’s Lexalytics-Infonic deal.
Where revenue figures are not available, I estimate them, either by projecting from prior-year figures – for example, when Business Objects announced its 2007 acquisition of Inxight, we learned that Inxight’s 2006 revenues were over $25 million – or by estimating from market share, head-count, venture funding and/or sales announcements. Business Objects and the Inxight technologies are, of course, now part of SAP. In many cases, Attensity, Basis Technology and Clarabridge among them, I can get company execs or other observers to confirm that estimates are in the ballpark, even if others such as Linguamatics decline to comment.
For many companies, both public and private, text analytics is only one component of a broad line of products and services. In these cases – companies such as Autonomy, IBM, SAP, SAS, and SPSS – I allocate a portion of overall revenue to text analytics. This step is critical, particularly in the case of Autonomy, which is by far the largest text analytics company, albeit one with a strong solutions and search focus whose technologies cover a variety of media beyond text. Autonomy’s recorded $503.2 million in 2008 revenue, up 47% from 2007, shows that steep growth is not limited to small companies.
Vendor DynamicsTo consider vendors dynamics – new entrants, mergers, repositioning – let’s slice up the software providers by size.
We start with the smallest providers, new entrants and established companies with headcount under, somewhat arbitrarily, twenty. Who’s out there? Among others, there’s Crimson Hexagon, Digital Sonata, GeeYee, Jodange, Leximancer, Sentimetrix and Serendio, most focused to a significant extent on sentiment analysis. Data points don’t derive solely from company communications. I field inquiries from potential users looking for a solution provider who ask specifically about these companies and others. The demand is out there.
Do note that not every small company wants to get large. Take Alias-I, which publishes the LingPipe suite of Java libraries for the linguistic analysis of human language. Partners Breck Baldwin and Bob Carpenter seem quite happy and seem to be doing quite well, sticking to what they do best: coding great natural-language processing software.
The established middle range includes companies such as Megaputer, IxReveal, Attensity and Clarabridge. They’re bulking up, with new OEM deals, partnerships, and (in some cases) executive line-ups. Take Attensity as a special case. The company is now part of a roll-up under industry veterans, former Inxight CEO Ian Bonner, Inxight co-founder Ian Hersey and SAP co-founder Klaus Tschira, and executives of Attensity, e-mail and content solutions providers empolis and Living-e. The roll-up will allow the combination to cover a wider spectrum of information life cycle management needs.
GATE and RapidMiner also deserve special mention. Like other successful open-source projects, the user community is far out of proportion to project staffing. Core GATE development is carried out at the University of Sheffield, England. The software is being extended through collaborations with organizations such as Matrixware, and the project is looking at making support, consulting and customization available from a commercial entity, just as the folks behind RapidMiner did in creating Rapid-I to commercialize the software.
Provider PositioningProvider focus on solutions rather than just primarily on technology is a sure indicator of market maturity. Autonomy CEO Michael Lynch captured this shift in a statement accompanying his company’s January announcement of 2008 financial results, calling “the explosion of unstructured information” a fundamental driver. According to Lynch, “this change for the IT industry affects all areas... Whilst this switch to unstructured information may take ten to fifteen years, in 2008 we saw two particular aspects of this bigger picture come to the fore. The first and most important of these which affects all industry sectors are regulatory changes such as the amendments to the U.S. Federal Rules of Civil Procedure. The second of these effects was the increase in regulation and litigation in the financial markets as a result of recent turmoil.”
Now where Autonomy has built strong verticals in e-discover and compliance, through acquisitions and internal development, other companies such as Attensity, Clarabridge and Expert System have intensified their focus on voice of the customer (VOC) and related applications. Island Data went so far as to change its name to Overtone, reflecting this same focus.
The Demand SideFinally, it's also important to understand the customer/prospect demand side of the equation. How have current users benefitted from text analytics, and what's on their (and prospective users') wish lists? I studied this topic in last year’s BeyeNETWORK report, "Voice of the Customer: Text Analytics for the Responsive Enterprise." It's definitely worth updating and broadening coverage beyond VOC to the spectrum of text analytics applications. You can help. Please take a short survey that I have set up. It's designed for users, prospects, integrators and consultants – no vendors please. It should take you 5-10 minutes depending on how much you have to say. I'll be writing up my findings in a free report that I'll plan to summarize in a future BeyeNETWORK channel article. I’ll also relay findings in my opening address at this year’s Text Analytics Summit, which I chair. (Do consider attending this year, June 1-2 in Boston, and if you’re just starting out with text analytics, my pre-summit workshop, Text Analytics for Dummies, may be for you. The BeyeNETWORK is a summit media partner.)
I expect the demand-side study to surface positive perceptions of text analytics, of technology capabilities and experiences, and ROI, but there's no substitute for actually asking. Similarly, investigating key market-outlook indicators validates perceptions of text analytics market strength, that solution providers, whom users rely on, are doing well. The market seems healthy, full of opportunity for users and providers alike.
Recent articles by Seth Grimes
Copyright 2004 — 2020. Powell Media, LLC. All rights reserved.
BeyeNETWORK™ is a trademark of Powell Media, LLC