Blog: Seth Grimes Subscribe to this blog's RSS feed!

Seth Grimes

Welcome to my BeyeNETWORK Blog, which will focus on text analytics and other matters related to making sense of unstructured information sources in support of better enterprise decision making.

About the author >

Seth is a business intelligence and decision systems expert. He is founding chair of the Text Analytics Summit and principal consultant at Washington, D.C., based Alta Plana Corporation. Seth consults, writes, and speaks on information-systems strategy, data management and analysis systems, IT industry trends, and emerging analytical technologies.

Editor’s note: More articles, resources, news and events are available in Seth's BeyeNETWORK Expert Channel on Text Analytics. Be sure to visit today!

For a forthcoming BeyeNETWORK article, Perspectives on Text Analytics in 2010, I asked solution-provider executives about the top challenges and opportunities they foresee for the coming year.  Eight out of ten responses cited social media and sentiment analysis: solutions that harvest opinions, attitudes, mood, and other subjective information from news, social media, surveys, and other forms of enterprise feedback.

Claire Thomas, text analytics lead at SAP, calls sentiment analysis "broadly applicable to various industries and initiatives." She captures the rationale for the up-coming Sentiment Analysis Symposium, a conference I am organizing, slated for April 13 in New York. 

Claire and SAP have not endorsed the symposium.  I'm quoting Claire and other industry leaders simply as evidence that a practical, solutions focused sentiment-analysis forum, bridging technology and business concerns, is sorely needed.

Behind the need:

IBM SPSS Vice President Olivier Jouve notes that "Twitter, Facebook and other Web 2.0 media are the new critical sources for marketing" and a range of other enterprise functions.  These functions include what's often now called "social CRM," customer relationship management that taps social sources.  I predict that category will be short-lived, just as early-2000's "e-commerce" is now just one facet of modern, comprehensive business solutions.

The fact is, "more companies are looking at 360-degree views of customer feedback, and social media is a critical early warning (before a customer buys) and customer support (when a customer is having issues) indicator of customer experience," as Clarabridge CEO Sid Banerjee puts it.

I'd guess we can all agree with these statements, but how to handle these and traditional information sources?

According to Lexalytics CEO Jeff Catlin, in 2010, "sentiment will complete its transition to a 'checklist' feature that everyone who works in this space will have to provide.  All of the vendors (big and small) will claim to have sentiment."  So there's a problem evaluating sometimes over-stated claims to choose an appropriate solution.

What's appropriate?  Many user organizations can get by -- for the present -- with what Attensity CTO Ian Hersey characterizes as "social media aggregation and lightweight analytics (e.g., buzz analysis, media monitoring)." 

The Sentiment Analysis Symposium will be for organizations with needs that range from focused to sophisticated, to users who, in Ian's words, want "to incorporate the social media into the same analytical models as they use for their internal data and, more important, plug that social media into business processes."

These enterprise scale users require solutions that, in Olivier Jouve's words, handle sources that are "voluminous, cryptic, multi-lingual and deeply interconnected" via "sophisticated data collection mechanisms, advanced multi-lingual analysis and the infrastructure to manage daily terabytes of data."

This being the BeyeNETWORK's text-analytics channel, I've quoted text-analytics industry leaders although I expect media-monitoring, listening platform, and brand/reputation management users, agencies, and solution providers will be well represented at the Sentiment Analysis Symposium.

So check out the event on-line -- follow @SentimentSymp or me, @SethGrimes, on Twitter for updates -- and you have until February 3 to submit a speaking proposal by the way -- and do send me your questions and comments.

 


Posted January 26, 2010 8:55 AM
Permalink | No Comments |
I've posted a follow-on to my text-analytics channel article Text Data Quality.  It's titled Text Data Quality: Mistakes and More.  Stay tuned: The BeyeNETWORK will post another article on text data quality, this one focusing on sources, in early December.

Seth

Posted November 25, 2009 8:37 AM
Permalink | No Comments |

Covering text analytics software, market, conference, and other news and developments to help readers better understand advances in Knowledge Discovery in Text...

Software

Attensity has released a new version of its Voice of the Customer product, Analyze for VOC Version 5.2, that includes accuracy enhancements, a new sentiment scoring feature, a RESTful real-time integration architecture, and analysis enhancements include normalized time-series charts and calculated values according to the company. Version 5.2 introduces new out-of-the box reports for sentiment, Net Promoter Score (NPS) issues, customer churn and competition, according to Attensity.

Attensity has also released E-Service Version 6.1, an enhanced version of the company's application suite for customer service and support organizations.

Clarabridge launched Clarabridge Social Media Analysis (SMA), which the company characterizes as "the industry's first advanced text analytics software that allows companies to integrate social media content into their existing internal enterprise feedback to create more useful customer analysis," in September. The solution uses social media content from Alterian Techrigy's warehouse of social media content, with data from blogs, Facebook, Twitter, YouTube, MySpace, and other social media sites.

Linguamatics released I2E 3.1 in October. According to the company, key enhancements include support for enterprise deployment, NLP-based querying of a greater choice of document types, a new I2E Chemistry option with substructure and structure similarity search powered by ChemAxon, extended results reporting, flexible hyperlinking from extracted entities to web resources such as gene identifiers, glossaries of biomedical terms, and chemical structure visualization.

Orchestr8, a developer of semantic tagging and text mining software, in September announced a new technology to complement their content analysis service, AlchemyAPI. Visual Constraints is designed for extraction of structured data (product info, pricing, descriptions, etc.) from Web pages. A new, October AlchemyAPI release extends extraction capabilities to quotations and named-entity coreferences.

Resources

The UK National Centre for Text Mining has posted presentation slides from the October workshop on Text Mining for Scholarly Communications and Repositories.

An e-book by Graham Wilcock of the University of Helsinki, "Introduction to Linguistic Annotation and Text Analytics," according publisher Morgan & Claypool, "provides a basic introduction to both fields, and aims to show that good linguistic annotations are the essential foundation for good text analytics."

Conferences

Text Analysis Conference (TAC 2009) workshops will be held November 16-17, 2009 at the National Institute of Standards and Technology in Gaithersburg, Maryland, co-located with the Text REtrieval Conference (TREC), November 17-20, 2009.

A May 1, 2010 workshop on text mining will be held in Columbus, Ohio in conjunction with the 2010 SIAM International Conference on Data Mining (SDM 2010). The workshop is devoted to techniques of machine learning in conjunction with natural language processing, information extraction and algebraic/mathematical approaches to computational information retrieval.

The 1st Information Retrieval Facility Conference is slated for May 31, 2010 in Vienna, followed by the 3rd IRF Symposium, June 1-4 2010. The conference aims to provides a forum for researchers in information retrieval, Semantic Web technologies for IR, Natural language processing for IR, and large-scale or distributed computing for those areas. They symposium will especially focus on methodology and evaluation in patent searching and retrieval.

The North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2010 conference will take place June 1-6, 2010 in Los Angeles, California. The conference covers a broad spectrum of disciplines working towards enabling intelligent systems to interact with humans using natural language, and towards enhancing human-human communication through services such as speech recognition, automatic translation, information retrieval, text summarization, and information extraction.

The 48th annual meeting of the Association for Computational Linguistics (ACL 2010) will be held in Uppsala, July 11-16, 2010. The ACL workshops will be held July 15-16.

The 33rd annual ACM SIGIR Conference is slated for 18-23 July 2010, Geneva, Switzerland. SIGIR is the major international forum for the presentation of new research results and for the demonstration of new systems and techniques in the broad field of information retrieval (IR).

The 23rd International Conference on Computational Linguistics (COLING 2010) will be held in Beijing, August 23-27, 2010. There will be pre-conference workshops on August 21-22 and post-conference workshops on August 28.


Posted November 14, 2009 11:11 AM
Permalink | No Comments |

I recently received an inquiry from a student at a European management school who is writing a thesis about the relationship between search technology and business intelligence. She sees the two technologies as having a meeting point at text analytics and asked to pose a few questions on the topic. Many folks share her interest so my BeyeNETWORK blog seemed like a great place to share my responses. Here goes!

Management student> I have been struggling differentiating some terms and understanding them more clearly. Therefore, my questions are related to that confusion. I would also like to hear your opinion on these two technologies (BI as software and enterprise search) and their uses of text analytics.

MS> What is the difference between text analytics and text mining? Is it related to structured vs. unstructured data? Or is text mining a subset of text analytics?

Seth> There isn't a significant difference. I find that text mining is used in areas that have applied the technology longer and that apply data mining. Examples include life sciences and intelligence (e.g., counter-terrorism). Text analytics is more often used in business.

MS> Is content analysis the same as text analysis (if we look at textual documents, not rich data)?

Seth> To me, "content" generally indicates managed information that is typically found in a repository and that is often published on the Web. In this sense, e-mail and IM messages, survey responses, contact center notes and transcripts, and other forms of text generated during business operations are not content. In this sense, content analysis that concerns text is a subset of text analysis.

But "content" does also cover video, audio, and other media as you note. Content analysis would include these forms where text analysis wouldn't, as you understand, beyond work with textual tags.

MS> Is there a difference between text analytics done by search technology and BI applications?

Seth> Text analytics that backs up search is meant to support information retrieval: indexing, summarizing, and ranking documents in response to a search query. TA enables semantic indexing by topics and themes and relationships in order to go beyond indexing based solely on keywords. TA in support of search can also enable smarter, and natural-language, query processing. The example I'll give is that you can enter "map oslo" in Google and get a map of Oslo, because Google is doing a combination of named entity recognition for the geographic area, Oslo, and pattern matching that understands that "map " is a request for a map.

TA in BI (outside use of search for BI) is different. A complete definition of BI include treatment of information in textual and other forms, in databases, repositories, and on the Web. Search is a BI tool, and so is information extraction (a text analytics technique; information = entities, facts, topics, themes, etc.) into structured databases -- some see IE from text as equivalent to ETL for traditional databases -- and also analysis in the sense of data mining of text-extracted information. So when, for instance, you visualize a relationship network that includes people, companies, etc., based on text-extracted named entities and links (relationships, events, etc.), that's TA at work for BI.

MS > What are the fields that use text analytics the most? (any industries in particular?)

Seth> Life sciences and intelligence (including counter-terrorism) were the earliest use cases with serious work going back to the late '90s and they're still very strong domains for TA. But now we're seeing use in a spectrum of business applications as well.

Seth> Let me refer you for this question and the next to a report I recently published, which you can download for free at http://altaplana.com/TA2009 .

MS> How would you describe text analytics market?

[Seth> In my paper, I estimate a 2008 diversified, global market for text-analytics software and vendor provided professional services at $350 million, representing 40% growth from 2007. I foresee sustained growth rates of up to 25% for 2009.]

MS> There is a lot of talk about eDiscovery where text analytics plays a crucial role, but it is also one of the main markets for search technology. Are these two technologies (is it ok to call text analytics is a technology?) coming together?

Seth> I believe that in e-discovery, the principal application of TA is (still) in support of search in the sense that I wrote about above, creating richer indexes that allow legal researchers (litigants) to respond faster and comprehensively to discovery mandates. TA is only starting to be used by legal professional for investigatory purposes, for what you could call "making the case." Compliance and fraud investigations, and risk management, are starting points in this type of use. But I don't think the technology is being used systematically by litigators yet. I do think we'll see a lot more of this investigatory type of use.

I hope you've found our Q&A useful! As always, if you have questions or comments, do get in touch.


Posted July 30, 2009 1:13 PM
Permalink | No Comments |

Covering text analytics software, market, conference, and other news and developments to help KDnuggets readers better understand advances in Knowledge Discovery in Text...

Software

Orchestr8 released on June 18 a significant upgrade to its AlchemyAPI content analysis online service. According to the company, the update includes expanded language coverage (adding Portuguese and Swedish), enhanced text categorization, and integration with Linked Data standards. "AlchemyAPI is a web-based service that enriches a publisher's content through automated tagging, categorization, and semantic analysis available as both a free online API and commercial subscription service."

Attensity Group announced on July 8 the availability of its new, hosted Survey Advantage service at a $5,000 per month point of entry. Attensity Survey Advantage "enables departments within large organizations and government agencies to measure, chart and understand customer sentiment and top issues expressed in customer feedback surveys."

Book

Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit by Steven Bird, Ewan Klein, and Edward Loper was published in June 2009. "This book offers a highly accessible introduction to Natural Language Processing, the field that underpins a variety of language technologies ranging from predictive text and email filtering to automatic summarization and translation. You'll learn how to write Python programs to analyze the structure and meaning of texts, drawing on techniques from the fields of linguistics and artificial intelligence." Visit O'Reilly for information.

Conferences

A joint conference of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing will be held August 2-7, 2009 in Singapore. ACL-IJCNLP 2009 will cover a broad spectrum of technical areas related to natural language and computation.

The 2009 conference of the German Society for Computational Linguistics and Language Technology (GSCL) will include a workshop on the Unstructured Information Management Architecture (UIMA), September 30, 2009, in Potsdam, Germany. "Participants are invited to present applications realized using UIMA, general experiences using UIMA as a platform for natural language processing, as well as technical papers on particular aspects of the UIMA framework. Alternatives to and comparisons of other frameworks - e.g. GATE, LingPipe, etc. - with UIMA are of interest, too."

The third IEEE International Conference on Semantic Computing is slated to be held September 14-16, 2009 in Berkeley, California. ICSC 2009 is "an international forum for researchers and practitioners to present research that advances the state of the art and practice of Semantic Computing, as well as identifying the emerging research topics and defining the future of the field."

Recent Advances in Natural Language Processing RANLP 2009 is slated for September 14-16, 2009 in Borovets, Bulgaria, preceded by September 12-13 tutorials and followed by associated workshops September 17-18.

The ACM Eighteenth Conference on Information and Knowledge Management (CIKM 2009) will take place in Hong Kong, November 2-6, 2009. The conference is sponsored by ACM SIGIR and SIGWEB.

Language and Technology Conference 2009: Human Language Technologies as a Challenge for Computer Science and Linguistics (LTC 2009) will take place November 6-8 in Poznan, Poland. "Human Language Technologies (HLT) continue to be a challenge for computer science, linguistics and related fields as these areas become an ever more essential element of our everyday technological environment... [creating] a favorable climate for the intensive exchange of novel ideas, concepts and solutions across initially distant disciplines."

Text Analysis Conference (TAC 2009) workshops will be held November 16-17, 2009 at the National Institute of Standards and Technology in Gaithersburg, Maryland, co-located with the Text REtrieval Conference (TREC), November 17-20, 2009.

Mining User-Generated Content for Security (MINUCS 2009) will take place December 9, 2009, in Venice, Italy, colocated with the First International Conference on User Centric Media (UCMedia 2009) in Venice, 9-11 December 2009. "The aim of this workshop is to bring together researchers from academia and industry who develop technologies for mining open-source user-generated textual data on the Web, as well as end-users interested in exploiting such technologies for knowledge discovery. The emphasis is placed on large-scale text mining systems..."


Posted July 30, 2009 7:10 AM
Permalink | No Comments |
I recently gave a series of talks -- two live presentations and a webinar -- on Text Mining, Web 3.0 & the Semantic Web.  They were sponsored by a software publisher, Nstein, but other than focusing on Nstein's media & publishing sweet spot, my talks were vendor neutral and editorially independent.

I'm thrilled that a couple of attendees have blogged my talk, in particular, my June 18, 2009 London presentation.  John Welsh posted "Seth Grimes on the semantic web - but is B2B media ready to benefit?" and Peter Thomas's write-up was titled "Literary calculus?"  (I don't know why these two Brits are so enamored of question marks.) 

As an aside -- Peter had earlier posted a blog article, "A first for me...," noting that he "received my invitation to the event through Seth himself after having made contact with him on twitter.com."  I'm the first person Peter has met "IRL" -- you can surely guess what those letters stand for -- post our twitter contact.  I similarly met analyst Merv Adrian that way.  We became twitter friends through mutual real-life contacts.  Then one May morning he tweeted that he was in New York City for a meeting, near Penn Station.  It so happens that I was in New York to attend a different meeting, and I was staying near Penn Station.  Forty-five minutes later, Merv and I were sitting down to breakfast.  (For you foodies: We met at the Tick-Tock Diner at the corner of 34th St. and 8th Ave and both had the excellent corned-beef hash.)

Back to Text Mining, Web 3.0 & the Semantic Web -- If you'd like to listen to a recording of my Nstein webinar, please visit http://www.nstein.com/en/ondemand_webinars.php .  Let me know what you think!

Posted June 26, 2009 3:13 PM
Permalink | 2 Comments |

Text analytics New and Noteworthy -- May-June, 2009 -- covering text analytics software, market, conference, and other news and developments to help readers better understand advances in Knowledge Discovery in Text...

Standards

OASIS, the international open standards consortium, announced on March 19 approval of the Unstructured Information Management Architecture (UIMA) version 1.0 as an OASIS Standard. Visit the OASIS UIMA Technical Committee page for information.

Product Releases and Announcements

Provalis Research released version 3.2 of QDA Miner on March 30. The company characterizes QDA Miner as "an easy-to-use qualitative analysis software tool or coding, annotating, retrieving and analyzing small and large collections of documents."

The Univ of Sheffield released GATE 5.0 on May 29th, offering JAPE pattern-language improvements, GUI improvements, and new ontology-based gazetteer, inter-annotator agreement, and other plug-ins. GATE, the General Architecture for Text Engineering, provides an open-source information extraction framework with many extensions. Release details can be found in the changelog, and the software may be downloaded at http://www.gate.ac.uk/download/.

Attensity announced the latest version of its Voice of the Customer (VoC) and Market Voice applications, Attensity 5, on June 1. A part of Attensity Group's suite of business applications for unstructured data, Attensity 5 offers integrated Automated Response for Customer Experience Management (CEM).

Teragram, a division of SAS, launched Sentiment Analysis Manager (SAM), a social media analysis tool, slated for late-July availability.

Nstein has announced TME 5, a next-generation Text Mining Engine, with fall 2009 availability. The new release will offer enhanced content enrichment, faceted sentiment analysis, full W3C compliant, and a RESTful interface.

Reports and Publications

My study report, Text Analytics 2009: User Perspectives on Solutions and Providers, is out. It reports findings of a survey I recently conducted and also provides an introduction to text analytics that is targeted to BI, data mining, and data warehousing practitioners and business analysts. The report is free. Download it via http://altaplana.com/TA2009.

The April issue of the quarterly newsletter of the (UK) National Centre for Text Mining (NaCTeM) is now available. (http://www.nactem.ac.uk/newsletter/)

Events

DEFT'09, fifth edition of the Defi Fouille de Texte evaluation campaign in text mining, will address this year multilingual opinion analysis. The challenge, now under way, will culminate in a one-day workshop, June 22, 2009, near Paris.

SIGIR 2009, the Association for Computing Machinery's Special Interest Group on Information Retrieval, will convene July 19-23, 2009 in Boston.

The 2009 annual meeting of the Association for Computational Linguistics (ACL) will be combined with the International Joint Conference on Natural Language Processing (IJCNLP), taking place August 2-7, 2009 in Singapore.

The European Summer School in Information Retrieval (ESSIR) is slated for August 31-September 4, 2009 at the University of Padua, Italy. Closing date for registration is June 30.

The ACM Eighteenth Conference on Information and Knowledge Management (CIKM 2009) will take place in Hong Kong, November 2-6, 2009. The conference is sponsored by ACM SIGIR and SIGWEB.


Posted June 10, 2009 11:17 AM
Permalink | No Comments |
A quick note to let everyone know --

My study report, "Text Analytics 2009: User Perspectives on Solutions and Providers," is out.  It reports findings of a survey I recently conducted and also provides an introduction to text analytics that is targeted to BI and data warehousing practitioners and business analysts. The report is free.  Download it via http://altaplana.com/TA2009 .

I'm running another survey, topic "Continuous Intelligence: BI Beyond Real Time."  I'd very much welcome your participation: http://altaplana.com/CIsurvey .  To explain: Continuous Intelligence drives faster decisions and actions across the enterprise.  A more usual term for CI is Complex Event Processing. It's software that can analyze and transform "data in flight" without requiring data to be loaded to a data warehouse.  So CI takes BI beyond real time to deliver intelligence on demand.  I'm working with a sponsoring company, Aleri, to understand the needs of data users.  You can respond anonymously.  We will send the first 200 respondents a $5 Amazon gift card in thanks!


Posted June 4, 2009 12:31 PM
Permalink | No Comments |

I'll quickly hit several text-analytics topics in this blog article: a Summit, a Survey, Interest Indicators, and Market Growth.

Market Growth Keeps Pace

In my article, Market Outlook for Text Analytics, I foresaw sustained text-analytics market growth of 25% in 2009. I'm pleased that earnings announcements made since the date I submitted my article, April 6, affirm that prediction. Notably, text-analytics giant (albeit under the guise of an enterprise search platform and applications vendor) Autonomy announced "record revenues, up 23% from Q1 2008 including strong organic growth." (Revenue due to Autonomy's Interwoven acquisition, which closed March 17, would have had only a relatively minor role in this growth.) And TEMIS 2009 Q1 "revenue reflects an overall growth of over 30% compared to last year." This is great news despite a global economic downturn!

Text Analytics Survey

It's important also to look at the demand side, at customer and prospect perspectives.

Please respond to a survey I'm conducting. I'd like to know how your organization is dealing with "unstructured" information and the role text mining/analytics plays or might play. I'll write up survey findings in a free report, available in early June. The survey will take you 5-10 minutes. Please help.

Text Analytics Summit

Next, I want to make sure you're aware of this year's Text Analytics Summit, slated for June 1-2 in Boston. You can get $100 off your registration with the link I've provided if you enter my name in the promotional offer box. (Note that I do not have a financial interest in registrations or the summit.) Also, this year I'll again be leading a workshop, "Text Analytics for Dummies," the afternoon of May 31.

Interest Indicators

Interest in text analysis is growing! I've been interviewed recently by several outlets as a leading consultant and analyst:


Lastly, as always, if you have questions or concerns related to text analytics, please do get in touch!


Posted April 28, 2009 10:16 AM
Permalink | No Comments |

Covering text analytics software, market, conference, and other news and developments to help KDnuggets readers better understand advances in Knowledge Discovery in Text...

Software

Clarabridge has released Clarabridge Smart Response for real-time automated customer response management. The company calls the solution "the industry's first real-time, automated customer issue analysis, response, and routing solution powered by text mining."

SPSS has released Text Analysis for Surveys 3.0. The company states that the "new version includes pre-built survey categories and expanded language support to easily gather Voice of the Customer feedback from open-ended surveys."

Lexalytics Salience 4.1 is now out and includes an Entity Management Toolkit used to build and train entity recognizers.

Services

Bioalma launched novo|seek in February, a free, text-analytics reliant search engine for biomedical literature in Medline and US Grants. The company aims to compete with Google Scholar and PubMed by combining named entity recognition, information extraction, and knowledge discovery to produce conceptually meaningful results with a greater ease of use than its competitors. Visit http://www.novoseek.com.

A February release of ResponseTek:CEM adds new capabilities for analysis of free-form customer feedback, and integration with customer support tools, to ResponseTek's hosted customer experience management (CEM) software platform. Visit http://www.responsetek.com.

Open Health Natural Language Processing Consortium

The Open Health Natural Language Processing Consortium (OHNLP), launched on March 16 "to facilitate and encourage new annotator and pipeline development, exchange insights and collaborate on novel biomedical natural language processing systems and develop gold-standard corpora for development and testing. The Consortium promotes the open source Unstructured Information Management Architecture (UIMA) framework and SDK as the basis for biomedical NLP systems." As part of the launch, Mayo Clinic and IBM have released clinical NLP technologies open source. The two Apache-licensed NLP platforms, cTAKES (clinical Text Analysis and Knowledge Extraction System) and medKAT/P (medical Knowledge Analysis Tool/Pathology), are based on UIMA and "aim at extracting information from clinical notes and pathology reports.

Book

"Computational Linguistic Text Processing: Lexicon, Grammar, Parsing and Anaphora Resolution" by Rodolfo Delmonte deals with sentence-level linguistic phenomena and text or discourse level linguistic phenomena. It comes with a CD that contains the system GETARUNS for unrestricted text parsing.

Conferences

Enterprise Search Summit 2009 will take place May 12-13 in New York and will include text-analytics content. The summit Web site is www.enterprisesearchsummit.com/2009/.

A detailed program for the fifth annual Text Analytics Summit, June 1-2 in Boston, is available.

The 2009 Semantic Technology Conference is slated for June 14-18 in San Jose. SemTech aims to cover the entire spectrum of business, government, and consumer activity taking place within the emerging field of semantic technologies.

DEFT'09, fifth edition of the DÉfi Fouille de Texte evaluation campaign in text mining, will address this year multilingual opinion analysis. The challenge, now under way, will culminate in a one-day workshop, June 22, 2009, near Paris.

SIGIR 2009, the Association for Computing Machinery's Special Interest Group on Information Retrieval, will convene July 19-23, 2009 in Boston.

Surveys

I am running a survey studying how organizations use text analytics, what information they're analyzing, and what they look for in solutions. I will relate findings in a free report that will benchmark text-analytics usage and perceptions about the technology and solutions. Please take part in the survey: http://altaplana.com/TAsurvey.

Please participate in Hurwitz & Associates' survey on corporate use of text analytics. Analyst Fern Halper will post results in her blog at http://fbhalper.wordpress.com.


Posted April 14, 2009 7:03 AM
Permalink | No Comments |