Clustify 3.1 Adds Ability to Automatically Ignore Email Headers and Footers for Improved Clustering

The new Clustify version can automatically ignore email headers, footers, email addresses, and other clutter that can reduce the quality of document clustering results.

 
PRLog - Jul. 16, 2012 - BRYN MAWR, Pa. -- Hot Neuron LLC announces the release of version 3.1 of its Clustify™ software, featuring the ability to automatically ignore email headers and footers to produce cleaner results that are more useful during the document review phase of e-discovery.

Clustify groups related documents into labeled clusters, providing an overview of the document set and allowing the user to review and categorize related documents together for greater efficiency and consistency.  The user chooses whether to group documents that are conceptually similar, near-duplicates, or elements of an email thread.  Version 3.1 adds the option to automatically ignore email headers, footers, email addresses, and other clutter that can reduce the quality of the results.

Email can be problematic for text analytics because the substantive part of the email is often short, but it may be accompanied by a large amount of unimportant text in the headers and footers.  The extraneous text can result in software choosing less informative labels for the clusters, and can cause emails to be grouped together because they share long disclaimers in the footers, rather than because they have something important in common.  Replies in an email thread can have header and footer text from parent emails embedded in the middle of the body, making it difficult to identify and remove the clutter.  Clustify 3.1 handles this automatically.  It ignores the clutter when analyzing documents, and ghosts it when displaying the documents for the user, so a reviewer can see everything with the proper emphasis.
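To make the problem concrete, the following Python sketch shows how a shared footer can inflate the apparent similarity of two unrelated emails, and how removing clutter before comparison changes that. It is an illustration only and is not Clustify's method, which this release does not describe. The patterns, the `strip_clutter` and `jaccard` function names, and the sample emails are all assumptions made for the example; real headers and disclaimers vary far more than these simple rules cover.

```python
import re

# Hypothetical patterns for illustration only; real email clutter varies widely.
# The optional ">" prefix lets the rules catch headers and footers that are
# quoted inside a reply, in the middle of the body.
HEADER_LINE = re.compile(r"^\s*>*\s*(from|to|cc|bcc|sent|date|subject):", re.IGNORECASE)
FOOTER_START = re.compile(r"^\s*>*\s*(confidentiality notice|disclaimer)\b", re.IGNORECASE)
EMAIL_ADDRESS = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def strip_clutter(text):
    """Drop header lines, footer blocks, and email addresses from an email."""
    kept = []
    in_footer = False
    for line in text.splitlines():
        if HEADER_LINE.match(line):
            in_footer = False  # a header marks the start of an embedded message
            continue
        if FOOTER_START.match(line):
            in_footer = True
        if in_footer:
            if not line.strip():
                in_footer = False  # assume a blank line ends the footer
            continue
        kept.append(EMAIL_ADDRESS.sub("", line))
    return "\n".join(kept)


def jaccard(a, b):
    """Word-overlap similarity between two texts (0.0 to 1.0)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


FOOTER = ("CONFIDENTIALITY NOTICE: This message is intended only for the "
          "named recipient and may contain privileged information.\n")
email_a = ("From: a@example.com\nTo: b@example.com\nSubject: Lunch\n\n"
           "Pizza on Friday?\n\n" + FOOTER)
email_b = ("From: c@example.com\nTo: d@example.com\nSubject: Contract\n\n"
           "The merger draft is attached.\n\n" + FOOTER)

raw_similarity = jaccard(email_a, email_b)
clean_similarity = jaccard(strip_clutter(email_a), strip_clutter(email_b))
print(raw_similarity > clean_similarity)
```

In this toy case the two emails look similar only because they share header field names and a disclaimer; once the clutter is stripped, the short substantive bodies have no words in common.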

"We constantly push Clustify to produce better results with minimal effort by the user, and this is a solid step in that direction," says Bill Dimm, the CEO of Hot Neuron.  "Version 3.1 makes it even easier to get good, clean results without a lot of tweaking or data polishing by the user."

About Hot Neuron

Hot Neuron LLC is an information retrieval software and services company located in Havertown, Pa.  Its Clustify software (http://www.cluster-text.com) does conceptual clustering, near-duplicate detection, content-based email threading, and automatic categorization / predictive coding.  Clustify is used in litigation to make the document review phase of e-discovery more efficient and consistent.

Clustify is a trademark of Hot Neuron LLC.  Hot Neuron is a registered service mark of Hot Neuron LLC.

--- End ---

Contact Email: ***@hotneuron.com
Source: Hot Neuron LLC
Phone: 610-581-7702
Zip: 19010
City/Town: Bryn Mawr - Pennsylvania - United States
Industry: Software, Legal
Tags: ediscovery, e-discovery, clustering, cluster, litigation
Shortcut: prlog.org/11925377