KantanMT Releases New Language Technology Feature to Optimize Machine Translation Training Data

New Kantan Preprocessor™ technology dynamically improves training data for machine translation
By: KantanMT.com
 
DUBLIN - Dec. 22, 2014 - PRLog -- KantanMT is pleased to announce the release of its new Kantan Preprocessor™ feature, which optimizes training data for a more efficient and streamlined KantanMT engine building process.

The Kantan Preprocessor uses a series of Search/Replace rules, based on KantanMT Regex (regular expressions) technology to dynamically alter data prior to its inclusion as training data for a KantanMT engine.

The KantanMT Community can now easily create, test and manage their customized preprocessing rules for training data, such as date formats and differing placeholder counts in the source and target text. This powerful and flexible approach to optimizing data, ensures segments previously rejected by Kantan Data Cleansers are now suitable for training purposes.

Benefits of using the Kantan Preprocessor:

Improved Data Quality: Rejected segments can now be easily edited or fixed using the Kantan Preprocessor rule editor

Increased Productivity: Production ready engines can be deployed faster, thanks to higher quality training data

The rule editor is accessible from an engine's rejects report, by clicking the gear icon to the right of each rule. The editor displays the effect of your rules on both the source and target components of the rejected segment in real-time, and via a user interface, similar to the Microsoft Word track changes feature.

“Increasing productivity and translation efficiency are key drivers for success in the translation supply chain”, said Tony O’Dowd, Founder and Chief Architect of KantanMT, “the Kantan Preprocessor provides a powerful and flexible way to optimize data, which then ensures a much more efficient and productive MT workflow.”

For more information, please go to www.KantanMT.com, or contact Louise Irwin (louisei@kantanmt.com).

About KantanMT

KantanMT.com is a leading SaaS based machine translation platform that enables users to develop and manage customized machine translation engines in the cloud. The innovative technologies offered on the KantanMT.com platform enable users to easily build MT engines in over 750 language combinations, seamlessly integrating into localization workflows and web applications. KantanMT is based in the INVENT Building, DCU Campus, Dublin 9.

Contact
Louise Irwin
***@kantanmt.com
End
Source:KantanMT.com
Email:***@kantanmt.com Email Verified
Tags:Machine Translation, mt, Language Technology, Localization, Kantan Preprocessor
Industry:Technology
Location:Dublin - Dublin - Ireland
Account Email Address Verified     Account Phone Number Verified     Disclaimer     Report Abuse
KantanMT News
Trending
Most Viewed
Daily News



Like PRLog?
9K2K1K
Click to Share