UIMA

UIMA (/juˈmə/ yoo-EE-mə),[1] short for Unstructured Information Management Architecture, is an OASIS standard[2] for content analytics, originally developed at IBM. It provides a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and integration with search technologies.

Structure

The UIMA architecture can be thought of in four dimensions:

  1. It specifies component interfaces in an analytics pipeline.
  2. It describes a set of design patterns.
  3. It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
  4. It suggests development roles allowing tools to be used by users with diverse skills.

Implementations and uses

Apache UIMA
Developer(s)IBM, Apache Software Foundation (since October 2006)
Stable release
3.1.1 / November 8, 2019; 5 years ago (2019-11-08)[3]
Repository
Written inJava with C++ enablement
Operating systemcross-platform
Typetext mining, information extraction
LicenseApache License 2.0
Websiteuima.apache.org

Apache UIMA, a reference implementation of UIMA, is maintained by the Apache Software Foundation.

UIMA is used in a number of software projects:

  • IBM Research's Watson uses UIMA for analyzing unstructured data.[4]
  • The Clinical Text Analysis and Knowledge Extraction System (Apache cTAKES) is a UIMA-based system for information extraction from medical records.
  • DKPro Core is a collection of reusable UIMA components for general-purpose natural language processing.

See also

References

  1. ^ UIMA Frequently Asked Questions (FAQ's) The Apache Software Foundation
  2. ^ UIMA Specification The Apache Software Foundation.
  3. ^ "Apache UIMA - News". uima.apache.org. Retrieved 11 December 2019.
  4. ^ "Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog". blogs.apache.org. 14 February 2011. Retrieved 23 April 2018.