
IBM SystemT

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Sanjsahibm (talk | contribs) at 23:38, 18 July 2019 (added references). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
IBM SystemT
Developer(s): IBM
Written in: AQL, Java
Operating system: Linux, macOS, Windows
Type: Information extraction, text mining
Website: SystemT

IBM SystemT is a declarative information extraction system. It was first built in 2005 as a research project at IBM's Almaden Research Center. Its name is partially inspired by System R, a seminal project from the same research center.

SystemT[1] comprises three main components: (1) AQL, a declarative rule language with a syntax similar to SQL; (2) the Optimizer, which takes AQL statements as input and generates high-performance algebraic execution plans; and (3) the execution engine, which runs the plan generated by the Optimizer to perform information extraction over input documents.
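The division of labor among these three components can be illustrated with a minimal Python sketch. This is a toy under stated assumptions, not SystemT's API and not AQL: the rule format, the `compile_plan` and `execute` functions, and the `Span` type are all hypothetical names invented for illustration. It only shows the general pattern of declaring what to extract, compiling the declarations into a plan, and running the plan over a document.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """A matched region of the document (hypothetical type for this sketch)."""
    begin: int
    end: int
    text: str


# (1) Declarative rules: each states *what* to extract, not how to find it.
# This dict format is an assumption of the sketch; real AQL is SQL-like text.
RULES = [
    {"view": "Phone", "kind": "regex", "pattern": r"\d{3}-\d{4}"},
    {"view": "Name", "kind": "dictionary", "entries": ["Alice", "Bob"]},
]


def compile_plan(rules):
    """(2) Toy stand-in for an optimizer: turn each rule into a compiled matcher."""
    plan = []
    for rule in rules:
        if rule["kind"] == "regex":
            matcher = re.compile(rule["pattern"])
        elif rule["kind"] == "dictionary":
            # Rewrite the dictionary lookup as a single whole-word alternation.
            alternation = "|".join(re.escape(e) for e in rule["entries"])
            matcher = re.compile(r"\b(?:" + alternation + r")\b")
        else:
            raise ValueError(f"unknown rule kind: {rule['kind']}")
        plan.append((rule["view"], matcher))
    return plan


def execute(plan, document):
    """(3) Toy stand-in for an execution engine: run the plan over one document."""
    return {
        view: [Span(m.start(), m.end(), m.group()) for m in matcher.finditer(document)]
        for view, matcher in plan
    }


if __name__ == "__main__":
    doc = "Call Alice at 555-1234 or Bob at 555-9876."
    for view, spans in execute(compile_plan(RULES), doc).items():
        print(view, [s.text for s in spans])
```

Because the rules are plain data, the compilation step is free to change how they are evaluated without the rule author rewriting anything. That separation is what a declarative design makes possible, though the real Optimizer works over algebraic operators rather than the simple regex rewriting shown here.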

SystemT is available as part of IBM BigInsights[2] and has also been taught at multiple universities around the world. Starting in September 2016, a version of SystemT was made available as a companion to a sequence of online courses in text analytics[3].

[4]


References

  1. ^ Chiticariu, Laura; Krishnamurthy, Rajasekar; Li, Yunyao; Raghavan, Sriram; Reiss, Frederick R.; Vaithyanathan, Shivakumar (2010-01-01). "SystemT: An Algebraic Approach to Declarative Information Extraction". Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. ACL '10. Uppsala, Sweden: Association for Computational Linguistics: 128–137.
  2. ^ IBM BigInsights
  3. ^ Text Analytics: Getting Results with SystemT
  4. ^ Chiticariu, Laura; Danilevsky, Marina; Li, Yunyao; Reiss, Frederick; Zhu, Huaiyu (June 2018). "SystemT: Declarative Text Understanding for Enterprise". Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers). New Orleans, Louisiana: Association for Computational Linguistics: 76–83. doi:10.18653/v1/N18-3010. https://www.aclweb.org/anthology/N18-3010