IBM SystemT
Developer(s) | IBM |
---|---|
Written in | AQL , Java |
Operating system | Linux, macOS, Windows |
Type | Information Extraction, Text mining |
Website | SystemT |
IBM SystemT is a declarative information extraction system. It was first built in 2005, as a research project at IBM's IBM Almaden Research Center. Its name is partially inspired by System R, a seminal project from the same research center.
SystemT[1] comprises the following three main components: (1) AQL, a declarative rule language with a similar syntax to SQL; (2) Optimizer, which accepts AQL statements as input and generates high-performance algebraic execution plans; and (3) Executing engine, which executes the plan generated by the Optimizer and performs information extraction over input documents.
SystemT is available as part of IBM BigInsights [2], and has also been taught in multiple universities around the globe. A version of SystemT was available (starting in September 2016) as a companion to a sequence of online courses in Text Analytics[3].
References
- ^ Chiticariu, Laura; Krishnamurthy, Rajasekar; Li, Yunyao; Raghavan, Sriram; Reiss, Frederick R.; Vaithyanathan, Shivakumar (2010-01-01). "SystemT: An Algebraic Approach to Declarative Information Extraction". Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. ACL '10. Uppsala, Sweden: Association for Computational Linguistics: 128โ137.
- ^ IBM BigInsights
- ^ Text Analytics: Getting Results with SystemT
- ^ {{Cite title = "{S}ystem{T}: Declarative Text Understanding for Enterprise", author = "Chiticariu, Laura and Danilevsky, Marina and Li, Yunyao and Reiss, Frederick and Zhu, Huaiyu", booktitle = "Proceedings of the 2018 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers)", month = jun, year = "2018", address = "New Orleans - Louisiana", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/N18-3010", doi = "10.18653/v1/N18-3010", pages = "76--83", abstract = "The rise of enterprise applications over unstructured and semi-structured documents poses new challenges to text understanding systems across multiple dimensions. We present SystemT, a declarative text understanding system that addresses these challenges and has been deployed in a wide range of enterprise applications. We highlight the design considerations and decisions behind SystemT in addressing the needs of the enterprise setting. We also summarize the impact of SystemT on business and education."}}