Corpus ID: 15084403
A Survey of Text Mining Architectures and the UIMA Standard
Mathias Bank, Martin Schierle
Published in LREC 2012
Computer Science
With the rising amount of digitally available text, the need for efficient processing algorithms is growing fast. Although a lot of libraries are commonly available, their modularity and… Expand
View On ACL
Mathias-Bank.De
Share This Paper
10 Citations
Highly Influential Citations
1
Background Citations
2
Methods Citations
2
Tables and Topics from this paper
Table 1
Text mining
Information processing
Information management
Library (computing)
Algorithm
Natural language processing
10 Citations
An encoder-decoder approach to mine conditions for engineering textual data
Fernando O. Gallego, R. Corchuelo
Computer ScienceEng. Appl. Artif. Intell.
2020
TLDR
A new condition mining method that relies on a deep neural network and attempts to overcome the limitations of existing methods for condition mining is proposed, which revealed two key findings: the connectives follows a long-tail distribution and the conditions are quite dissimilar from a semantic point of view. Expand
CAPLAN: An Accessible, Flexible and Scalable Semantification Architecture
Sebastian Furth, Volker Belli, Alexander Legler, A. Striffler, Joachim Baumeister
Computer Science
LWDA

2016
TLDR
This paper defines requirements for a state-of-the-art semantification architecture and presents a concept for a new semantization architecture meeting these requirements, key strengths of the presented concepts are accessibility for non-experts, scalability and flexibility. Expand
Interoperability and Customisation of Annotation Schemata in Argo
Rafal Rak, Jacob Carter, Andrew Rowley, R. Batista-Navarro, S. Ananiadou
Computer Science
LREC

2014
TLDR
It is argued that the customisation of annotation schemata does not need to compromise their interoperability and is superior to other state-of-the-art solutions in terms of expressiveness. Expand
9 Citations
PDF
Curation Technologies for Cultural Heritage Archives: Analysing and transforming a heterogeneous data set into an interactive curation workbench
Georg Rehm, Martin Lee, J. Schneider, Peter Bourgonje
Computer Science
DATeCH

2019
We present a platform that enables the semantic analysis, enrichment, visualisation and presentation of a document collection in a way that enables human users to intuitively interact and explore the… Expand
Advances in Machine Learning and Data Mining for Astronomy
M. Way, J. Scargle, K. Ali, A. Srivastava
Computer Science
2012
TLDR
This book explores how advances in machine learning and data mining can solve current and future problems in astronomy and looks at how they could lead to the creation of entirely new algorithms within the data mining community. Expand
Domain Specific Ontology Enhancing Communication Accuracy in Airport Operation
Dewan Abdullah, Hironao Takahashi, Uzair Lakhani
Computer Science
2019 IEEE 14th International Symposium on Autonomous Decentralized System (ISADS)
2019
TLDR
This paper proposes designing domain specific ontology to improve the accuracy of understanding and the proposed solution not only improves accuracy but is economical also as it does not require any major hardware investment. Expand
Apprentissage interactif de règles d'extraction d'information textuelle. (Iteractive learning of textual information extraction rules)
S. Bannour
Philosophy, Computer Science
2015
TLDR
Pour minimiser l’effort humain requis dans les de two familles d’approches de mise au point de regles, nous avons propose, dans ce travail de these, une approche hybride qui combine les deux en un seul systeme interactif qui procede en plusieurs iterations.Expand
1 Citation
Computational Methods for Text Analysis and Text Classification
H. Dalianis
Computer Science
2018
This chapter presents the computational methods for text analysis and text classification, including both rule-based and machine learning-based methods such as unsupervised and supervised methods.
3 Citations
PDF
Clinical Text Mining
H. Dalianis
Computer Science
Springer International Publishing
2018
22 Citations
PDF
Mining the Biomedical Literature
C. Mihaila, R. Batista-Navarro, +5 authors S. Ananiadou
Computer Science, Engineering
Healthcare Data Analytics
2015
8 Citations
References
SHOWING 1-10 OF 26 REFERENCES
UIMA: an architectural approach to unstructured information processing in the corporate research environment
D. Ferrucci, Adam Lally
Computer ScienceNatural Language Engineering
2004
TLDR
A general introduction to U IMA is given focusing on the design points of its analysis engine architecture and how UIMA is helping to accelerate research and technology transfer is discussed. Expand
974 Citations
Shallow , Deep and Hybrid Processing with UIMA and Heart of Gold
Ulrich Schäfer
2008
The Unstructured Information Management Architecture (UIMA) is a generic platform for processing text and other unstructured, human-generated data. For text, it has been proposed and is being used… Expand
8 Citations
PDF
The DeepThought Core Architecture Framework
Ulrich Callmeier, A. Eisele, Ulrich Schäfer, Melanie Siegel
Computer Science
LREC

2004
TLDR
The research performed in the DeepThought project aims at demonstrating the potential of deep linguistic processing if combined with shallow methods for robustness, and the feasibility of three ambitious applications will be demonstrated, namely: precise information extraction for business intelligence; email response management for customer relationship management; creativity support for document production and collective brainstorming.Expand
69 Citations
PDF
Evolving GATE to meet new challenges in language engineering
Kalina Bontcheva, V. Tablan, D. Maynard, H. Cunningham
Computer ScienceNatural Language Engineering
2004
TLDR
The focus of this paper is on recent developments in response to new challenges in Language Engineering: Semantic Web, integration with Information Retrieval and data mining, and the need for machine learning support. Expand
234 Citations
PDF
Ellogon: A New Text Engineering Platform
G. Petasis, V. Karkaletsis, G. Paliouras, Ion Androutsopoulos, C. Spyropoulos
Computer Science
LREC

2002
TLDR
Ellogon provides a powerful TIPSTER-based infrastructure for managing, storing and exchanging textual data, embedding and managing text processing components as well as visualising textual data and their associated linguistic information. Expand
FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Jordi Atserias Batalla, B. Casas, Elisabet Comelles Pujadas, Maritxell González, Lluís Padró, Muntsa Padró
Computer Science
LREC

2006
TLDR
This paper describes version 1.3 of the FreeLing suite of NLP tools, which has been improved and enlarged to cover more languages and offer more services: Named entity recognition and classification, chunking, dependency parsing, and WordNet based semantic annotation. Expand
235 Citations
PDF
Middleware for Creating and Combining Multi-dimensional NLP Markup
Ulrich Schäfer
Computer Science
NLPXML@EACL

2006
We present the Heart of Gold middleware by demonstrating three XML-based integration scenarios where multi-dimensional markup produced online by multilingual natural language processing (NLP)… Expand
TIPSTER Text Phase II Architecture Design Version 2.1p 19 June 1996
R. Grishman
Computer Science
TIPSTER

1996
The TIPSTER Program aims to push the technology for access to information in large (multi-GB) text collections, in particular for the analysts in Government agencies. Technology is being developed… Expand
ATLAS: A Flexible and Extensible Architecture for Linguistic Annotation
Steven Bird, David S. Day, John S. Garofolo, John Henderson, Christophe Laprun, M. Liberman
Computer Science
LREC

2000
TLDR
A formal model for annotating linguistic artifacts is described, from which an application programming interface (API) to a suite of tools for manipulating these annotations are derived, and a review of the current efforts towards implementing key pieces of this architecture is reviewed.Expand
145 Citations
PDF
SDL—A Description Language for Building NLP Systems
Hans-Ulrich Krieger
Computer ScienceHLT-NAACL 2003
2003
We present the system description language SDL that offers a declarative way of specifying new complex NLP systems from already existing modules with the help of three operators: sequence,… Expand
...
1
2
3
...
SORT BY
Related Papers
GPGPU Implementation of Cellular Automata Model of Water Flow
P. Topa, Pawel Mlocek
Computer Science
PPAM

2011
TLDR
This paper demonstrates how existing model of water flow can be ported to GPU environment with OpenCL programming framework, and presents how Cellular Automata model can be implemented for processing on Graphics Processing Unit (GPU).
10 Citations
Extending a DBMS with Spatial Operations
Walid G. Aref, H. Samet
Computer Science
SSD

1991
TLDR
A data architecture that matches the requirements for efficient processing of spatial queries in the extended database environment is proposed and provides an equal opportunity for both the spatial components and the non-spatial components of the data to participate in query processing and optimization.
90 Citations
Show More
2/10
Abstract
Tables and Topics
10 Citations
26 References
Related Papers
Stay Connected With Semantic Scholar
What Is Semantic Scholar?
Semantic Scholar is a free, AI-powered research tool for scientific literature, based at the Allen Institute for AI.
Learn More
About
About Us
Publishers
Beta Program
Contact
Research
Team
Datasets
Open Corpus
Supp.ai
Resources
Librarians
Tutorials
FAQ
API
Proudly built by AI2
Terms of ServicePrivacy Policy
By clicking accept or continuing to use the site, you agree to the terms outlined in our Privacy Policy, Terms of Service, and Dataset License
ACCEPT & CONTINUE