The following information was submitted:
Transactions: INTERNATIONAL JOURNAL of EDUCATION AND INFORMATION TECHNOLOGIES
Transactions ID Number: 19-872
Full Name: Janez Brezovnik
Position: Assistant
Age: ON
Sex: Male
Address: Smetanova ulica 17, 2000 Maribor
Country: SLOVENIA
Tel:
Tel prefix:
Fax:
E-mail address: janez.brezovnik@uni-mb.si
Other E-mails: janez.brezovnik@gmail.com
Title of the Paper: TextProc – a natural language processing framework and its use as plagiarism detection system
Authors as they appear in the Paper: Janez Brezovnik, Milan Ojsteršek
Email addresses of all the authors: janez.brezovnik@uni-mb.si, ojstersek@uni-mb.si
Number of paper pages: 8
Abstract: This paper describes TextProc, a natural language processing framework. First, the framework's software architecture is described. The architecture consists of several parts, each of which is described in detail. Natural language processing capabilities are implemented as software plug-ins. Plug-ins can be combined into processes that perform a practical natural language processing function. Several practical TextProc processes are briefly described, such as part-of-speech tagging, named entity tagging, and others. One of these processes performs plagiarism detection on texts in the Slovenian language and is explained in detail. This process is in use in the digital library of the University of Maribor. The integration of the digital library with TextProc is also briefly described. The paper concludes with some ideas for future development.
Keywords: Natural language processing, Text processing, Text mining, Plagiarism detection, Software framework, Slovenian language
EXTENSION of the file: .doc
Special (Invited) Session: TextProc – a natural language processing framework
Organizer of the Session: 104-218
How Did you learn about congress:
IP ADDRESS: 164.8.252.52