The following information was submitted:
Transactions: INTERNATIONAL JOURNAL of COMPUTERS
Transactions ID Number: 20-234
Full Name: Marko Ferme
Position: Assistant
Age: ON
Sex: Male
Address: Smetanova 17
Country: SLOVENIA
Tel:
Tel prefix:
Fax:
E-mail address: marko.ferme@uni-mb.si
Other E-mails:
Title of the Paper: Text analysis with sequence matching
Authors as they appear in the Paper: Marko Ferme, Milan Ojsteršek
Email addresses of all the authors: marko.ferme@uni-mb.si, ojstersek@uni-mb.si
Number of paper pages: 8
Abstract: This article describes some common problems faced in natural language processing. The main problem consists of matching a user-given sentence against an existing knowledge base of semantically described words or phrases. The main difficulties in this process are outlined and the most common solutions used in natural language processing are reviewed. A sequence matching algorithm is introduced as an alternative solution and its advantages over the existing approaches are explained. The algorithm is described in detail, beginning with the longest subsequences discovery algorithm. Then the major components of the similarity measure are defined and the computation of the concurrence and dispersion measures is presented. Results of the algorithm's performance on a test set are then shown and different ways of using the algorithm are discussed. The work concludes with some ideas for the future and some examples where our approach can be practically used.
Keywords: Sequence matching, subsequence analysis, similarity measure, fuzzy string search, phrase detection
EXTENSION of the file: .pdf
Special (Invited) Session: Sequence matching with subsequence analysis
Organizer of the Session: 104-237
How Did you learn about congress:
IP ADDRESS: 164.8.252.52