The following information was submitted:
Transactions: WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS
Transactions ID Number: 53-654
Full Name: Tengku Sembok
Position: Professor
Sex: Male
Address: Faculty of Information Science and Technology
Country: MALAYSIA
Tel: 0123373539
Fax: +60389256732
E-mail address: tmtsembok@gmail.com
Other E-mails: tmts@ftsm.ukm.my
Title of the Paper: A Rule and Template Based Stemming Algorithm for Arabic Language
Authors as they appear in the Paper: Tengku Mohd T. Sembok, Belal Mustafa Abu Ata and Zainab Abu Bakar
Email addresses of all the authors: tmtsembok@gmail.com, zainabcs@salam.uitm.edu.my
Number of paper pages: 10
Abstract: Stemming is the conflation of all variants of a word to a single form called the root or stem. It plays a vital role in natural language processing and understanding. As in other languages, there is a need for an effective stemming algorithm for Arabic words. Arabic has rich and complex morphological word structures and rules. An Arabic stemming algorithm based on morphological rules has been developed, and to enhance its effectiveness, a dictionary of root words is used to determine the right stems. The Arabic stemming algorithm developed by Al-Omari is studied, and a new algorithm is proposed to improve its performance. The improvements relate to the order in which the dictionary is looked up and the order in which the morphological rules are applied.
Keywords: Stemming, indexing, information retrieval, natural language processing
EXTENSION of the file: .doc
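
The abstract describes stemming that combines morphological rules with a dictionary of root words. The following is a minimal Python sketch of that general idea only; it is not the Al-Omari algorithm and not the algorithm proposed in the paper. The root dictionary, the affix lists, the templates, and the names ROOTS, PREFIXES, SUFFIXES, TEMPLATES, match_template, candidates, and stem are all hypothetical, chosen for illustration. The order used here (check the dictionary before and after each affix rule) is likewise an assumption.

```python
# Illustrative sketch only: tiny, hypothetical data, not a real Arabic lexicon.
ROOTS = {"كتب", "درس", "علم"}        # hypothetical dictionary of root words
PREFIXES = ["ال", "و"]               # hypothetical prefix-stripping rules
SUFFIXES = ["ون", "ات", "ة"]         # hypothetical suffix-stripping rules
TEMPLATES = ["فاعل", "مفعول"]        # hypothetical word templates (patterns)
ROOT_SLOTS = "فعل"                   # letters marking root positions in a template


def match_template(word, template):
    """Return the letters of `word` at the root slots of `template`,
    or None if the word does not fit the template.
    Simplification: every ف, ع, ل in a template is treated as a root slot."""
    if len(word) != len(template):
        return None
    root = []
    for w, t in zip(word, template):
        if t in ROOT_SLOTS:
            root.append(w)           # root position: keep the word's letter
        elif w != t:
            return None              # fixed template letter must match exactly
    return "".join(root)


def candidates(word):
    """Yield the word itself, then forms with affixes stripped by the rules."""
    yield word
    for p in PREFIXES:
        if word.startswith(p):
            rest = word[len(p):]
            yield rest
            for s in SUFFIXES:
                if rest.endswith(s):
                    yield rest[:-len(s)]
    for s in SUFFIXES:
        if word.endswith(s):
            yield word[:-len(s)]


def stem(word):
    """Return a dictionary root for `word`, or the word unchanged if none is found."""
    for cand in candidates(word):
        if cand in ROOTS:            # dictionary lookup comes first
            return cand
        for template in TEMPLATES:   # then try each template in order
            root = match_template(cand, template)
            if root in ROOTS:
                return root
    return word


for w in ["مكتوب", "الكاتب", "دارسون"]:
    print(w, "->", stem(w))
```

With this toy data, the first two words reduce to the root كتب and the third to درس. Changing the order of PREFIXES, SUFFIXES, or TEMPLATES, or moving the dictionary check, changes which candidate is accepted first, which is the kind of ordering effect the abstract refers to.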