The following information was submitted:
Transactions: WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS
Transactions ID Number: 29-292
Full Name: Azlinah Mohamed
Position: Associate Professor
Age: ON
Sex: Female
Address: Faculty of Information Technology and Quantitative Sciences, Universiti Teknologi MARA, UiTM Shah Alam, Selangor
Country: MALAYSIA
Tel: 603-55211242
Tel prefix:
Fax: 603-55435510
E-mail address: azlinah@tmsk.uitm.edu.my
Other E-mails:
Title of the Paper: Malay Document Analysis and Recognition
Authors as they appear in the Paper: Norzaidah Md Noh, Mohd Rusydi Abdul Talib, Azlin Ahmad, Shamimi A. Halim, Azlinah Mohamed
Email addresses of all the authors: norzaidah@tmsk.uitm.edu.my, azlin@tmsk.uitm.edu.my, shamimi@tmsk.uitm.edu.my, azlinah@tmsk.uitm.edu.my
Number of paper pages: 10
Abstract: Malay Document Analysis and Recongition aims to extract digital malay documents automaticaly. These extracted documents are presented in the form of namely articles, newspapers and magazines. Over the years, Malay digital documents has increased and published on the world-wide-web (www) and consequently used by many organizations local and abroad. In this paper, we introduce the implementation of a tool for Malay language document identification in mono- and multi-lingual documents. The tool development includes a feature extraction and a neural network technique. The feature extraction consists of documents filtering, word matching and binary representation of varied length sentences from many types of documents including generic text files, MS Word files, Adobe PDF and HTML web pages. The neural network employs back propagation neural network (BPNN) algorithm with adjustable number of neurons and weights between input, hidden and output layer. A database was cons!
tructed consisting of 300 sentences of mono and multi-lingual documents. Experiments show average recognition rate of 90% accuracy in recognizing of Malay language documents, which has more than 80%, matched Malay words. Our tool is able to recognise Malay language documents with reasonable accuracy.
Keywords: Document processing, Language recognition, Backpropagation neural network, Document filtering, Word matching technique
EXTENSION of the file: .doc
Special (Invited) Session: Malay Language Document Identification Using BPNN
Organizer of the Session: 699-245
How Did you learn about congress:
IP ADDRESS: 60.53.134.24