Friday, 15 January 2010

Wseas Transactions

New Subscription to Wseas Transactions

The following information was submitted:

Transactions: WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS
Transactions ID Number: 89-302
Full Name: Mario Malcangi
Position: Professor
Age: ON
Sex: Male
Address: Via Comelico 39 - 20135 Milano
Country: ITALY
Tel:
Tel prefix:
Fax:
E-mail address: malcangi@dico.unimi.it
Other E-mails:
Title of the Paper: Multi-method Audio-based Retrieval of Multimedia Information
Authors as they appear in the Paper: Mario Malcangi
Email addresses of all the authors: malcangi@dico.unimi.it
Number of paper pages: 10
Abstract: - Multimedia information and embedded systems are two major technological advances that have significantly changed the way people interact with systems and information in recent years. In this context, audio proves to be the most advantageous media for interacting with embedded systems and their content. Advantages include: hands-free operation; unattended interaction; and simple, cheap devices for capture and playback. The use of embedded systems to seek information stored locally or on the web points up several difficulties inherent in the nature of multimedia-information signals. These difficulties are especially evident when palmtop or deeply embedded devices are used for such purposes. Developing a set of digital-signal-processing-based algorithms for extracting audio information is a primary step toward providing user-friendly access to multimedia information and developing powerful communication interfaces. The algorithms aim to extract semantic and sy!
ntactic information from audio signals, including voice. Extracted audio features are employed to access information in multimedia databases, as well as to index it. More extensive, higher-level information, such as audio-source identification (speaker identification) and genre (in the case of music), must be extracted from the audio signal. One basic task involves transforming audio into symbols (e.g. music transformed into a score, speech transformed into text) and transcribing symbols into audio (e.g. score transformed into musical audio, text transformed into speech). The purpose is to search for and access any kind of multimedia information by means of audio. To attain these results, digital audio-processing, digital speech-processing, and soft-computing methods need to be integrated. Neural networks are used as classifiers and fuzzy logic is used for making smart decisions.
Keywords: Audio features, Multimedia information, Speech-to-text, Audio-to-score, Text-to-speech, score-to-audio, Digital audio processing, Pattern matching, Soft computing
EXTENSION of the file: .doc
Special (Invited) Session: Audio Interaction with Multimedia Information
Organizer of the Session: 697-624
How Did you learn about congress:
IP ADDRESS: 87.15.160.1