The following information was submitted:
Transactions: WSEAS TRANSACTIONS ON INFORMATION SCIENCE AND APPLICATIONS
Transactions ID Number: 53-190
Full Name: Ijaz Ali Shoukat
Position: Researcher
Age: ON
Sex: Male
Address: College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia
Country: SAUDI ARABIA
Tel: 00966563483371
Tel prefix: --
Fax: --
E-mail address: ishoukat@ksu.edu.sa
Other E-mails: ijaz342@yahoo.com
Title of the Paper: Indexing Size Approximation of WWW Repository with Leading Information Retrieval and Web Filtering Robots
Authors as they appear in the Paper: Ijaz Ali Shoukat, Mohsin Iftikhar
Email addresses of all the authors: ishoukat@ksu.edu.sa
Number of paper pages: 10
Abstract: The World Wide Web is the largest information system, and the size of its index is difficult to estimate. The Web is a useful and growing scientific resource that serves its readers as a digital library of electronic literature. Estimating how much of the WWW is indexed has been an open problem since 1998. Yahoo has claimed an index of 19 billion web documents, a figure Google disputes because the last published study, by Gulli and Signorini, put the total indexed web at around 11.5 billion pages. The Web is growing rapidly, so what is its current size? Which search engine indexes the most authentic information (PDF files)? Which search engine indexes the most web pages of all types? This short article answers these questions. We estimated the index size of the leading search engines (Google, Yahoo and MSN) with a simple, cost-effective approach, on the grounds that complex heuristics are unnecessary when a simple method is available. Our technique relies on querying the search engines with selected common affixes that can appear in any document or web page. The paper reports the total size of the currently indexed web and provides a comparative analysis showing scholars which search engine holds more authentic information and has the larger index.
Keywords: Index Size of Search Engines, Total Web Size, Comparison of Google, Yahoo and MSN
EXTENSION of the file: .pdf
Special (Invited) Session:
Organizer of the Session:
How Did you learn about congress: Internet and Web Applications, Web crawlers, WWW Information System(s)
IP ADDRESS: 212.138.69.18