##plugins.themes.academic_pro.article.main##

Abstract

Contemporary web is comprised of trillions of pages and everyday tremendous amount of requests are made to put more web pages on the WWW. It has been difficult to manage information present on web than to create it. Web page categorization can be defined as an approach to categorize the web pages based on a set of predefined categories to manage large web content. Yahoo! and ODP are the examples of web directories in which pages are categorized manually or semi automatically, but it is a very time consuming task. There are many ways of categorizing web pages using different techniques. An approach to categorize web pages automatically on the basis of characteristics of web pages using neural network based single discrete perception training algorithm which is extended by selecting web page specific features to categorize web pages of predefined categories with high accuracy. The idea is presented with the help of two specific and major categories of web pages chosen for categorization that are newspaper and education.

Keywords: Categorization, Web

##plugins.themes.academic_pro.article.details##

Author Biographies

Shikha Batra, Computer Science Department

M.TECH,

Ashwani Goel, Computer Science Department

Lecturer in MCA Department

How to Cite
Batra, S., & Goel, A. (2014). Web Page Categorization Based on Neurons. International Journal of Emerging Trends in Science and Technology, 1(07). Retrieved from http://igmpublication.org/ijetst.in/index.php/ijetst/article/view/329

References

1. Pierre J. M., “Practical Issues for Automated Categorization of Web Pages,” September 2000.
2. Xiaoguang Q. and Davison B. D., “Web page classification: Features and algorithms,” ACM Computing Surveys, 41(2), 2009
3. Yahoo!, http://www.yahoo.com, Accessed date 14th March, 2012.
4. Open Directory Project, http://www.dmoz.org, Accessed date 15th March, 2012
5. Xu Z. et. al., “A Web Page Classification Algorithm Based On Link Information,” in DCABES’11 Proceedings of the Tenth International Symposium on Distributed Computing and Applications to Business, Engineering and Science , pp. 82-86, 2011.
6. Bartik V., “Text-Based Web Page Classification with Use of Visual Information,” in ASONAM’10 Proceedings of the International Conference on Advances in Social Network Analysis and Mining, pp. 416-420, 2010.
7. He Z. and Liu Z., “A Novel Approach to Naïve Bayes Web Page Automatic Classification,” in FSKD’08 Proceedings of the Fifth International Conference on Fuzzy System and Knowledge Discovery, pp. 361-365, 2008.
8. Radovanović M. and Ivanović M.,“Document Representation for Classification of Short Web Page Descriptions,” in Yugoslav Journal of Operations Research, I8, Number 1, pp. 123-138, 2008.
9. Dai W. et. al., “A Novel Web Page Categorization Algorithm Based on Block Propagation Using Query-Log Information,” in WAIM’06, LNCS 4016, pp. 435-446, 2006.
10. Materna J., “Automatic Web Page Classification,” in RASLAN’08 Proceedings of Recent Advances in Slavonic Natural Language Processing, pp. 84-93, 2008. Page | 38
11. Kwon O. and Lee J., “Web page classification based Nearest Neighbor approach,” in IRAL’00 Proceedings of the fifth international workshop on Information retrieval with Asian languages, pp. 9-15, 2000.
12. McCallum A. and Nigam K., “A Comparison of Event Models for Naive Bayes Text Classification,”