next up previous
Next: About this document ... Up: A Competitive Term Selection Previous: Discussion


Baeza-Yates, R. & Ribeiro, N.: Modern Information Retrieval, Addison Wesley, 1999.

Booth A.: A law of occurrence of words of low frequency, Information and Control, 10 (4), pp. 383-396, 1967.

C. E. Shannon, The Bell System Technical Journal 27, 379 (1948).

Gelbukh, A.; Sidorov, G. & Guzman-Arenas, A.: Use of a weighted topic hierarchy for text retrieval and classification, LNCS 1692, pp 130-135, 1999.

Jiménez-Salazar, H.; Castro, M.; Rojas, F.; Miñón, E.; Pinto, D. & F. Carcedo: Unsupervised Term Selection using Entropy, Research on Computing Science 14, ISSN 1665-9899, pp. 163-172, México, 2005.

Montemurro, M.A. & Zanette D. H.: Entropic Analysis of the role of the words in literaty texts, CoRR, arXiv:cond-mat/0109218, v1 12, sep 2001.

Moyotl, E.: DPT: un método de selección de términos para categorización de textos, Master in Computer Science Thesis, FCC-BUAP, 2005 (In spanish).

E. Moyotl & H. Jiménez: An Analysis on Frequency of Terms for Text Categorization, Procesamiento del Lenguaje Natural, ISSN 1135-9948, pp 141-146, España.

Moyotl, E. & Jiménez, H.: Enhancement of DPT Feature Selection Method for Text Categorization, LNCS 3406, pp. 706-709, 2005.

Pérez-Carballo, J. & Strzalkowski, T.: Natural Language Information Retrieval: progress report, Information Processing and Management v.36(1), Elsevier, pp. 155-178, 2000.

Pinto, D.; Jiménez-Salazar, H.; Rosso P. & Sanchis, E.: BUAP-UPV TPIRS: A System for Document Indexing Reduction at WebCLEF. Accessing Multilingual Information Repositories, CLEF 2005, LNCS 4022, 2006.

Pinto D.; Jiménez-Salazar, H. & Paolo Rosso: Clustering Abstracts of Scientific Texts using the Transition Point Technique, LNCS 3878, pp. 536-546, 2006.

Rojas, F.; Jiménez, H.; Pinto, D. & Aurelio López: Dimensionality reduction for Information Retrieval, Research on Computing Science, Vol 20, pp 107-112 2006.

Rojas, F.; Jiménez, H. & Pinto, D.: Text Reduction-Enrichment at WebCLEF, In Proceedings of CLEF 2006, pp. 53, 2006.

Salton, G., Wong, A. & Yang, C.: A Vector Space Model for Automatic Indexing, Communications of the ACM, 18(11) pp. 613-620, 1975.

Sebastiani, F.: Machine Learning in Automated Text Categorization, ACM Computing Surveys, 34(1), pp. 1-47, 2002.

Urbizagástegui, A.R.: Las Posibilidades de la Ley de Zipf en la Indización Automática, /2851/RUBEN2.htm, 1999 (In spanish).

Yang, Y., Pedersen, P.: A Comparative Study on Feature Selection in Text Categorization, Proc. of ICML-97, 14th Int. Conf. on Machine Learning, pp. 412-420, 1997.

Zipf, G.K.: Human Behaviour and the Principle of Least Effort, Addison-Wesley, 1949.

David Pinto 2007-05-08