We compared our results with those reported by Pinto et al. [20]. Tables 10 and 11 present this evaluation, setting our best approach against the results of [20], which we label PintoetAl. The comparison was possible only on the CICLing-2002 and hep-ex corpora because, to date, no published results with the required characteristics exist for the KnCr corpus. We observed that KLD obtains comparable results, and we attribute this behaviour to the size of each text. We suggest using a smoothing procedure, but the number of document terms that do not appear in the corpus vocabulary can be extremely high. Further analysis will investigate this issue.
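To make the smoothing concern concrete, the following is a minimal sketch of a symmetric KLD computed over smoothed term distributions, together with a measure of how many document terms fall outside the vocabulary. The smoothing scheme shown (a fixed `epsilon` for unseen vocabulary terms, with observed frequencies discounted so the distribution sums to one) is an assumption made for illustration and is not necessarily the procedure used in our experiments or in [20]. The names `smoothed_distribution`, `symmetric_kld`, and `oov_rate` are hypothetical.

```python
import math
from collections import Counter


def smoothed_distribution(tokens, vocabulary, epsilon=1e-6):
    """Term distribution over `vocabulary` with epsilon smoothing (assumed scheme).

    Terms outside the vocabulary are ignored; vocabulary terms absent from
    the document receive probability `epsilon`, and observed relative
    frequencies are scaled by `beta` so that the result sums to 1.
    """
    counts = Counter(t for t in tokens if t in vocabulary)
    total = sum(counts.values())
    if total == 0:
        # No usable terms: fall back to a uniform distribution.
        return {t: 1.0 / len(vocabulary) for t in vocabulary}
    unseen = len(vocabulary) - len(counts)
    beta = 1.0 - unseen * epsilon
    return {
        t: beta * counts[t] / total if t in counts else epsilon
        for t in vocabulary
    }


def kld(p, q):
    """Kullback-Leibler divergence D(p || q) over a shared vocabulary."""
    return sum(p[t] * math.log(p[t] / q[t]) for t in p)


def symmetric_kld(p, q):
    """Symmetric variant: D(p || q) + D(q || p)."""
    return kld(p, q) + kld(q, p)


def oov_rate(tokens, vocabulary):
    """Fraction of document tokens that do not appear in the vocabulary."""
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t not in vocabulary) / len(tokens)
```

For short texts, `oov_rate` can be high, in which case most of the probability mass in the smoothed distribution comes from very few observed terms; this is the situation the paragraph above flags for further analysis.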