Next: Experimental Results
Up: Clustering of Abstracts in
Previous: Data Set
We used measure (commonly used in information retrieval [16]) in order to determine which method obtains the best
performance. Given a set of clusters
and a set of classes
, the measure between a cluster
and a class is given by the following formula.

(3) 
where
,
. and are defined as follows:

(4) 
and

(5) 
The global performance of the clustering is calculated using the values of . This measure is named measure and it is shown
as follows:

(6) 
David Pinto
20060525