-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathREADME
43 lines (31 loc) · 1.2 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
WKMEANS Copyright (c) 2012, Deniz Yuret
Usage: wkmeans [options] < input > output
-k number of clusters (default 2)
-r number of restarts (default 0)
-s random seed
-l input file contains labels
-w input file contains instance weights
-v verbose output
Input format (assuming you have m vectors of n dimensions):
[label_1] [weight_1] x_11 ... x_1m
...
[label_m] [weight_m] x_m1 ... x_mn
label_i : (string) label of the ith vector, required when -l used
weight_i : (double) weight of the ith vector, required when -w used
x_ij : (double) ith vector's jth component
Output format:
[label_1] c_1
...
[label_m] c_m
c_i : (int) cluster of ith vector
Algorithm: wkmeans is a k-means algorithm with (optional) instance
weights.
* Based on mpi_kmeans-1.5 by Peter Gehler.
* Based on C. Elkan. Using the triangle inequality to accelerate
kMeans. ICML 2003.
* Initialization based on Arthur, D. and Vassilvitskii,
S. (2007). K-means++: the advantages of careful seeding.
Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete
algorithms. pp. 1027-1035.
Please see the file LICENSE for terms of use. Everything is standard
C, so just typing make should give you an executable.