Package weka.clusterers
Class sIB
java.lang.Object
weka.clusterers.AbstractClusterer
weka.clusterers.RandomizableClusterer
weka.clusterers.sIB
- All Implemented Interfaces:
Serializable, Cloneable, Clusterer, CapabilitiesHandler, OptionHandler, Randomizable, RevisionHandler, TechnicalInformationHandler
Clusters data using the sequential information bottleneck (sIB) algorithm.
Note: only the hard clustering scheme is supported. sIB assigns each instance to the cluster that has the minimum cost/distance to the instance. The trade-off parameter beta is set to infinity, so 1/beta is zero.
For more information, see:
Noam Slonim, Nir Friedman, Naftali Tishby: Unsupervised document classification using sequential information maximization. In: Proceedings of the 25th International ACM SIGIR Conference on Research and Development in Information Retrieval, 129-136, 2002.
BibTeX:
@inproceedings{Slonim2002,
  author = {Noam Slonim and Nir Friedman and Naftali Tishby},
  booktitle = {Proceedings of the 25th International ACM SIGIR Conference on Research and Development in Information Retrieval},
  pages = {129-136},
  title = {Unsupervised document classification using sequential information maximization},
  year = {2002}
}
Valid options are:
-I <num> maximum number of iterations (default 100).
-M <num> minimum number of changes in a single iteration (default 0).
-N <num> number of clusters. (default 2).
-R <num> number of restarts. (default 5).
-U set not to normalize the data (default true).
-V set to output debug info (default false).
-S <num> Random number seed. (default 1)
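The hard assignment described above can be sketched in plain Java. This is a minimal illustration, not Weka's implementation: it assumes the merge cost from the cited paper, d(x, t) = (p(x) + p(t)) * JS(p(y|x), p(y|t)), where JS is the Jensen-Shannon divergence weighted by the relative priors. The class name `SibCostSketch` and all method and parameter names are hypothetical.

```java
/** Illustrative sketch only; names are hypothetical and this is not Weka's sIB code. */
final class SibCostSketch {

    /** Kullback-Leibler divergence KL(p || q) in nats; terms with p[i] == 0 contribute 0. */
    static double kl(double[] p, double[] q) {
        double sum = 0.0;
        for (int i = 0; i < p.length; i++) {
            if (p[i] > 0.0) {
                sum += p[i] * Math.log(p[i] / q[i]);
            }
        }
        return sum;
    }

    /** Weighted Jensen-Shannon divergence; the weights w1 and w2 are assumed to sum to 1. */
    static double js(double w1, double[] p1, double w2, double[] p2) {
        double[] mix = new double[p1.length];
        for (int i = 0; i < mix.length; i++) {
            mix[i] = w1 * p1[i] + w2 * p2[i];
        }
        return w1 * kl(p1, mix) + w2 * kl(p2, mix);
    }

    /** Assumed cost of merging instance x (prior px) into cluster t (prior pt). */
    static double mergeCost(double px, double[] pyGivenX, double pt, double[] pyGivenT) {
        double total = px + pt;
        return total * js(px / total, pyGivenX, pt / total, pyGivenT);
    }

    /** Hard assignment: returns the index of the cluster with the minimum merge cost. */
    static int assign(double px, double[] pyGivenX,
                      double[] clusterPriors, double[][] clusterDists) {
        int best = 0;
        double bestCost = Double.POSITIVE_INFINITY;
        for (int t = 0; t < clusterPriors.length; t++) {
            double cost = mergeCost(px, pyGivenX, clusterPriors[t], clusterDists[t]);
            if (cost < bestCost) {
                bestCost = cost;
                best = t;
            }
        }
        return best;
    }
}
```

Because beta is infinite, no soft membership probabilities are involved: each instance ends up in exactly one cluster, the one returned by `assign`.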
- Version:
- $Revision: 5538 $
- Author:
- Noam Slonim, Anna Huang
Constructor Summary
Constructor
sIB()
Method Summary
Modifier and Type | Method | Description
void | buildClusterer(Instances data) | Generates a clusterer.
int | clusterInstance(Instance instance) | Clusters a given instance. This method, defined in the Clusterer interface, simply returns the cluster assigned to the instance.
String | debugTipText() | Returns the tip text for this property.
Capabilities | getCapabilities() | Returns default capabilities of the clusterer.
boolean | getDebug() | Get debug mode.
int | getMaxIterations() | Get the maximum number of iterations.
int | getMinChange() | Get the minimum number of changes.
boolean | getNotUnifyNorm() | Get whether to normalize instances to unify prior probability before building the clusterer.
int | getNumClusters() | Get the number of clusters.
int | getNumRestarts() | Get the number of restarts.
String[] | getOptions() | Gets the current settings.
String | getRevision() | Returns the revision string.
TechnicalInformation | getTechnicalInformation() | Returns an instance of a TechnicalInformation object, containing detailed information about the technical background of this class, e.g., paper reference or book this class is based on.
String | globalInfo() | Returns a string describing this clusterer.
Enumeration | listOptions() | Returns an enumeration describing the available options.
static void | main(String[]) |
String | maxIterationsTipText() | Returns the tip text for this property.
String | minChangeTipText() | Returns the tip text for this property.
String | notUnifyNormTipText() | Returns the tip text for this property.
int | numberOfClusters() | Get the number of clusters.
String | numClustersTipText() | Returns the tip text for this property.
String | numRestartsTipText() | Returns the tip text for this property.
void | setDebug(boolean v) | Set debug mode (verbose output).
void | setMaxIterations(int i) | Set the maximum number of iterations.
void | setMinChange(int m) | Set the minimum number of changes.
void | setNotUnifyNorm(boolean b) | Set whether to normalize instances to unify prior probability before building the clusterer.
void | setNumClusters(int n) | Set the number of clusters.
void | setNumRestarts(int i) | Set the number of restarts.
void | setOptions(String[] options) | Parses a given list of options.
String | toString() |
Methods inherited from class weka.clusterers.RandomizableClusterer
getSeed, seedTipText, setSeed
Methods inherited from class weka.clusterers.AbstractClusterer
distributionForInstance, forName, makeCopies, makeCopy
-
Constructor Details
-
sIB
public sIB()
-
-
Method Details
-
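buildClusterer
public void buildClusterer(Instances data) throws Exception
Generates a clusterer.
- Specified by:
buildClusterer in interface Clusterer
- Specified by:
buildClusterer in class AbstractClusterer
- Parameters:
data - the training instances
- Throws:
Exception - if something goes wrong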
-
clusterInstance
public int clusterInstance(Instance instance) throws Exception
Clusters a given instance. This method, defined in the Clusterer interface, simply returns the cluster assigned to the instance.
- Specified by:
clusterInstance in interface Clusterer
- Overrides:
clusterInstance in class AbstractClusterer
- Parameters:
instance - the instance to be assigned to a cluster
- Returns:
- the number of the assigned cluster as an integer
- Throws:
Exception - if the instance could not be clustered successfully
-
setOptions
public void setOptions(String[] options) throws Exception
Parses a given list of options. Valid options are:
-I <num> maximum number of iterations (default 100).
-M <num> minimum number of changes in a single iteration (default 0).
-N <num> number of clusters. (default 2).
-R <num> number of restarts. (default 5).
-U set not to normalize the data (default true).
-V set to output debug info (default false).
-S <num> Random number seed. (default 1)
- Specified by:
setOptions in interface OptionHandler
- Overrides:
setOptions in class RandomizableClusterer
- Parameters:
options - the list of options as an array of strings
- Throws:
Exception - if an option is not supported
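The option array expected by setOptions is a flat list of flags and values. The sketch below assembles such an array from the flags documented above using only the standard library; the helper class `SibOptionsSketch` and its parameter names are hypothetical and not part of Weka.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative helper only; not part of Weka. */
final class SibOptionsSketch {

    /** Builds an option array from the flags listed in the setOptions documentation. */
    static String[] buildOptions(int maxIterations, int minChange, int numClusters,
                                 int numRestarts, int seed,
                                 boolean notNormalize, boolean debug) {
        List<String> opts = new ArrayList<>();
        opts.add("-I");
        opts.add(Integer.toString(maxIterations));
        opts.add("-M");
        opts.add(Integer.toString(minChange));
        opts.add("-N");
        opts.add(Integer.toString(numClusters));
        opts.add("-R");
        opts.add(Integer.toString(numRestarts));
        opts.add("-S");
        opts.add(Integer.toString(seed));
        // -U and -V are switches: they take no value and are present only when set.
        if (notNormalize) {
            opts.add("-U");
        }
        if (debug) {
            opts.add("-V");
        }
        return opts.toArray(new String[0]);
    }
}
```

An array built this way, for example `buildOptions(100, 0, 3, 5, 1, false, false)`, has the shape that `setOptions(String[] options)` parses.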
-
listOptions
public Enumeration listOptions()
Returns an enumeration describing the available options.
- Specified by:
listOptions in interface OptionHandler
- Overrides:
listOptions in class RandomizableClusterer
- Returns:
- an enumeration of all the available options.
-
getOptions
public String[] getOptions()
Gets the current settings.
- Specified by:
getOptions in interface OptionHandler
- Overrides:
getOptions in class RandomizableClusterer
- Returns:
- an array of strings suitable for passing to setOptions()
-
debugTipText
public String debugTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
setDebug
public void setDebug(boolean v)
Set debug mode (verbose output).
- Parameters:
v - true for verbose output
-
getDebug
public boolean getDebug()
Get debug mode.
- Returns:
- true if debug mode is set
-
maxIterationsTipText
public String maxIterationsTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property
-
setMaxIterations
public void setMaxIterations(int i)
Set the maximum number of iterations.
- Parameters:
i - maximum number of iterations
-
getMaxIterations
public int getMaxIterations()
Get the maximum number of iterations.
- Returns:
- maximum number of iterations
-
minChangeTipText
public String minChangeTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property
-
setMinChange
public void setMinChange(int m)
Set the minimum number of changes.
- Parameters:
m - the minimum number of changes
-
getMinChange
public int getMinChange()
Get the minimum number of changes.
- Returns:
- the minimum number of changes
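How the maximum number of iterations and the minimum number of changes might interact can be sketched as a simple loop. This is an assumption about the stopping rule, not a statement of Weka's exact logic: it stops once a sweep changes no more than `minChange` assignments, or once `maxIterations` sweeps have run. The class `SibStoppingSketch` and the `sweep` callback are hypothetical.

```java
import java.util.function.IntSupplier;

/** Illustrative sketch of an assumed stopping rule; not Weka's sIB code. */
final class SibStoppingSketch {

    /**
     * Runs sweeps until one changes at most minChange assignments or the
     * iteration cap is reached. Each call to sweep performs one pass over
     * the data and returns how many instances changed cluster.
     * Returns the number of sweeps actually run.
     */
    static int run(IntSupplier sweep, int maxIterations, int minChange) {
        int iterations = 0;
        while (iterations < maxIterations) {
            int changes = sweep.getAsInt();
            iterations++;
            if (changes <= minChange) {
                break; // assumed convergence test
            }
        }
        return iterations;
    }
}
```

With the documented defaults (100 iterations, 0 changes), this loop would end at the first sweep in which no instance moves, or after 100 sweeps.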
-
numClustersTipText
public String numClustersTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property
-
setNumClusters
public void setNumClusters(int n)
Set the number of clusters.
- Parameters:
n - number of clusters
-
getNumClusters
public int getNumClusters()
Get the number of clusters.
- Returns:
- the number of clusters
-
numberOfClusters
public int numberOfClusters()
Get the number of clusters.
- Specified by:
numberOfClusters in interface Clusterer
- Specified by:
numberOfClusters in class AbstractClusterer
- Returns:
- the number of clusters
-
numRestartsTipText
public String numRestartsTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property
-
setNumRestarts
public void setNumRestarts(int i)
Set the number of restarts.
- Parameters:
i - number of restarts
-
getNumRestarts
public int getNumRestarts()
Get the number of restarts.
- Returns:
- number of restarts
-
notUnifyNormTipText
public String notUnifyNormTipText()
Returns the tip text for this property.
- Returns:
- tip text for this property
-
setNotUnifyNorm
public void setNotUnifyNorm(boolean b)
Set whether to normalize instances to unify prior probability before building the clusterer.
- Parameters:
b - true to normalize, otherwise false
-
getNotUnifyNorm
public boolean getNotUnifyNorm()
Get whether to normalize instances to unify prior probability before building the clusterer.
- Returns:
- true if set to normalize, false otherwise
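One plausible reading of "normalize instances to unify prior probability" is that each instance's attribute values are rescaled to sum to 1, so that every instance carries the same total weight. The sketch below illustrates that reading only; it is an assumption rather than a description of Weka's code, and the class `SibNormalizeSketch` is hypothetical.

```java
/** Illustrative sketch of an assumed per-instance normalization; not Weka's sIB code. */
final class SibNormalizeSketch {

    /**
     * Returns a copy of the non-negative attribute values scaled to sum to 1.
     * A row whose values sum to 0 is returned unchanged to avoid dividing by zero.
     */
    static double[] normalize(double[] values) {
        double sum = 0.0;
        for (double v : values) {
            sum += v;
        }
        double[] out = values.clone();
        if (sum > 0.0) {
            for (int i = 0; i < out.length; i++) {
                out[i] /= sum;
            }
        }
        return out;
    }
}
```

Under this reading, a long document and a short document with the same relative term frequencies would be treated identically.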
-
globalInfo
public String globalInfo()
Returns a string describing this clusterer.
- Returns:
- a description of the clusterer suitable for displaying in the explorer/experimenter gui
-
getTechnicalInformation
public TechnicalInformation getTechnicalInformation()
Returns an instance of a TechnicalInformation object, containing detailed information about the technical background of this class, e.g., paper reference or book this class is based on.
- Specified by:
getTechnicalInformation in interface TechnicalInformationHandler
- Returns:
- the technical information about this class
-
getCapabilities
public Capabilities getCapabilities()
Returns default capabilities of the clusterer.
- Specified by:
getCapabilities in interface CapabilitiesHandler
- Specified by:
getCapabilities in interface Clusterer
- Overrides:
getCapabilities in class AbstractClusterer
- Returns:
- the capabilities of this clusterer
-
toString
-
getRevision
public String getRevision()
Returns the revision string.
- Specified by:
getRevision in interface RevisionHandler
- Overrides:
getRevision in class AbstractClusterer
- Returns:
- the revision
-
main
-