Package weka.clusterers

Class sIB

All Implemented Interfaces:
Serializable, Cloneable, Clusterer, CapabilitiesHandler, OptionHandler, Randomizable, RevisionHandler, TechnicalInformationHandler

public class sIB extends RandomizableClusterer implements TechnicalInformationHandler
Cluster data using the sequential information bottleneck algorithm.

Note: only the hard clustering scheme is supported. sIB assigns each instance to the cluster that has the minimum cost/distance to the instance. The trade-off parameter beta is set to infinity, so 1/beta is zero.
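
The hard assignment described in the note can be illustrated with a small sketch. It assumes the merge cost from the cited paper, d(x, t) = (p(x) + p(t)) * JS(p(y|x), p(y|t)), where JS is the Jensen-Shannon divergence weighted by the two priors. The class and method names below are hypothetical and are not part of Weka; all priors are assumed to be strictly positive.

```java
/** Illustrative sketch only; this is not the Weka implementation. */
class HardAssignmentSketch {

    /** Kullback-Leibler divergence KL(p || q) in nats; terms with p[i] == 0 contribute 0. */
    static double kl(double[] p, double[] q) {
        double sum = 0.0;
        for (int i = 0; i < p.length; i++) {
            if (p[i] > 0.0) {
                sum += p[i] * Math.log(p[i] / q[i]);
            }
        }
        return sum;
    }

    /** Jensen-Shannon divergence with mixing weights w1 and 1 - w1. */
    static double js(double w1, double[] p, double[] q) {
        double w2 = 1.0 - w1;
        double[] mix = new double[p.length];
        for (int i = 0; i < p.length; i++) {
            mix[i] = w1 * p[i] + w2 * q[i];
        }
        return w1 * kl(p, mix) + w2 * kl(q, mix);
    }

    /** Merge cost d(x, t) = (p(x) + p(t)) * JS(p(y|x), p(y|t)). */
    static double cost(double px, double[] pyGivenX, double pt, double[] pyGivenT) {
        double total = px + pt;
        return total * js(px / total, pyGivenX, pyGivenT);
    }

    /** Returns the index of the cluster with the minimum merge cost. */
    static int assign(double px, double[] pyGivenX, double[] pt, double[][] pyGivenT) {
        int best = 0;
        double bestCost = Double.POSITIVE_INFINITY;
        for (int t = 0; t < pt.length; t++) {
            double c = cost(px, pyGivenX, pt[t], pyGivenT[t]);
            if (c < bestCost) {
                bestCost = c;
                best = t;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        double[][] clusters = { {0.7, 0.2, 0.1}, {0.1, 0.2, 0.7} };
        double[] clusterPriors = {0.5, 0.5};
        double[] instance = {0.6, 0.3, 0.1};
        System.out.println(assign(0.1, instance, clusterPriors, clusters)); // prints 0
    }
}
```

Because 1/beta is zero, no soft membership is computed; the instance simply goes to the single cluster with the smallest cost.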

For more information, see:

Noam Slonim, Nir Friedman, Naftali Tishby: Unsupervised document classification using sequential information maximization. In: Proceedings of the 25th International ACM SIGIR Conference on Research and Development in Information Retrieval, 129-136, 2002.

BibTeX:

 @inproceedings{Slonim2002,
    author = {Noam Slonim and Nir Friedman and Naftali Tishby},
    booktitle = {Proceedings of the 25th International ACM SIGIR Conference on Research and Development in Information Retrieval},
    pages = {129-136},
    title = {Unsupervised document classification using sequential information maximization},
    year = {2002}
 }
 

Valid options are:

 -I <num>
  maximum number of iterations
  (default 100).
 -M <num>
  minimum number of changes in a single iteration
  (default 0).
 -N <num>
  number of clusters.
  (default 2).
 -R <num>
  number of restarts.
  (default 5).
 -U
  set not to normalize the data
  (default true).
 -V
  set to output debug info
  (default false).
 -S <num>
  Random number seed.
  (default 1)
Version:
$Revision: 5538 $
Author:
Noam Slonim, Anna Huang
  • Constructor Detail

    • sIB

      public sIB()
  • Method Detail

    • buildClusterer

      public void buildClusterer(Instances data) throws Exception
      Generates a clusterer.
      Specified by:
      buildClusterer in interface Clusterer
      Specified by:
      buildClusterer in class AbstractClusterer
      Parameters:
      data - the training instances
      Throws:
      Exception - if something goes wrong
    • clusterInstance

      public int clusterInstance(Instance instance) throws Exception
      Clusters a given instance. This is the method defined in the Clusterer interface; it does nothing but return the cluster assigned to the instance.
      Specified by:
      clusterInstance in interface Clusterer
      Overrides:
      clusterInstance in class AbstractClusterer
      Parameters:
      instance - the instance to be assigned to a cluster
      Returns:
      the number of the assigned cluster as an integer
      Throws:
      Exception - if instance could not be clustered successfully
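
A minimal usage sketch for buildClusterer and clusterInstance follows. It assumes a Weka release that ships weka.clusterers.sIB on the classpath (or has the corresponding package installed). The toy ARFF data and the class name SibClusteringExample are hypothetical and serve only as illustration.

```java
import java.io.StringReader;

import weka.clusterers.sIB;
import weka.core.Instances;

class SibClusteringExample {

    // Hypothetical toy data: four "documents" described by three term counts.
    static final String ARFF = String.join("\n",
        "@relation toy",
        "@attribute term1 numeric",
        "@attribute term2 numeric",
        "@attribute term3 numeric",
        "@data",
        "5,1,1",
        "4,2,1",
        "1,1,5",
        "1,2,4",
        "");

    /** Builds an sIB clusterer with the given number of clusters and random seed. */
    static sIB build(Instances data, int numClusters, int seed) throws Exception {
        sIB clusterer = new sIB();
        clusterer.setNumClusters(numClusters);
        clusterer.setSeed(seed);
        clusterer.buildClusterer(data);
        return clusterer;
    }

    public static void main(String[] args) throws Exception {
        // Parse the in-memory ARFF text; no class attribute is set for clustering.
        Instances data = new Instances(new StringReader(ARFF));
        sIB clusterer = build(data, 2, 1);
        for (int i = 0; i < data.numInstances(); i++) {
            System.out.println(i + " -> " + clusterer.clusterInstance(data.instance(i)));
        }
    }
}
```

The cluster indices returned depend on the seed and the number of restarts, so no particular labelling should be expected.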
    • setOptions

      public void setOptions(String[] options) throws Exception
      Parses a given list of options.

      Valid options are:

       -I <num>
        maximum number of iterations
        (default 100).
       -M <num>
        minimum number of changes in a single iteration
        (default 0).
       -N <num>
        number of clusters.
        (default 2).
       -R <num>
        number of restarts.
        (default 5).
       -U
        set not to normalize the data
        (default true).
       -V
        set to output debug info
        (default false).
       -S <num>
        Random number seed.
        (default 1)
      Specified by:
      setOptions in interface OptionHandler
      Overrides:
      setOptions in class RandomizableClusterer
      Parameters:
      options - the list of options as an array of strings
      Throws:
      Exception - if an option is not supported
    • listOptions

      public Enumeration listOptions()
      Returns an enumeration describing the available options.
      Specified by:
      listOptions in interface OptionHandler
      Overrides:
      listOptions in class RandomizableClusterer
      Returns:
      an enumeration of all the available options
    • getOptions

      public String[] getOptions()
      Gets the current settings.
      Specified by:
      getOptions in interface OptionHandler
      Overrides:
      getOptions in class RandomizableClusterer
      Returns:
      an array of strings suitable for passing to setOptions()
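
The option-handling methods can be exercised as in the sketch below, which parses an option string with weka.core.Utils.splitOptions, applies it through setOptions, and prints the settings reported by getOptions. It assumes weka.clusterers.sIB is available on the classpath; the class name SibOptionsExample and the chosen option values are hypothetical.

```java
import weka.clusterers.sIB;
import weka.core.Utils;

class SibOptionsExample {

    /** Creates an sIB clusterer configured from a command-line style option string. */
    static sIB configure(String optionString) throws Exception {
        sIB clusterer = new sIB();
        clusterer.setOptions(Utils.splitOptions(optionString));
        return clusterer;
    }

    public static void main(String[] args) throws Exception {
        // 50 iterations at most, stop at 1 change or fewer, 3 clusters, 10 restarts, seed 42.
        sIB clusterer = configure("-I 50 -M 1 -N 3 -R 10 -S 42");
        // getOptions() returns an array that can be passed back to setOptions().
        System.out.println(Utils.joinOptions(clusterer.getOptions()));
        System.out.println("clusters: " + clusterer.getNumClusters());
    }
}
```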
    • debugTipText

      public String debugTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter GUI
    • setDebug

      public void setDebug(boolean v)
      Set debug mode - verbose output
      Parameters:
      v - true for verbose output
    • getDebug

      public boolean getDebug()
      Get debug mode
      Returns:
      true if debug mode is set
    • maxIterationsTipText

      public String maxIterationsTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property
    • setMaxIterations

      public void setMaxIterations(int i)
      Set the max number of iterations
      Parameters:
      i - max number of iterations
    • getMaxIterations

      public int getMaxIterations()
      Get the max number of iterations
      Returns:
      max number of iterations
    • minChangeTipText

      public String minChangeTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property
    • setMinChange

      public void setMinChange(int m)
      Set the minimum number of changes
      Parameters:
      m - the minimum number of changes
    • getMinChange

      public int getMinChange()
      Get the minimum number of changes
      Returns:
      the minimum number of changes
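
How the maximum number of iterations and the minimum number of changes could interact is sketched below. This is an assumption about the stopping rule, not code taken from Weka: a restart is taken to end once a full pass over the data changes no more than minChange assignments, or once maxIterations passes have run. The class StoppingRuleSketch and its pass callback are hypothetical.

```java
import java.util.function.IntSupplier;

/** Illustrative sketch of a stopping rule; this is not the Weka implementation. */
class StoppingRuleSketch {

    /**
     * Runs passes until one changes at most minChange assignments
     * or maxIterations passes have been executed.
     *
     * @param pass performs one pass over the data and returns the number of reassignments
     * @return the number of passes executed
     */
    static int runPasses(int maxIterations, int minChange, IntSupplier pass) {
        int iterations = 0;
        while (iterations < maxIterations) {
            int changes = pass.getAsInt();
            iterations++;
            if (changes <= minChange) {
                break; // converged according to the minimum-change threshold
            }
        }
        return iterations;
    }

    public static void main(String[] args) {
        // Simulated numbers of reassignments in consecutive passes.
        int[] changesPerPass = {5, 3, 0, 0};
        int[] next = {0};
        int passes = runPasses(100, 0, () -> changesPerPass[next[0]++]);
        System.out.println(passes); // prints 3
    }
}
```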
    • numClustersTipText

      public String numClustersTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property
    • setNumClusters

      public void setNumClusters(int n)
      Set the number of clusters
      Parameters:
      n - number of clusters
    • getNumClusters

      public int getNumClusters()
      Get the number of clusters
      Returns:
      the number of clusters
    • numberOfClusters

      public int numberOfClusters()
      Get the number of clusters
      Specified by:
      numberOfClusters in interface Clusterer
      Specified by:
      numberOfClusters in class AbstractClusterer
      Returns:
      the number of clusters
    • numRestartsTipText

      public String numRestartsTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property
    • setNumRestarts

      public void setNumRestarts(int i)
      Set the number of restarts
      Parameters:
      i - number of restarts
    • getNumRestarts

      public int getNumRestarts()
      Get the number of restarts
      Returns:
      number of restarts
    • notUnifyNormTipText

      public String notUnifyNormTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property
    • setNotUnifyNorm

      public void setNotUnifyNorm(boolean b)
      Set whether to normalize instances to unify prior probability before building the clusterer
      Parameters:
      b - true to normalize, otherwise false
    • getNotUnifyNorm

      public boolean getNotUnifyNorm()
      Get whether to normalize instances to unify prior probability before building the clusterer
      Returns:
      true if set to normalize, false otherwise
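
What normalizing an instance might look like is sketched below. It assumes that normalization rescales the attribute values of each instance so they sum to one, which gives every instance the same total mass; the actual Weka implementation may differ. The class NormalizeSketch is hypothetical.

```java
import java.util.Arrays;

/** Illustrative sketch only; this is not the Weka implementation. */
class NormalizeSketch {

    /** Returns a copy of the counts rescaled to sum to 1; an all-zero row is returned unchanged. */
    static double[] normalize(double[] counts) {
        double sum = 0.0;
        for (double c : counts) {
            sum += c;
        }
        double[] out = counts.clone();
        if (sum > 0.0) {
            for (int i = 0; i < out.length; i++) {
                out[i] /= sum;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(normalize(new double[] {2, 1, 1})));
        // prints [0.5, 0.25, 0.25]
    }
}
```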
    • globalInfo

      public String globalInfo()
      Returns a string describing this clusterer
      Returns:
      a description of the clusterer suitable for displaying in the explorer/experimenter gui
    • getTechnicalInformation

      public TechnicalInformation getTechnicalInformation()
      Returns an instance of a TechnicalInformation object, containing detailed information about the technical background of this class, e.g., paper reference or book this class is based on.
      Specified by:
      getTechnicalInformation in interface TechnicalInformationHandler
      Returns:
      the technical information about this class
    • getCapabilities

      public Capabilities getCapabilities()
      Returns default capabilities of the clusterer.
      Specified by:
      getCapabilities in interface CapabilitiesHandler
      Specified by:
      getCapabilities in interface Clusterer
      Overrides:
      getCapabilities in class AbstractClusterer
      Returns:
      the capabilities of this clusterer
    • toString

      public String toString()
      Overrides:
      toString in class Object
    • getRevision

      public String getRevision()
      Returns the revision string.
      Specified by:
      getRevision in interface RevisionHandler
      Overrides:
      getRevision in class AbstractClusterer
      Returns:
      the revision
    • main

      public static void main(String[] argv)