程序包 weka.clusterers

类 DBSCAN

所有已实现的接口:
Serializable, Cloneable, Clusterer, CapabilitiesHandler, OptionHandler, RevisionHandler, TechnicalInformationHandler

public class DBSCAN extends AbstractClusterer implements OptionHandler, TechnicalInformationHandler
Basic implementation of DBSCAN clustering algorithm that should *not* be used as a reference for runtime benchmarks: more sophisticated implementations exist! Clustering of new instances is not supported. More info:

Martin Ester, Hans-Peter Kriegel, Joerg Sander, Xiaowei Xu: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: Second International Conference on Knowledge Discovery and Data Mining, 226-231, 1996.

BibTeX:

 @inproceedings{Ester1996,
    author = {Martin Ester and Hans-Peter Kriegel and Joerg Sander and Xiaowei Xu},
    booktitle = {Second International Conference on Knowledge Discovery and Data Mining},
    editor = {Evangelos Simoudis and Jiawei Han and Usama M. Fayyad},
    pages = {226-231},
    publisher = {AAAI Press},
    title = {A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise},
    year = {1996}
 }
 

Valid options are:

 -E <double>
  epsilon (default = 0.9)
 -M <int>
  minPoints (default = 6)
 -I <String>
  index (database) used for DBSCAN (default = weka.clusterers.forOPTICSAndDBScan.Databases.SequentialDatabase)
 -D <String>
  distance-type (default = weka.clusterers.forOPTICSAndDBScan.DataObjects.EuclideanDataObject)
版本:
$Revision: 9434 $
作者:
Matthias Schubert (schubert@dbs.ifi.lmu.de), Zhanna Melnikova-Albrecht (melnikov@cip.ifi.lmu.de), Rainer Holzmann (holzmann@cip.ifi.lmu.de)
另请参阅:
  • 构造器详细资料

    • DBSCAN

      public DBSCAN()
  • 方法详细资料

    • getCapabilities

      public Capabilities getCapabilities()
      Returns default capabilities of the clusterer.
      指定者:
      getCapabilities 在接口中 CapabilitiesHandler
      指定者:
      getCapabilities 在接口中 Clusterer
      覆盖:
      getCapabilities 在类中 AbstractClusterer
      返回:
      the capabilities of this clusterer
      另请参阅:
    • buildClusterer

      public void buildClusterer(Instances instances) throws Exception
      Generate Clustering via DBSCAN
      指定者:
      buildClusterer 在接口中 Clusterer
      指定者:
      buildClusterer 在类中 AbstractClusterer
      参数:
      instances - The instances that need to be clustered
      抛出:
      Exception - If clustering was not successful
    • clusterInstance

      public int clusterInstance(Instance instance) throws Exception
      Classifies a given instance.
      指定者:
      clusterInstance 在接口中 Clusterer
      覆盖:
      clusterInstance 在类中 AbstractClusterer
      参数:
      instance - The instance to be assigned to a cluster
      返回:
      int The number of the assigned cluster as an integer
      抛出:
      Exception - If instance could not be clustered successfully
    • numberOfClusters

      public int numberOfClusters() throws Exception
      Returns the number of clusters.
      指定者:
      numberOfClusters 在接口中 Clusterer
      指定者:
      numberOfClusters 在类中 AbstractClusterer
      返回:
      int The number of clusters generated for a training dataset.
      抛出:
      Exception - if number of clusters could not be returned successfully
    • listOptions

      public Enumeration listOptions()
      Returns an enumeration of all the available options..
      指定者:
      listOptions 在接口中 OptionHandler
      返回:
      Enumeration An enumeration of all available options.
    • setOptions

      public void setOptions(String[] options) throws Exception
      Sets the OptionHandler's options using the given list. All options will be set (or reset) during this call (i.e. incremental setting of options is not possible).

      Valid options are:

       -E <double>
        epsilon (default = 0.9)
       -M <int>
        minPoints (default = 6)
       -I <String>
        index (database) used for DBSCAN (default = weka.clusterers.forOPTICSAndDBScan.Databases.SequentialDatabase)
       -D <String>
        distance-type (default = weka.clusterers.forOPTICSAndDBScan.DataObjects.EuclideanDataObject)
      指定者:
      setOptions 在接口中 OptionHandler
      参数:
      options - The list of options as an array of strings
      抛出:
      Exception - If an option is not supported
    • getOptions

      public String[] getOptions()
      Gets the current option settings for the OptionHandler.
      指定者:
      getOptions 在接口中 OptionHandler
      返回:
      String[] The list of current option settings as an array of strings
    • databaseForName

      public Database databaseForName(String database_Type, Instances instances)
      Returns a new Class-Instance of the specified database
      参数:
      database_Type - String of the specified database
      instances - Instances that were delivered from WEKA
      返回:
      Database New constructed Database
    • dataObjectForName

      public DataObject dataObjectForName(String database_distanceType, Instance instance, String key, Database database)
      Returns a new Class-Instance of the specified database
      参数:
      database_distanceType - String of the specified distance-type
      instance - The original instance that needs to hold by this DataObject
      key - Key for this DataObject
      database - Link to the database
      返回:
      DataObject New constructed DataObject
    • setMinPoints

      public void setMinPoints(int minPoints)
      Sets a new value for minPoints
      参数:
      minPoints - MinPoints
    • setEpsilon

      public void setEpsilon(double epsilon)
      Sets a new value for epsilon
      参数:
      epsilon - Epsilon
    • getEpsilon

      public double getEpsilon()
      Returns the value of epsilon
      返回:
      double Epsilon
    • getMinPoints

      public int getMinPoints()
      Returns the value of minPoints
      返回:
      int MinPoints
    • getDatabase_distanceType

      public String getDatabase_distanceType()
      Returns the distance-type
      返回:
      String Distance-type
    • getDatabase_Type

      public String getDatabase_Type()
      Returns the type of the used index (database)
      返回:
      String Index-type
    • setDatabase_distanceType

      public void setDatabase_distanceType(String database_distanceType)
      Sets a new distance-type
      参数:
      database_distanceType - The new distance-type
    • setDatabase_Type

      public void setDatabase_Type(String database_Type)
      Sets a new database-type
      参数:
      database_Type - The new database-type
    • epsilonTipText

      public String epsilonTipText()
      Returns the tip text for this property
      返回:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • minPointsTipText

      public String minPointsTipText()
      Returns the tip text for this property
      返回:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • database_TypeTipText

      public String database_TypeTipText()
      Returns the tip text for this property
      返回:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • database_distanceTypeTipText

      public String database_distanceTypeTipText()
      Returns the tip text for this property
      返回:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • globalInfo

      public String globalInfo()
      Returns a string describing this DataMining-Algorithm
      返回:
      String Information for the gui-explorer
    • getTechnicalInformation

      public TechnicalInformation getTechnicalInformation()
      Returns an instance of a TechnicalInformation object, containing detailed information about the technical background of this class, e.g., paper reference or book this class is based on.
      指定者:
      getTechnicalInformation 在接口中 TechnicalInformationHandler
      返回:
      the technical information about this class
    • toString

      public String toString()
      Returns a description of the clusterer
      覆盖:
      toString 在类中 Object
      返回:
      a string representation of the clusterer
    • getRevision

      public String getRevision()
      Returns the revision string.
      指定者:
      getRevision 在接口中 RevisionHandler
      覆盖:
      getRevision 在类中 AbstractClusterer
      返回:
      the revision
    • main

      public static void main(String[] args)
      Main Method for testing DBSCAN
      参数:
      args - Valid parameters are: 'E' epsilon (default = 0.9); 'M' minPoints (default = 6); 'I' index-type (default = weka.clusterers.forOPTICSAndDBScan.Databases.SequentialDatabase); 'D' distance-type (default = weka.clusterers.forOPTICSAndDBScan.DataObjects.EuclideanDataObject);