类 SubspaceCluster
java.lang.Object
weka.datagenerators.DataGenerator
weka.datagenerators.ClusterGenerator
weka.datagenerators.clusterers.SubspaceCluster
- 所有已实现的接口:
Serializable
,OptionHandler
,Randomizable
,RevisionHandler
A data generator that produces data points in hyperrectangular subspace clusters.
Valid options are:
-h Prints this help.
-o <file> The name of the output file, otherwise the generated data is printed to stdout.
-r <name> The name of the relation.
-d Whether to print debug informations.
-S The seed for random function (default 1)
-a <num> The number of attributes (default 1).
-c Class Flag, if set, the cluster is listed in extra attribute.
-b <range> The indices for boolean attributes.
-m <range> The indices for nominal attributes.
-P <num> The noise rate in percent (default 0.0). Can be between 0% and 30%. (Remark: The original algorithm only allows noise up to 10%.)
-C <cluster-definition> A cluster definition of class 'SubspaceClusterDefinition' (definition needs to be quoted to be recognized as a single argument).
Options specific to weka.datagenerators.clusterers.SubspaceClusterDefinition:
-A <range> Generates randomly distributed instances in the cluster.
-U <range> Generates uniformly distributed instances in the cluster.
-G <range> Generates gaussian distributed instances in the cluster.
-D <num>,<num> The attribute min/max (-A and -U) or mean/stddev (-G) for the cluster.
-N <num>..<num> The range of number of instances per cluster (default 1..50).
-I Uses integer instead of continuous values (default continuous).
- 版本:
- $Revision: 1.5 $
- 作者:
- Gabi Schmidberger (gabi@cs.waikato.ac.nz), FracPete (fracpete at waikato dot ac dot nz)
- 另请参阅:
-
字段概要
字段修饰符和类型字段说明static final int
cluster subtype: continuousstatic final int
cluster type: gaussianstatic final int
cluster subtype: integerstatic final Tag[]
the tags for the cluster typesstatic final Tag[]
the tags for the cluster typesstatic final int
cluster type: total uniformstatic final int
cluster type: uniform/random -
构造器概要
构造器构造器说明initializes the generator, sets the number of clusters to 0, since user has to specify them explicitly -
方法概要
修饰符和类型方法说明Returns the tip text for this propertyInitializes the format for the dataset produced.Generate an example of the dataset.Generate all examples of the dataset.Compiles documentation about the data generation after the generation processCompiles documentation about the data generation before the generation processreturns the currently set clustersdouble
Gets the percentage of noise set.int[]
returns array that stores the number of values for a nominal attribute.String[]
Gets the current settings of the datagenerator.Returns the revision string.boolean
Gets the single mode flag.Returns a string describing this data generator.boolean
isBoolean
(int index) Returns true if attribute is booleanboolean
isNominal
(int index) Returns true if attribute is nominalReturns an enumeration describing the available options.static void
Main method for testing this class.Returns the tip text for this propertyReturns the tip text for this propertyvoid
setClusterDefinitions
(ClusterDefinition[] value) sets the clusters to usevoid
setNoiseRate
(double newNoiseRate) Sets the percentage of noise set.void
setNumAttributes
(int numAttributes) Sets the number of attributes the dataset should have.void
setOptions
(String[] options) Parses a list of options for this object.从类继承的方法 weka.datagenerators.ClusterGenerator
booleanColsTipText, classFlagTipText, getBooleanCols, getClassFlag, getNominalCols, getNumAttributes, nominalColsTipText, setBooleanCols, setBooleanIndices, setClassFlag, setNominalCols, setNominalIndices
从类继承的方法 weka.datagenerators.DataGenerator
debugTipText, defaultOutput, formatTipText, getDatasetFormat, getDebug, getNumExamplesAct, getOutput, getRandom, getRelationName, getSeed, makeData, outputTipText, randomTipText, relationNameTipText, seedTipText, setDatasetFormat, setDebug, setOutput, setRandom, setRelationName, setSeed
-
字段详细资料
-
UNIFORM_RANDOM
public static final int UNIFORM_RANDOMcluster type: uniform/random- 另请参阅:
-
TOTAL_UNIFORM
public static final int TOTAL_UNIFORMcluster type: total uniform- 另请参阅:
-
GAUSSIAN
public static final int GAUSSIANcluster type: gaussian- 另请参阅:
-
TAGS_CLUSTERTYPE
the tags for the cluster types -
CONTINUOUS
public static final int CONTINUOUScluster subtype: continuous- 另请参阅:
-
INTEGER
public static final int INTEGERcluster subtype: integer- 另请参阅:
-
TAGS_CLUSTERSUBTYPE
the tags for the cluster types
-
-
构造器详细资料
-
SubspaceCluster
public SubspaceCluster()initializes the generator, sets the number of clusters to 0, since user has to specify them explicitly
-
-
方法详细资料
-
globalInfo
Returns a string describing this data generator.- 返回:
- a description of the data generator suitable for displaying in the explorer/experimenter gui
-
listOptions
Returns an enumeration describing the available options.- 指定者:
listOptions
在接口中OptionHandler
- 覆盖:
listOptions
在类中ClusterGenerator
- 返回:
- an enumeration of all the available options
-
setOptions
Parses a list of options for this object. Valid options are:-h Prints this help.
-o <file> The name of the output file, otherwise the generated data is printed to stdout.
-r <name> The name of the relation.
-d Whether to print debug informations.
-S The seed for random function (default 1)
-a <num> The number of attributes (default 1).
-c Class Flag, if set, the cluster is listed in extra attribute.
-b <range> The indices for boolean attributes.
-m <range> The indices for nominal attributes.
-P <num> The noise rate in percent (default 0.0). Can be between 0% and 30%. (Remark: The original algorithm only allows noise up to 10%.)
-C <cluster-definition> A cluster definition of class 'SubspaceClusterDefinition' (definition needs to be quoted to be recognized as a single argument).
Options specific to weka.datagenerators.clusterers.SubspaceClusterDefinition:
-A <range> Generates randomly distributed instances in the cluster.
-U <range> Generates uniformly distributed instances in the cluster.
-G <range> Generates gaussian distributed instances in the cluster.
-D <num>,<num> The attribute min/max (-A and -U) or mean/stddev (-G) for the cluster.
-N <num>..<num> The range of number of instances per cluster (default 1..50).
-I Uses integer instead of continuous values (default continuous).
- 指定者:
setOptions
在接口中OptionHandler
- 覆盖:
setOptions
在类中ClusterGenerator
- 参数:
options
- the list of options as an array of strings- 抛出:
Exception
- if an option is not supported
-
getOptions
Gets the current settings of the datagenerator.- 指定者:
getOptions
在接口中OptionHandler
- 覆盖:
getOptions
在类中ClusterGenerator
- 返回:
- an array of strings suitable for passing to setOptions
- 另请参阅:
-
DataGenerator.removeBlacklist(String[])
-
setNumAttributes
public void setNumAttributes(int numAttributes) Sets the number of attributes the dataset should have.- 覆盖:
setNumAttributes
在类中ClusterGenerator
- 参数:
numAttributes
- the new number of attributes
-
numAttributesTipText
Returns the tip text for this property- 覆盖:
numAttributesTipText
在类中ClusterGenerator
- 返回:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getNoiseRate
public double getNoiseRate()Gets the percentage of noise set.- 返回:
- the percentage of noise set
-
setNoiseRate
public void setNoiseRate(double newNoiseRate) Sets the percentage of noise set.- 参数:
newNoiseRate
- new percentage of noise
-
noiseRateTipText
Returns the tip text for this property- 返回:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getClusterDefinitions
returns the currently set clusters- 返回:
- the currently set clusters
-
setClusterDefinitions
sets the clusters to use- 参数:
value
- the clusters do use- 抛出:
Exception
- if clusters are not the correct class
-
clusterDefinitionsTipText
Returns the tip text for this property- 返回:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getSingleModeFlag
public boolean getSingleModeFlag()Gets the single mode flag.- 指定者:
getSingleModeFlag
在类中DataGenerator
- 返回:
- true if methode generateExample can be used.
-
defineDataFormat
Initializes the format for the dataset produced.- 覆盖:
defineDataFormat
在类中DataGenerator
- 返回:
- the output data format
- 抛出:
Exception
- data format could not be defined- 另请参阅:
-
DataGenerator.defaultRelationName()
-
isBoolean
public boolean isBoolean(int index) Returns true if attribute is boolean- 参数:
index
- of the attribute- 返回:
- true if the attribute is boolean
-
isNominal
public boolean isNominal(int index) Returns true if attribute is nominal- 参数:
index
- of the attribute- 返回:
- true if the attribute is nominal
-
getNumValues
public int[] getNumValues()returns array that stores the number of values for a nominal attribute.- 返回:
- the array that stores the number of values for a nominal attribute
-
generateExample
Generate an example of the dataset.- 指定者:
generateExample
在类中DataGenerator
- 返回:
- the instance generated
- 抛出:
Exception
- if format not defined or generating
examples one by one is not possible, because voting is chosen
-
generateExamples
Generate all examples of the dataset.- 指定者:
generateExamples
在类中DataGenerator
- 返回:
- the instance generated
- 抛出:
Exception
- if format not defined
-
generateFinished
Compiles documentation about the data generation after the generation process- 指定者:
generateFinished
在类中DataGenerator
- 返回:
- string with additional information about generated dataset
- 抛出:
Exception
- no input structure has been defined
-
generateStart
Compiles documentation about the data generation before the generation process- 指定者:
generateStart
在类中DataGenerator
- 返回:
- string with additional information
-
getRevision
Returns the revision string.- 返回:
- the revision
-
main
Main method for testing this class.- 参数:
args
- should contain arguments for the data producer:
-