Class SMOTE

java.lang.Object
weka.filters.Filter
weka.filters.supervised.instance.SMOTE
All Implemented Interfaces:
Serializable, CapabilitiesHandler, OptionHandler, RevisionHandler, TechnicalInformationHandler, SupervisedFilter

public class SMOTE extends Filter implements SupervisedFilter, OptionHandler, TechnicalInformationHandler
Resamples a dataset by applying the Synthetic Minority Oversampling TEchnique (SMOTE). The original dataset must fit entirely in memory. The amount of oversampling (as a percentage of SMOTE instances to create) and the number of nearest neighbors may be specified. For more information, see:

Nitesh V. Chawla et al. (2002). Synthetic Minority Over-sampling Technique. Journal of Artificial Intelligence Research. 16:321-357.

BibTeX:

 @article{al.2002,
    author = {Nitesh V. Chawla et al.},
    journal = {Journal of Artificial Intelligence Research},
    pages = {321-357},
    title = {Synthetic Minority Over-sampling Technique},
    volume = {16},
    year = {2002}
 }
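
The core idea of the technique cited above is that each synthetic instance is placed between a minority-class instance and one of its nearest neighbors. The following is a minimal sketch of that interpolation step for numeric attributes only. The class and method names (`SmoteInterpolationSketch`, `synthesize`) are hypothetical and not part of this class, the neighbor is assumed to have been found already, and a single random gap is used for the whole instance as a simplification; some implementations draw a fresh gap per attribute.

```java
import java.util.Random;

// Hypothetical illustration; not part of the weka API.
class SmoteInterpolationSketch {

    /** Returns a synthetic point on the segment between sample and neighbor. */
    static double[] synthesize(double[] sample, double[] neighbor, Random rng) {
        double gap = rng.nextDouble(); // uniform in [0, 1)
        double[] synthetic = new double[sample.length];
        for (int i = 0; i < sample.length; i++) {
            // Move a fraction "gap" of the way from the sample towards the neighbor.
            synthetic[i] = sample[i] + gap * (neighbor[i] - sample[i]);
        }
        return synthetic;
    }

    public static void main(String[] args) {
        Random rng = new Random(1); // fixed seed for reproducibility
        double[] s = synthesize(new double[] {1.0, 2.0}, new double[] {3.0, 6.0}, rng);
        System.out.println(s[0] + ", " + s[1]);
    }
}
```

Because the gap lies in [0, 1), the synthetic point never leaves the segment between the two original instances.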
 

Valid options are:

 -S <num>
  Specifies the random number seed
  (default 1)
 
 -P <percentage>
  Specifies percentage of SMOTE instances to create.
  (default 100.0)
 
 -K <nearest-neighbors>
  Specifies the number of nearest neighbors to use.
  (default 5)
 
 -C <value-index>
  Specifies the index of the nominal class value to SMOTE
  (default 0: auto-detect non-empty minority class)
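
To make the -P option concrete, the sketch below estimates how many synthetic instances a given percentage would produce. It assumes the percentage is taken relative to the size of the class being oversampled and that any fractional instance is truncated; the exact rounding used by the filter is an assumption here, and `SmoteCountSketch` and `syntheticCount` are hypothetical names.

```java
// Hypothetical illustration; the rounding rule is an assumption, not confirmed by this page.
class SmoteCountSketch {

    /** Estimated number of synthetic instances for a class of the given size. */
    static int syntheticCount(int minorityCount, double percentage) {
        int fullPasses = (int) Math.floor(percentage / 100.0);  // whole multiples of the class size
        double remainder = percentage / 100.0 - fullPasses;     // leftover fraction of one pass
        int extra = (int) (remainder * minorityCount);          // truncated partial pass
        return fullPasses * minorityCount + extra;
    }

    public static void main(String[] args) {
        System.out.println(syntheticCount(50, 100.0)); // prints 50
        System.out.println(syntheticCount(50, 250.0)); // prints 125
    }
}
```

Under these assumptions the default of 100.0 doubles the size of the targeted class.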
 
Version:
$Revision: 9657 $
Author:
Ryan Lichtenwalter (rlichtenwalter@gmail.com)
  • Constructor Detail

    • SMOTE

      public SMOTE()
  • Method Detail

    • globalInfo

      public String globalInfo()
      Returns a string describing this filter.
      Returns:
      a description of the filter suitable for displaying in the explorer/experimenter gui
    • getTechnicalInformation

      public TechnicalInformation getTechnicalInformation()
      Returns an instance of a TechnicalInformation object, containing detailed information about the technical background of this class, e.g., paper reference or book this class is based on.
      Specified by:
      getTechnicalInformation in interface TechnicalInformationHandler
      Returns:
      the technical information about this class
    • getRevision

      public String getRevision()
      Returns the revision string.
      Specified by:
      getRevision in interface RevisionHandler
      Overrides:
      getRevision in class Filter
      Returns:
      the revision
    • getCapabilities

      public Capabilities getCapabilities()
      Returns the Capabilities of this filter.
      Specified by:
      getCapabilities in interface CapabilitiesHandler
      Overrides:
      getCapabilities in class Filter
      Returns:
      the capabilities of this object
    • listOptions

      public Enumeration listOptions()
      Returns an enumeration describing the available options.
      Specified by:
      listOptions in interface OptionHandler
      Returns:
      an enumeration of all the available options.
    • setOptions

      public void setOptions(String[] options) throws Exception
      Parses a given list of options. Valid options are:

       -S <num>
        Specifies the random number seed
        (default 1)
       
       -P <percentage>
        Specifies percentage of SMOTE instances to create.
        (default 100.0)
       
       -K <nearest-neighbors>
        Specifies the number of nearest neighbors to use.
        (default 5)
       
       -C <value-index>
        Specifies the index of the nominal class value to SMOTE
        (default 0: auto-detect non-empty minority class)
       
      Specified by:
      setOptions in interface OptionHandler
      Parameters:
      options - the list of options as an array of strings
      Throws:
      Exception - if an option is not supported
    • getOptions

      public String[] getOptions()
      Gets the current settings of the filter.
      Specified by:
      getOptions in interface OptionHandler
      Returns:
      an array of strings suitable for passing to setOptions
    • randomSeedTipText

      public String randomSeedTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • getRandomSeed

      public int getRandomSeed()
      Gets the random number seed.
      Returns:
      the random number seed.
    • setRandomSeed

      public void setRandomSeed(int value)
      Sets the random number seed.
      Parameters:
      value - the new random number seed.
    • percentageTipText

      public String percentageTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • setPercentage

      public void setPercentage(double value)
      Sets the percentage of SMOTE instances to create.
      Parameters:
      value - the percentage to use
    • getPercentage

      public double getPercentage()
      Gets the percentage of SMOTE instances to create.
      Returns:
      the percentage of SMOTE instances to create
    • nearestNeighborsTipText

      public String nearestNeighborsTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • setNearestNeighbors

      public void setNearestNeighbors(int value)
      Sets the number of nearest neighbors to use.
      Parameters:
      value - the number of nearest neighbors to use
    • getNearestNeighbors

      public int getNearestNeighbors()
      Gets the number of nearest neighbors to use.
      Returns:
      the number of nearest neighbors to use
    • classValueTipText

      public String classValueTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • setClassValue

      public void setClassValue(String value)
      Sets the index of the class value to which SMOTE should be applied.
      Parameters:
      value - the class value index
    • getClassValue

      public String getClassValue()
      Gets the index of the class value to which SMOTE should be applied.
      Returns:
      the index of the class value to which SMOTE should be applied
    • setInputFormat

      public boolean setInputFormat(Instances instanceInfo) throws Exception
      Sets the format of the input instances.
      Overrides:
      setInputFormat in class Filter
      Parameters:
      instanceInfo - an Instances object containing the input instance structure (any instances contained in the object are ignored; only the structure is required).
      Returns:
      true if the outputFormat may be collected immediately
      Throws:
      Exception - if the input format can't be set successfully
    • input

      public boolean input(Instance instance)
      Inputs an instance for filtering. The filter requires all training instances to be read before producing output.
      Overrides:
      input in class Filter
      Parameters:
      instance - the input instance
      Returns:
      true if the filtered instance may now be collected with output().
      Throws:
      IllegalStateException - if no input structure has been defined
    • batchFinished

      public boolean batchFinished() throws Exception
      Signify that this batch of input to the filter is finished. If the filter requires all instances prior to filtering, output() may now be called to retrieve the filtered instances.
      Overrides:
      batchFinished in class Filter
      Returns:
      true if there are instances pending output
      Throws:
      IllegalStateException - if no input structure has been defined
      Exception - if provided options cannot be executed on input instances
    • main

      public static void main(String[] args)
      Main method for running this filter.
      Parameters:
      args - should contain arguments to the filter: use -h for help
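
As an illustration of what the args array passed to main might contain, the sketch below assembles the four filter options documented above together with the general filter options -i (input file), -o (output file) and -c (class attribute index). The helper `buildArgs`, the class `SmoteArgsSketch` and the file names are hypothetical; the order of the flags is not expected to matter.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper; not part of the weka API.
class SmoteArgsSketch {

    /** Builds an argument array using the option flags documented for this filter. */
    static String[] buildArgs(String inputFile, String outputFile,
                              int seed, double percentage, int neighbors, String classValue) {
        List<String> args = new ArrayList<>();
        args.add("-i"); args.add(inputFile);                    // dataset to read
        args.add("-o"); args.add(outputFile);                   // dataset to write
        args.add("-c"); args.add("last");                       // class attribute (assumed to be the last one)
        args.add("-S"); args.add(Integer.toString(seed));       // random number seed
        args.add("-P"); args.add(Double.toString(percentage));  // percentage of SMOTE instances
        args.add("-K"); args.add(Integer.toString(neighbors));  // number of nearest neighbors
        args.add("-C"); args.add(classValue);                   // class value index, 0 = auto-detect
        return args.toArray(new String[0]);
    }

    public static void main(String[] unused) {
        // Documented defaults: seed 1, percentage 100.0, 5 neighbors, class value 0.
        String[] args = buildArgs("in.arff", "out.arff", 1, 100.0, 5, "0");
        System.out.println(String.join(" ", args));
    }
}
```

The same option strings, without -i, -o and -c, are what setOptions would receive when the filter is configured programmatically.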