Class RemoveFrequentValues

java.lang.Object
weka.filters.Filter
weka.filters.unsupervised.instance.RemoveFrequentValues
All Implemented Interfaces:
Serializable, CapabilitiesHandler, OptionHandler, RevisionHandler, UnsupervisedFilter

public class RemoveFrequentValues extends Filter implements OptionHandler, UnsupervisedFilter
Determines which values (frequent or infrequent ones) of a (nominal) attribute are retained and filters the instances accordingly. Values with the same frequency are kept in the order in which they appear in the original Instances object. For example, if you have the values "1,2,3,4" with the frequencies "10,5,5,3" and you choose to keep the 2 most common values, the values "1,2" are returned, since the value "2" comes before "3" even though they have the same frequency.
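The tie-breaking rule described above can be illustrated with a small standalone sketch. This is not the filter's actual implementation: the class name FrequentValueSketch and the method mostFrequent are hypothetical, and the sketch only assumes that value counts are available as an array in the attribute's original value order. It relies on a stable sort so that values with equal counts keep their original order.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class FrequentValueSketch {

    /**
     * Returns the indices of the n most frequent values (hypothetical helper).
     * Values with equal counts keep their original relative order because
     * the sort used here is stable.
     */
    public static List<Integer> mostFrequent(int[] counts, int n) {
        List<Integer> indices = new ArrayList<>();
        for (int i = 0; i < counts.length; i++) {
            indices.add(i);
        }
        // Sort by count, descending; ties stay in original order.
        indices.sort(Comparator.comparingInt((Integer i) -> counts[i]).reversed());
        return new ArrayList<>(indices.subList(0, Math.min(n, indices.size())));
    }

    public static void main(String[] args) {
        String[] values = {"1", "2", "3", "4"};
        int[] counts = {10, 5, 5, 3};
        // Keep the 2 most common values: prints "1" and then "2".
        for (int i : mostFrequent(counts, 2)) {
            System.out.println(values[i]);
        }
    }
}
```

With the frequencies from the example, the value "3" is dropped even though it is as frequent as "2", because "2" appears earlier in the value list.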

Valid options are:

 -C <num>
  Choose attribute to be used for selection.
 -N <num>
  Number of values to retain for the specified attribute,
  i.e. the ones with the most instances (default 2).
 -L
  Instead of values with the most instances the ones with the 
  least are retained.
 
 -H
  When selecting on nominal attributes, removes header
  references to excluded values.
 -V
  Invert matching sense.
Version:
$Revision: 8972 $
Author:
FracPete (fracpete at waikato dot ac dot nz)
  • Constructor Detail

    • RemoveFrequentValues

      public RemoveFrequentValues()
  • Method Detail

    • globalInfo

      public String globalInfo()
      Returns a string describing this filter.
      Returns:
      a description of the filter suitable for displaying in the explorer/experimenter gui
    • listOptions

      public Enumeration listOptions()
      Returns an enumeration describing the available options.
      Specified by:
      listOptions in interface OptionHandler
      Returns:
      an enumeration of all the available options.
    • setOptions

      public void setOptions(String[] options) throws Exception
      Parses a given list of options.

      Valid options are:

       -C <num>
        Choose attribute to be used for selection.
       -N <num>
        Number of values to retain for the specified attribute,
        i.e. the ones with the most instances (default 2).
       -L
        Instead of values with the most instances the ones with the 
        least are retained.
       
       -H
        When selecting on nominal attributes, removes header
        references to excluded values.
       -V
        Invert matching sense.
      Specified by:
      setOptions in interface OptionHandler
      Parameters:
      options - the list of options as an array of strings
      Throws:
      Exception - if an option is not supported
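As a rough illustration of the option list above, the following standalone sketch assembles an option array from the documented flags. The class name RemoveFrequentValuesOptionsSketch and the helper buildOptions are hypothetical and not part of Weka. The sketch deliberately does not call Weka, so it runs without the library on the classpath; in real use the resulting array would be passed to setOptions(String[]) on a RemoveFrequentValues instance.

```java
import java.util.ArrayList;
import java.util.List;

public class RemoveFrequentValuesOptionsSketch {

    /** Builds an option array using only the flags documented above (hypothetical helper). */
    public static String[] buildOptions(int attributeIndex, int numValues,
                                        boolean useLeast, boolean modifyHeader,
                                        boolean invert) {
        List<String> opts = new ArrayList<>();
        opts.add("-C");
        opts.add(Integer.toString(attributeIndex)); // attribute used for selection
        opts.add("-N");
        opts.add(Integer.toString(numValues));      // number of values to retain
        if (useLeast) {
            opts.add("-L");                         // retain the least frequent values
        }
        if (modifyHeader) {
            opts.add("-H");                         // drop excluded values from the header
        }
        if (invert) {
            opts.add("-V");                         // invert matching sense
        }
        return opts.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Retain the 3 least frequent values of attribute 2 and prune the header.
        String[] options = buildOptions(2, 3, true, true, false);
        System.out.println(String.join(" ", options));
    }
}
```

Flags without an argument (-L, -H, -V) are simply present or absent in the array, which matches how they are listed above.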
    • getOptions

      public String[] getOptions()
      Gets the current settings of the filter.
      Specified by:
      getOptions in interface OptionHandler
      Returns:
      an array of strings suitable for passing to setOptions
    • attributeIndexTipText

      public String attributeIndexTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • getAttributeIndex

      public String getAttributeIndex()
      Gets the index of the attribute used.
      Returns:
      the index of the attribute
    • setAttributeIndex

      public void setAttributeIndex(String attIndex)
      Sets the index of the attribute used.
      Parameters:
      attIndex - the index of the attribute
    • numValuesTipText

      public String numValuesTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • getNumValues

      public int getNumValues()
      Gets how many values are retained.
      Returns:
      how many values are retained
    • setNumValues

      public void setNumValues(int numValues)
      Sets how many values are retained.
      Parameters:
      numValues - the number of values to retain
    • useLeastValuesTipText

      public String useLeastValuesTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • getUseLeastValues

      public boolean getUseLeastValues()
      Gets whether the values with the least or the most instances are used.
      Returns:
      true if values with least instances are retained
    • setUseLeastValues

      public void setUseLeastValues(boolean leastValues)
      Sets whether the values with the least or the most instances are used.
      Parameters:
      leastValues - whether values with least or most instances are retained
    • modifyHeaderTipText

      public String modifyHeaderTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • getModifyHeader

      public boolean getModifyHeader()
      Gets whether the header will be modified when selecting on nominal attributes.
      Returns:
      true if so.
    • setModifyHeader

      public void setModifyHeader(boolean newModifyHeader)
      Sets whether the header will be modified when selecting on nominal attributes.
      Parameters:
      newModifyHeader - true if so.
    • invertSelectionTipText

      public String invertSelectionTipText()
      Returns the tip text for this property.
      Returns:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • getInvertSelection

      public boolean getInvertSelection()
      Gets whether the selected values are to be removed or kept.
      Returns:
      true if the selected values will be kept
    • setInvertSelection

      public void setInvertSelection(boolean invert)
      Sets whether selected values should be removed or kept. If true, the selected values are kept and unselected values are deleted.
      Parameters:
      invert - the new invert setting
    • isNominal

      public boolean isNominal()
      Returns true if selection attribute is nominal.
      Returns:
      true if selection attribute is nominal
    • determineValues

      public void determineValues(Instances inst)
      Determines the values to retain. The number of retained values is always at least 1 and at most the number of distinct values.
      Parameters:
      inst - the Instances from which the values to keep are determined
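The bounds stated for determineValues amount to clamping the requested count. A minimal sketch of that arithmetic follows; the class RetainCountSketch and the method effectiveRetainCount are hypothetical names, and this is not the filter's own code.

```java
public class RetainCountSketch {

    /**
     * Clamps the requested number of values to the range
     * [1, distinctValues] (hypothetical helper).
     */
    public static int effectiveRetainCount(int requested, int distinctValues) {
        return Math.max(1, Math.min(requested, distinctValues));
    }

    public static void main(String[] args) {
        System.out.println(effectiveRetainCount(0, 4)); // prints 1
        System.out.println(effectiveRetainCount(2, 4)); // prints 2
        System.out.println(effectiveRetainCount(9, 4)); // prints 4
    }
}
```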
    • getCapabilities

      public Capabilities getCapabilities()
      Returns the Capabilities of this filter.
      Specified by:
      getCapabilities in interface CapabilitiesHandler
      Overrides:
      getCapabilities in class Filter
      Returns:
      the capabilities of this object
    • setInputFormat

      public boolean setInputFormat(Instances instanceInfo) throws Exception
      Sets the format of the input instances.
      Overrides:
      setInputFormat in class Filter
      Parameters:
      instanceInfo - an Instances object containing the input instance structure (any instances contained in the object are ignored; only the structure is required).
      Returns:
      true if the outputFormat can be collected immediately
      Throws:
      UnsupportedAttributeTypeException - if the specified attribute is not nominal.
      Exception - if the inputFormat can't be set successfully
    • input

      public boolean input(Instance instance)
      Input an instance for filtering. Ordinarily the instance is processed and made available for output immediately. Some filters require all instances be read before producing output.
      Overrides:
      input in class Filter
      Parameters:
      instance - the input instance
      Returns:
      true if the filtered instance may now be collected with output().
      Throws:
      IllegalStateException - if no input format has been set.
    • batchFinished

      public boolean batchFinished()
      Signifies that this batch of input to the filter is finished. If the filter requires all instances prior to filtering, output() may now be called to retrieve the filtered instances.
      Overrides:
      batchFinished in class Filter
      Returns:
      true if there are instances pending output
      Throws:
      IllegalStateException - if no input structure has been defined
    • getRevision

      public String getRevision()
      Returns the revision string.
      Specified by:
      getRevision in interface RevisionHandler
      Overrides:
      getRevision in class Filter
      Returns:
      the revision
    • main

      public static void main(String[] argv)
      Main method for testing this class.
      Parameters:
      argv - should contain arguments to the filter: use -h for help