类 SnowballStemmer

java.lang.Object
weka.core.stemmers.SnowballStemmer
所有已实现的接口:
Serializable, OptionHandler, RevisionHandler, Stemmer

public class SnowballStemmer extends Object implements Stemmer, OptionHandler
A wrapper class for the Snowball stemmers. Only available if the Snowball classes are in the classpath.
If the class discovery is not dynamic, i.e., the property 'UseDynamic' in the props file 'weka/gui/GenericPropertiesCreator.props' is 'false', then the property 'org.tartarus.snowball.SnowballProgram' in the 'weka/gui/GenericObjectEditor.props' file has to be uncommented as well. If necessary you have to discover and fill in the snowball stemmers manually. You can use the 'weka.core.ClassDiscovery' for this:
java weka.core.ClassDiscovery org.tartarus.snowball.SnowballProgram org.tartarus.snowball.ext

For more information visit these web sites:
http://weka.wikispaces.com/Stemmers
http://snowball.tartarus.org/

Valid options are:

 -S <name>
  The name of the snowball stemmer (default 'porter').
  available stemmers:
     danish, dutch, english, finnish, french, german, italian, 
     norwegian, porter, portuguese, russian, spanish, swedish
 
版本:
$Revision: 5836 $
作者:
FracPete (fracpete at waikato dot ac dot nz)
另请参阅:
  • 字段详细资料

    • PACKAGE

      public static final String PACKAGE
      the package name for snowball.
      另请参阅:
    • PACKAGE_EXT

      public static final String PACKAGE_EXT
      the package name where the stemmers are located.
      另请参阅:
  • 构造器详细资料

    • SnowballStemmer

      public SnowballStemmer()
      initializes the stemmer ("porter").
    • SnowballStemmer

      public SnowballStemmer(String name)
      initializes the stemmer with the given stemmer.
      参数:
      name - the name of the stemmer
  • 方法详细资料

    • globalInfo

      public String globalInfo()
      Returns a string describing the stemmer.
      返回:
      a description suitable for displaying in the explorer/experimenter gui
    • listOptions

      public Enumeration listOptions()
      Returns an enumeration describing the available options.
      指定者:
      listOptions 在接口中 OptionHandler
      返回:
      an enumeration of all the available options.
    • setOptions

      public void setOptions(String[] options) throws Exception
      Parses the options.

      Valid options are:

       -S <name>
        The name of the snowball stemmer (default 'porter').
        available stemmers:
           danish, dutch, english, finnish, french, german, italian, 
           norwegian, porter, portuguese, russian, spanish, swedish
       
      指定者:
      setOptions 在接口中 OptionHandler
      参数:
      options - the options to parse
      抛出:
      Exception - if parsing fails
    • getOptions

      public String[] getOptions()
      Gets the current settings of the classifier.
      指定者:
      getOptions 在接口中 OptionHandler
      返回:
      an array of strings suitable for passing to setOptions
    • isPresent

      public static boolean isPresent()
      returns whether Snowball is present or not, i.e. whether the classes are in the classpath or not
      返回:
      whether Snowball is available
    • listStemmers

      public static Enumeration listStemmers()
      returns an enumeration over all currently stored stemmer names.
      返回:
      all available stemmers
    • getStemmer

      public String getStemmer()
      returns the name of the current stemmer, null if none is set.
      返回:
      the name of the stemmer
    • setStemmer

      public void setStemmer(String name)
      sets the stemmer with the given name, e.g., "porter".
      参数:
      name - the name of the stemmer, e.g., "porter"
    • stemmerTipText

      public String stemmerTipText()
      Returns the tip text for this property.
      返回:
      tip text for this property suitable for displaying in the explorer/experimenter gui
    • stem

      public String stem(String word)
      Returns the word in its stemmed form.
      指定者:
      stem 在接口中 Stemmer
      参数:
      word - the unstemmed word
      返回:
      the stemmed word
    • toString

      public String toString()
      returns a string representation of the stemmer.
      覆盖:
      toString 在类中 Object
      返回:
      a string representation of the stemmer
    • getRevision

      public String getRevision()
      Returns the revision string.
      指定者:
      getRevision 在接口中 RevisionHandler
      返回:
      the revision
    • main

      public static void main(String[] args)
      Runs the stemmer with the given options.
      参数:
      args - the options