Package weka.core
Class Stopwords
java.lang.Object
weka.core.Stopwords
- All Implemented Interfaces:
RevisionHandler
Class that can test whether a given string is a stop word.
Lowercases all words before the test.
The format for reading and writing is one word per line; lines starting
with '#' are interpreted as comments and therefore skipped.
The default stopwords are based on Rainbow.
Accepts the following parameters:
-i file
loads the stopwords from the given file
-o file
saves the stopwords to the given file
-p
outputs the current stopwords on stdout
Any additional parameters are interpreted as words to test as stopwords.
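The file format described above (one word per line, '#' comment lines skipped, words lower-cased) can be illustrated with a small stand-alone sketch. StopwordFormatSketch and its parse method are hypothetical names used only for illustration; they are not part of Weka, and skipping blank lines is an assumption rather than documented behavior.

```java
import java.io.BufferedReader;
import java.io.StringReader;
import java.util.Locale;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical stand-in, not part of Weka: parses the documented stopword file format.
class StopwordFormatSketch {

    // Reads one word per line, skips '#' comment lines and (by assumption) blank lines,
    // and stores every remaining word trimmed and in lower case.
    static SortedSet<String> parse(BufferedReader reader) {
        SortedSet<String> words = new TreeSet<String>();
        reader.lines()
              .map(String::trim)
              .filter(line -> !line.isEmpty() && !line.startsWith("#"))
              .forEach(line -> words.add(line.toLowerCase(Locale.ROOT)));
        return words;
    }

    public static void main(String[] args) {
        String content = "# my stopwords\nThe\n  and  \n\nof\n";
        SortedSet<String> words = parse(new BufferedReader(new StringReader(content)));
        System.out.println(words); // prints [and, of, the]
    }
}
```

A sorted set is used here only so that the printed order is deterministic; the class itself does not state which collection it uses internally.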
- Version:
- $Revision: 1.6 $
- Author:
- Eibe Frank (eibe@cs.waikato.ac.nz), Ashraf M. Kibriya (amk14@cs.waikato.ac.nz), FracPete (fracpete at waikato dot ac dot nz)
-
Constructor Summary
Constructor    Description
Stopwords()    Initializes the stopwords (based on Rainbow).
Method Summary
Modifier and Type    Method    Description
void    add(String word)    Adds the given word to the stopword list (the word is automatically converted to lower case and trimmed).
void    clear()    Removes all stopwords.
Enumeration    elements()    Returns a sorted enumeration over all stored stopwords.
String    getRevision()    Returns the revision string.
boolean    is(String word)    Returns true if the given string is a stop word.
static boolean    isStopword(String str)    Returns true if the given string is a stop word.
static void    main(String[] args)    Accepts the command-line parameters -i file, -o file, and -p, plus words to test.
void    read(BufferedReader reader)    Generates a new Stopwords object from the reader.
void    read(File file)    Generates a new Stopwords object from the given file.
void    read(String filename)    Generates a new Stopwords object from the given file.
boolean    remove(String word)    Removes the word from the stopword list.
String    toString()    Returns the current stopwords in a string.
void    write(BufferedWriter writer)    Writes the current stopwords to the given writer.
void    write(File file)    Writes the current stopwords to the given file.
void    write(String filename)    Writes the current stopwords to the given file.
-
Constructor Details
-
Stopwords
public Stopwords()
Initializes the stopwords (based on Rainbow).
-
Method Details
-
clear
public void clear()
Removes all stopwords.
-
add
public void add(String word)
Adds the given word to the stopword list (the word is automatically converted to lower case and trimmed).
- Parameters:
word - the word to add
-
remove
public boolean remove(String word)
Removes the word from the stopword list.
- Parameters:
word - the word to remove
- Returns:
true if the word was found in the list and then removed
-
is
public boolean is(String word)
Returns true if the given string is a stop word.
- Parameters:
word - the word to test
- Returns:
true if the word is a stopword
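The add, remove, and is entries above describe a normalize-then-look-up pattern: words are trimmed and lower-cased when stored, and candidates are lower-cased before the test. The following is a minimal sketch of that pattern under stated assumptions. StopwordSetSketch is a hypothetical class, not Weka's implementation; ignoring blank input in add and matching exactly in remove are assumptions, since the documentation does not say either way.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in, not Weka's code: mirrors the documented add/remove/is behaviour.
class StopwordSetSketch {
    private final Set<String> words = new HashSet<String>();

    // Stores the word trimmed and lower-cased; ignoring blank input is an assumption.
    void add(String word) {
        String normalized = word.trim().toLowerCase(Locale.ROOT);
        if (!normalized.isEmpty()) {
            words.add(normalized);
        }
    }

    // Returns true only if the word was present and has been removed.
    // Assumption: the argument is matched exactly, without normalization.
    boolean remove(String word) {
        return words.remove(word);
    }

    // Lower-cases the candidate before the lookup, so the test ignores case.
    boolean is(String word) {
        return words.contains(word.toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        StopwordSetSketch stopwords = new StopwordSetSketch();
        stopwords.add("  The ");
        System.out.println(stopwords.is("THE"));     // true
        System.out.println(stopwords.is("weka"));    // false
        System.out.println(stopwords.remove("the")); // true
        System.out.println(stopwords.is("the"));     // false
    }
}
```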
-
elements
public Enumeration elements()
Returns a sorted enumeration over all stored stopwords.
- Returns:
the enumeration over all stopwords
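One way to obtain a sorted enumeration over a set of words is to copy them into a sorted collection first. The sketch below shows this with standard JDK classes only. SortedEnumerationSketch and sortedElements are hypothetical names, and the use of natural String ordering is an assumption, since the documentation says only "sorted".

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Enumeration;
import java.util.HashSet;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration, not Weka's code: a sorted Enumeration over a word set.
class SortedEnumerationSketch {

    // Copies the words into a TreeSet so iteration follows natural String order.
    static Enumeration<String> sortedElements(Set<String> words) {
        return Collections.enumeration(new TreeSet<String>(words));
    }

    public static void main(String[] args) {
        Set<String> words = new HashSet<String>(Arrays.asList("the", "and", "of"));
        Enumeration<String> e = sortedElements(words);
        while (e.hasMoreElements()) {
            System.out.println(e.nextElement()); // prints and, of, the (one per line)
        }
    }
}
```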
-
read
public void read(String filename) throws Exception
Generates a new Stopwords object from the given file.
- Parameters:
filename - the file to read the stopwords from
- Throws:
Exception - if reading fails
-
read
public void read(File file) throws Exception
Generates a new Stopwords object from the given file.
- Parameters:
file - the file to read the stopwords from
- Throws:
Exception - if reading fails
-
read
public void read(BufferedReader reader) throws Exception
Generates a new Stopwords object from the reader. The reader is closed automatically.
- Parameters:
reader - the reader to get the stopwords from
- Throws:
Exception - if reading fails
-
write
public void write(String filename) throws Exception
Writes the current stopwords to the given file.
- Parameters:
filename - the file to write the stopwords to
- Throws:
Exception - if writing fails
-
write
public void write(File file) throws Exception
Writes the current stopwords to the given file.
- Parameters:
file - the file to write the stopwords to
- Throws:
Exception - if writing fails
-
write
public void write(BufferedWriter writer) throws Exception
Writes the current stopwords to the given writer. The writer is closed automatically.
- Parameters:
writer - the writer to write the stopwords to
- Throws:
Exception - if writing fails
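The write methods above produce the same one-word-per-line format that the read methods accept. As a rough sketch of what such output can look like, the example below writes a sorted word set to a writer and closes it afterwards. StopwordWriterSketch is a hypothetical class, and the "# stopwords" header line is an assumption chosen to show that comment lines are legal in the format; it is not taken from Weka's output.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.util.Arrays;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical illustration, not Weka's code: writes words in the documented file format.
class StopwordWriterSketch {

    // Writes a '#' comment header (an assumption), then one word per line.
    // The try-with-resources block closes the writer, matching "closed automatically".
    static void write(SortedSet<String> words, BufferedWriter writer) {
        try (BufferedWriter out = writer) {
            out.write("# stopwords");
            out.newLine();
            for (String word : words) {
                out.write(word);
                out.newLine();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        StringWriter buffer = new StringWriter();
        write(new TreeSet<String>(Arrays.asList("the", "and", "of")), new BufferedWriter(buffer));
        System.out.print(buffer); // header line, then and, of, the (one per line)
    }
}
```

Because the header starts with '#', a reader that follows the documented format skips it, so the output can be read back without change.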
-
toString
public String toString()
Returns the current stopwords in a string.
-
isStopword
public static boolean isStopword(String str)
Returns true if the given string is a stop word.
- Parameters:
str - the word to test
- Returns:
true if the word is a stopword
-
getRevision
public String getRevision()
Returns the revision string.
- Specified by:
getRevision in interface RevisionHandler
- Returns:
the revision
-
main
public static void main(String[] args) throws Exception
Accepts the following parameters:
-i file
loads the stopwords from the given file
-o file
saves the stopwords to the given file
-p
outputs the current stopwords on stdout
Any additional parameters are interpreted as words to test as stopwords.
- Parameters:
args - commandline parameters
- Throws:
Exception - if something goes wrong
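The option layout described for main (-i file, -o file, -p, and any remaining arguments treated as words to test) can be separated with a simple loop over the argument array. The sketch below is a hypothetical parser written for illustration; StopwordArgsSketch and its fields are invented names, and Weka's own option handling may differ, for example in how it treats an option that is missing its value.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration, not Weka's code: splits the documented options from the words.
class StopwordArgsSketch {
    String inputFile;   // value of -i, or null if absent
    String outputFile;  // value of -o, or null if absent
    boolean print;      // true if -p was given
    final List<String> wordsToTest = new ArrayList<String>();

    static StopwordArgsSketch parse(String[] args) {
        StopwordArgsSketch parsed = new StopwordArgsSketch();
        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            if (arg.equals("-i") && i + 1 < args.length) {
                parsed.inputFile = args[++i];
            } else if (arg.equals("-o") && i + 1 < args.length) {
                parsed.outputFile = args[++i];
            } else if (arg.equals("-p")) {
                parsed.print = true;
            } else {
                // Assumption: anything else, including an option without a value, is a word.
                parsed.wordsToTest.add(arg);
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        StopwordArgsSketch parsed = parse(new String[] {"-i", "my.txt", "-p", "the", "weka"});
        System.out.println(parsed.inputFile);   // my.txt
        System.out.println(parsed.print);       // true
        System.out.println(parsed.wordsToTest); // [the, weka]
    }
}
```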
-