| addLine() |
|
Adds the data from a single line to the frequency table.
|
| addWord() |
|
Adds a single word to the model with a count of 1 if
it does not yet exist, else increments its count by 1.
|
| addWords() |
|
Adds the words to the model, incrementing
their counts (and the total count) for each.
|
| clear() |
|
Clears the model, resets its variables, and prepares it for reloading
with new data.
|
| contains() |
|
Returns true if the concordance contains word, else false.
|
| getCount() |
|
Returns the number of occurrences of word,
or 0 if the word does not exist in the table.
|
| getLeastCommonTokens() |
|
Returns the numberToReturn words with the lowest frequency.
If there are fewer than numberToReturn words, all items
are returned.
|
| getMostCommonTokens() |
|
Returns the numberToReturn words with the highest frequency.
If there are fewer than numberToReturn words, all items
are returned.
|
| getProbability() |
|
Returns the normalized frequency (probability) of word:
1 if it is the only word in the model, 0 if it does not exist.
|
| isIgnoringCase() |
|
Returns whether the model is ignoring case by treating
all words as lower-case (default = false)
|
| isIgnoringPunctuation() |
|
Returns whether the model is ignoring punctuation (default = true)
|
| isIgnoringStopWords() |
|
Returns whether the model is ignoring stopWords (default = false)
|
| loadFile() |
|
Loads the data from the file into a frequency table
|
| loadFiles() |
|
Loads the data from the files into a single frequency table
|
| setIgnoreCase() |
|
Sets whether the model should ignore case (default = false),
treating all tokens as lower-case.
|
| setIgnorePunctuation() |
|
Sets whether the model should ignore punctuation (default = true)
|
| setIgnoreStopWords() |
|
Sets whether the model should ignore stopWords (default = false)
|
| setWordsToIgnore() |
|
Tells the model to ignore this set of words
|
| totalCount() |
|
Returns the total number of entries in the model.
|
| uniqueCount() |
|
Returns the number of unique words in the model.
|
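
The counting behavior described in the table above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the library's implementation: the class name `ConcordanceSketch` and its constructor argument are hypothetical, the method names simply mirror the table, and the tie order among equally frequent words is an arbitrary choice of this sketch. File loading, punctuation handling, and stop words are left out.

```python
from collections import Counter


class ConcordanceSketch:
    """Hypothetical stand-in for the frequency model described above."""

    def __init__(self, ignoreCase=False):
        self.ignoreCase = ignoreCase   # mirrors setIgnoreCase() (default = false)
        self.counts = Counter()        # word -> number of occurrences

    def _norm(self, word):
        # When ignoring case, every token is treated as lower-case.
        return word.lower() if self.ignoreCase else word

    def addWord(self, word):
        # A missing key counts as 0, so a new word ends up with a count of 1.
        self.counts[self._norm(word)] += 1

    def addWords(self, words):
        for w in words:
            self.addWord(w)

    def contains(self, word):
        return self._norm(word) in self.counts

    def getCount(self, word):
        return self.counts.get(self._norm(word), 0)

    def totalCount(self):
        return sum(self.counts.values())

    def uniqueCount(self):
        return len(self.counts)

    def getProbability(self, word):
        # Normalized frequency: count / total, or 0 for an unknown word.
        total = self.totalCount()
        return self.getCount(word) / total if total else 0.0

    def getMostCommonTokens(self, numberToReturn):
        return [w for w, _ in self.counts.most_common(numberToReturn)]

    def getLeastCommonTokens(self, numberToReturn):
        # Assumption: ties keep insertion order (stable sort by count).
        ranked = sorted(self.counts.items(), key=lambda kv: kv[1])
        return [w for w, _ in ranked[:numberToReturn]]

    def clear(self):
        self.counts.clear()


model = ConcordanceSketch(ignoreCase=True)
model.addWords("the cat saw The dog".split())
print(model.getCount("THE"), model.totalCount(), model.uniqueCount())
print(model.getProbability("the"), model.getMostCommonTokens(1))
```

Asking for more tokens than the model holds, for example `model.getLeastCommonTokens(10)` here, returns every word, which matches the "all items are returned" rule in the table.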