RiTa
index
Name RiConcorder
Description Maintains a simple word frequency table for a set of input data
    RiConcorder ric = new RiConcorder(this);
    ric.setIgnoreCase(false);
    ric.setIgnoreStopWords(false);
    ric.setIgnorePunctuation(false);
    ric.loadFile("myTestFile.txt");
    ric.dump();
    String[] mostCommon = ric.getMostCommonTokens(5);
    print(mostCommon);
    
Constructors
RiConcorder(pApplet, fileNames, tokenizer);
RiConcorder(pApplet, fileName);
RiConcorder(pApplet, fileNames);
RiConcorder(pApplet, tokenizer);
RiConcorder(pApplet);
Methods
addLine()   Add the data from a single line into the frequency table

addWord()   Adds a single word to the model with a count of 1 if it does not yet exist, else increments its count by 1.

addWords()   Adds the wordsto the model, incrementing their counts (and the total-count) for each.

clear()   Clears the model, resets variables, and prepares it for reloading with new data

contains()   True if the concordance contains word, else false

getCount()   Returns the # of occurences of word or 0 if the word does not exist in the table.

getLeastCommonTokens()   Returns the numberToReturn words with the highest frequency. If there are less than numberToReturn words then all items are returned.

getMostCommonTokens()   Returns the numberToReturn words with the highest frequency. If there are less than numberToReturn words then all items are returned.

getProbability()   Returns the normalized frequency (probability) of word, 1 if it is the only word in the model, 0 if it does not exist.

isIgnoringCase()   Returns whether the model is ignoring case by considering all words as lowerCase (default=false)

isIgnoringPunctuation()   Returns whether the model is ignoring punctuation (default = true)

isIgnoringStopWords()   Returns whether the model is ignoring stopWords (default = false)

loadFile()   Loads the data from the file into a frequency table

loadFiles()   Loads the data from the files into a single frequency table

setIgnoreCase()   Sets whether the model should ignore case (default=false), treating all tokens as lower-case

setIgnorePunctuation()   Sets whether the model should ignore punctuation (default = true)

setIgnoreStopWords()   Sets whether the model should ignore stopWords (default = false)

setWordsToIgnore()   Tells the model to ignore this set of words

totalCount()   Returns the total # of entries in the model.

uniqueCount()   Returns the # of unique words in the model.

Usage Web & Application