|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectrita.RiObject
rita.support.remote.RiRemotable
rita.support.me.RiObjectME
rita.support.me.MaxEntChunker
public class MaxEntChunker
Simple chunker that finds non-recursive syntactic 'chunks'
such as noun-phrases, using the Penn conventions (shown below).
For more info see: Berger & Della Pietra's paper: 'A Maximum Entropy Approach to Natural Language Processing', which provides a good introduction to the maxent framework.
| Field Summary | |
|---|---|
static java.lang.String |
ADJ_PHRASE
|
static java.lang.String |
ADV_PHRASE
|
static java.lang.String |
IND_PHRASE
|
static java.lang.String |
NOUN_PHRASE
|
static java.lang.String |
PREP_PHRASE
|
static java.lang.String |
PRT_PHRASE
|
static java.lang.String |
SBAR_PHRASE
|
static java.lang.String |
VERB_PHRASE
|
| Fields inherited from class rita.support.me.RiObjectME |
|---|
ERROR_MSG, LOAD_FROM_MODEL_DIR |
| Fields inherited from interface rita.support.remote.RemoteConstants |
|---|
ARG_DELIM, ARR_DELIM, CHUNKER, DELIM, FS, LB, LP, MARKOV, PARSER, QQ, RB, RP, SPC, TAGGER, TYPE_DELIM |
| Fields inherited from interface processing.core.PConstants |
|---|
A, AB, ADD, AG, ALPHA, ALPHA_MASK, ALT, AMBIENT, AR, ARC, ARGB, ARROW, B, BACKSPACE, BASELINE, BEEN_LIT, BEVEL, BLEND, BLUE_MASK, BLUR, BOTTOM, BOX, BURN, CENTER, CENTER_DIAMETER, CENTER_RADIUS, CHATTER, CLOSE, CMYK, CODED, COMPLAINT, CONTROL, CORNER, CORNERS, CROSS, CUSTOM, DA, DARKEST, DB, DEG_TO_RAD, DELETE, DG, DIAMETER, DIFFERENCE, DILATE, DIRECTIONAL, DISABLE_ACCURATE_TEXTURES, DISABLE_DEPTH_SORT, DISABLE_DEPTH_TEST, DISABLE_OPENGL_2X_SMOOTH, DISABLE_OPENGL_ERROR_REPORT, DODGE, DOWN, DR, DXF, EB, EDGE, EG, ELLIPSE, ENABLE_ACCURATE_TEXTURES, ENABLE_DEPTH_SORT, ENABLE_DEPTH_TEST, ENABLE_NATIVE_FONTS, ENABLE_OPENGL_2X_SMOOTH, ENABLE_OPENGL_4X_SMOOTH, ENABLE_OPENGL_ERROR_REPORT, ENTER, EPSILON, ER, ERODE, ERROR_BACKGROUND_IMAGE_FORMAT, ERROR_BACKGROUND_IMAGE_SIZE, ERROR_PUSHMATRIX_OVERFLOW, ERROR_PUSHMATRIX_UNDERFLOW, ERROR_TEXTFONT_NULL_PFONT, ESC, EXCLUSION, G, GIF, GRAY, GREEN_MASK, HALF_PI, HAND, HARD_LIGHT, HINT_COUNT, HSB, IMAGE, INVERT, JAVA2D, JPEG, LEFT, LIGHTEST, LINE, LINES, LINUX, MACOSX, MAX_FLOAT, MAX_INT, MIN_FLOAT, MIN_INT, MITER, MODEL, MULTIPLY, NORMAL, NORMALIZED, NX, NY, NZ, OPAQUE, OPEN, OPENGL, ORTHOGRAPHIC, OTHER, OVERLAY, P2D, P3D, PATH, PDF, PERSPECTIVE, PI, platformNames, POINT, POINTS, POLYGON, POSTERIZE, PROBLEM, PROJECT, QUAD, QUAD_STRIP, QUADS, QUARTER_PI, R, RAD_TO_DEG, RADIUS, RECT, RED_MASK, REPLACE, RETURN, RGB, RIGHT, ROUND, SA, SB, SCREEN, SG, SHAPE, SHIFT, SHINE, SOFT_LIGHT, SPB, SPG, SPHERE, SPOT, SPR, SQUARE, SR, SUBTRACT, SW, TAB, TARGA, THIRD_PI, THRESHOLD, TIFF, TOP, TRIANGLE, TRIANGLE_FAN, TRIANGLE_STRIP, TRIANGLES, TWO_PI, TX, TY, TZ, U, UP, V, VERTEX_FIELD_COUNT, VW, VX, VY, VZ, WAIT, WHITESPACE, WINDOWS, X, Y, Z |
| Constructor Summary | |
|---|---|
MaxEntChunker()
|
|
MaxEntChunker(processing.core.PApplet p)
|
|
| Method Summary | |
|---|---|
java.lang.String |
chunk(java.util.List words,
java.util.List postags)
Returns a String of chunks inline |
java.lang.String |
chunk(java.lang.String[] words,
java.lang.String[] tags)
Returns a String of chunks inline |
static MaxEntChunker |
createRemote(java.util.Map params)
|
void |
destroy()
|
java.lang.String[] |
getAdjPhrases()
|
java.lang.String[] |
getAdvPhrases()
|
java.lang.String[] |
getChunkData()
|
static MaxEntChunker |
getInstance()
|
static MaxEntChunker |
getInstance(processing.core.PApplet p)
|
java.lang.String[] |
getNounPhrases()
|
java.lang.String[] |
getPrepPhrases()
|
java.lang.String[] |
getVerbPhrases()
|
static void |
main(java.lang.String[] args)
|
java.lang.String |
tagAndChunk(java.lang.String sentence)
Utility method that uses the default word tokenizer & pos-tagger to prepare a sentence for chunking, then returns the sentence String w' chunk-data inline |
| Methods inherited from class rita.support.me.RiObjectME |
|---|
getModelDir, setModelDir |
| Methods inherited from class rita.RiObject |
|---|
dispose, getId, getPApplet, nextId |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final java.lang.String NOUN_PHRASE
public static final java.lang.String VERB_PHRASE
public static final java.lang.String PREP_PHRASE
public static final java.lang.String SBAR_PHRASE
public static final java.lang.String ADJ_PHRASE
public static final java.lang.String ADV_PHRASE
public static final java.lang.String PRT_PHRASE
public static final java.lang.String IND_PHRASE
| Constructor Detail |
|---|
public MaxEntChunker()
public MaxEntChunker(processing.core.PApplet p)
| Method Detail |
|---|
public static MaxEntChunker getInstance()
public static MaxEntChunker getInstance(processing.core.PApplet p)
public static MaxEntChunker createRemote(java.util.Map params)
public java.lang.String chunk(java.util.List words,
java.util.List postags)
RiChunkerIF
chunk in interface RiChunkerIFpublic java.lang.String[] getNounPhrases()
getNounPhrases in interface RiChunkerIFpublic java.lang.String[] getVerbPhrases()
getVerbPhrases in interface RiChunkerIFpublic java.lang.String[] getPrepPhrases()
getPrepPhrases in interface RiChunkerIFpublic java.lang.String[] getAdjPhrases()
getAdjPhrases in interface RiChunkerIFpublic java.lang.String[] getAdvPhrases()
getAdvPhrases in interface RiChunkerIF
public java.lang.String chunk(java.lang.String[] words,
java.lang.String[] tags)
RiChunkerIF
chunk in interface RiChunkerIFpublic java.lang.String tagAndChunk(java.lang.String sentence)
tagAndChunk in interface RiChunkerIFsentence - public java.lang.String[] getChunkData()
getChunkData in interface RiChunkerIFpublic void destroy()
destroy in class RiRemotablepublic static void main(java.lang.String[] args)
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||