|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectnet.java.sen.StringTagger
This class generate morpheme tags from String. Sample code is here:
StringTagger tagger = StringTagger.getInstance("/usr/local/sen/conf/sen.xml"); Token[] token = tagger.analyze(s); for (int i = 0; i < token.length; i++) { Token t = token[i]; String pos = t.getPos(); // part of speech String basic = t.getBasic(); // un-conjugate representation String reading = t.getReading(); // reading }
Field Summary | |
protected java.lang.String |
unknownPos
|
Method Summary | |
void |
addPostProcessor(PostProcessor processor)
Add PostProcessor. |
void |
addPreProcessor(PreProcessor processor)
Add PreProcessor. |
Token[] |
analyze(java.lang.String input)
Analyze string. |
protected Token[] |
doPostProcess(Token[] tokens,
java.util.Map postProcessInfo)
Execute all registered preprocess. |
protected java.lang.String |
doPreProcess(java.lang.String input,
java.util.Map postProcessInfo)
Execute all registered preprocess. |
static StringTagger |
getInstance()
|
static StringTagger |
getInstance(java.util.Locale locale)
Deprecated. use instead of StringTagger#getinstance(String senConfig) |
static StringTagger |
getInstance(java.lang.String senConfig)
Obtain StringTagger instance for with specified configuration. |
boolean |
hasNext()
Check StringTagger have more morphemes or not. |
Token |
next()
Get next morpheme. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
protected java.lang.String unknownPos
Method Detail |
public static StringTagger getInstance() throws java.lang.IllegalArgumentException, java.io.IOException
java.lang.IllegalArgumentException
java.io.IOException
public static StringTagger getInstance(java.util.Locale locale) throws java.io.IOException, java.lang.IllegalArgumentException
locale
- Locale to generate morphological analyzer.
java.io.IOException
java.lang.IllegalArgumentException
public static StringTagger getInstance(java.lang.String senConfig) throws java.io.IOException, java.lang.IllegalArgumentException
senConfig
- configuration file for sen.(ex. "SEN_HOME/conf/sen.xml").
java.io.IOException
java.lang.IllegalArgumentException
public Token[] analyze(java.lang.String input) throws java.io.IOException
input
- string to analyze.
java.io.IOException
public Token next()
public boolean hasNext()
public void addPostProcessor(PostProcessor processor)
processor
- PostProcessorpublic void addPreProcessor(PreProcessor processor)
processor
- PreProcessorprotected java.lang.String doPreProcess(java.lang.String input, java.util.Map postProcessInfo)
input
- input stringpostProcessInfo
- information passed to postProcess
protected Token[] doPostProcess(Token[] tokens, java.util.Map postProcessInfo)
tokens
- tokenspostProcessInfo
- information passed from preprocess
|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |