TY - GEN
T1 - Transformation-based Khmer Part-of-Speech tagger
AU - Nou, Chenda
AU - Kameyama, Wataru
PY - 2007
Y1 - 2007
N2 - This paper introduces an initiative research on Khmer Part-of-Speech (POS) tagger based on Transformation based approach. Due to a few researches on natural language processing for Khmer, many pre-processing tasks are needed before the automatic tagging can take place. The first Khmer annotated corpus is tagged with 27 tags based on the traditional and modern grammar theories. The learner, based on learning algorithm introduced by Brill [2], is built with 32 transformation templates. After applying the transformation rules with our sophisticated ranking algorithm, the error rate of tagging on trained and untrained data can be reduced around 41% and 18% accordingly over the baseline. The experiments provide very encouraging results; however, some future works are drawn to improve the accuracy and the performance of the tagger to reach the better level.
AB - This paper introduces an initiative research on Khmer Part-of-Speech (POS) tagger based on Transformation based approach. Due to a few researches on natural language processing for Khmer, many pre-processing tasks are needed before the automatic tagging can take place. The first Khmer annotated corpus is tagged with 27 tags based on the traditional and modern grammar theories. The learner, based on learning algorithm introduced by Brill [2], is built with 32 transformation templates. After applying the transformation rules with our sophisticated ranking algorithm, the error rate of tagging on trained and untrained data can be reduced around 41% and 18% accordingly over the baseline. The experiments provide very encouraging results; however, some future works are drawn to improve the accuracy and the performance of the tagger to reach the better level.
KW - Automatic learning
KW - Corpus-based
KW - Khmer Part-of-Speech tagging
KW - Natural language processing
KW - Transformation-based tagger
UR - http://www.scopus.com/inward/record.url?scp=84866524184&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84866524184&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84866524184
SN - 9781601320254
T3 - Proceedings of the 2007 International Conference on Artificial Intelligence, ICAI 2007
SP - 581
EP - 587
BT - Proceedings of the 2007 International Conference on Artificial Intelligence, ICAI 2007
T2 - 2007 International Conference on Artificial Intelligence, ICAI 2007
Y2 - 25 June 2007 through 28 June 2007
ER -