TY - JOUR
T1 - Research and Implementation of Chinese Couplet Generation System With Attention Based Transformer Mechanism
AU - Wang, Yufeng
AU - Zhang, Jiang
AU - Zhang, Bo
AU - Jin, Qun
N1 - Funding Information:
This work was supported by the QingLan Project of Jiangsu Province.
Publisher Copyright:
IEEE
PY - 2021
Y1 - 2021
N2 - Couplet is a unique art form in Chinese traditional culture. The development of deep neural network (DNN) technology makes it possible for computers to automatically generate couplets. Especially, Transformer is a DNN-based ``Encoder-Decoder'' framework, and widely used in natural language processing (NLP). However, the existed Transformer mechanism cannot fully exploit the essential linguistic knowledge in Chinese, including the special format and requirements of Chinese couplets. Therefore, this article adapts the Transformer mechanism to generate meaningful Chinese couplets. Specifically, the contributions of our work are threefold. First, considering the fact that the words in the corresponding positions of the antecedent clause and the subsequent clause in a Chinese couplet always have same part-of-speech (pos, i.e., word class), pos information is intentionally added into the Transformer to improve the accuracy of the conceived couplet. Second, to deal with the large number of unregistered and low-frequency words in Chinese couplet, a specific unregistered/low-frequency word processing mechanism (UWP) is designed and combined with the Transformer model. Third, to further improve the coherence of couplets, we incorporate the polish mechanisms (PMs) into Transformer model. In terms of three evaluation criteria including bilingual evaluation understudy (BLEU), perplexity, and human evaluation, the experimental results demonstrate the effectiveness of our designed Chinese couplet generation system.
AB - Couplet is a unique art form in Chinese traditional culture. The development of deep neural network (DNN) technology makes it possible for computers to automatically generate couplets. Especially, Transformer is a DNN-based ``Encoder-Decoder'' framework, and widely used in natural language processing (NLP). However, the existed Transformer mechanism cannot fully exploit the essential linguistic knowledge in Chinese, including the special format and requirements of Chinese couplets. Therefore, this article adapts the Transformer mechanism to generate meaningful Chinese couplets. Specifically, the contributions of our work are threefold. First, considering the fact that the words in the corresponding positions of the antecedent clause and the subsequent clause in a Chinese couplet always have same part-of-speech (pos, i.e., word class), pos information is intentionally added into the Transformer to improve the accuracy of the conceived couplet. Second, to deal with the large number of unregistered and low-frequency words in Chinese couplet, a specific unregistered/low-frequency word processing mechanism (UWP) is designed and combined with the Transformer model. Third, to further improve the coherence of couplets, we incorporate the polish mechanisms (PMs) into Transformer model. In terms of three evaluation criteria including bilingual evaluation understudy (BLEU), perplexity, and human evaluation, the experimental results demonstrate the effectiveness of our designed Chinese couplet generation system.
KW - Computational modeling
KW - Computers
KW - Decoding
KW - Deep neural network (DNN) based Transformer mechanism
KW - Dictionaries
KW - Recurrent neural networks
KW - Semantics
KW - Telecommunications
KW - part-of-speech features
KW - polish-up mechanism
KW - unregistered and low-frequency words.
UR - http://www.scopus.com/inward/record.url?scp=85104573648&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85104573648&partnerID=8YFLogxK
U2 - 10.1109/TCSS.2021.3072153
DO - 10.1109/TCSS.2021.3072153
M3 - Article
AN - SCOPUS:85104573648
SN - 2329-924X
JO - IEEE Transactions on Computational Social Systems
JF - IEEE Transactions on Computational Social Systems
ER -