TY - GEN
T1 - A Simple and Effective Usage of Self-supervised Contrastive Learning for Text Clustering
AU - Shi, Haoxiang
AU - Wang, Cen
AU - Sakai, Tetsuya
N1 - Publisher Copyright: © 2021 IEEE.
PY - 2021
Y1 - 2021
AB - Contrastive learning is a promising approach to unsupervised learning, as it inherits the advantages of well-studied deep models without a dedicated and complex model design. In this paper, based on bidirectional encoder representations from transformers, we propose self-supervised contrastive learning (SCL) as well as few-shot contrastive learning (FCL) with unsupervised data augmentation (UDA) for text clustering. SCL outperforms state-of-the-art unsupervised clustering approaches for short texts and those for long texts in terms of several clustering evaluation measures. FCL achieves performance close to supervised learning, and FCL with UDA further improves the performance for short texts.
UR - http://www.scopus.com/inward/record.url?scp=85124320150&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124320150&partnerID=8YFLogxK
U2 - 10.1109/SMC52423.2021.9659143
DO - 10.1109/SMC52423.2021.9659143
M3 - Conference contribution
AN - SCOPUS:85124320150
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 315
EP - 320
BT - 2021 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2021
Y2 - 17 October 2021 through 20 October 2021
ER -