TY - GEN
T1 - Building a diverse document leads corpus annotated with semantic relations
AU - Hangyo, Masatsugu
AU - Kawahara, Daisuke
AU - Kurohashi, Sadao
PY - 2012
Y1 - 2012
N2 - In these days, semantic analysis has been actively studied in natural language processing. For the study of semantic analysis, corpora with semantic annotations are essential. Although there are such corpora annotated on newspaper articles, there are various genres and styles, including linguistic expressions that are not found in newspaper articles. In this paper, we build a diverse document leads corpus annotated with semantic relations. To reduce the workload of annotators and annotate as many various documents as possible, we restrict the annotation target of each document to only the first three sentences. We have completed building a corpus of 1,000 documents and report the statistics of this corpus.
AB - In these days, semantic analysis has been actively studied in natural language processing. For the study of semantic analysis, corpora with semantic annotations are essential. Although there are such corpora annotated on newspaper articles, there are various genres and styles, including linguistic expressions that are not found in newspaper articles. In this paper, we build a diverse document leads corpus annotated with semantic relations. To reduce the workload of annotators and annotate as many various documents as possible, we restrict the annotation target of each document to only the first three sentences. We have completed building a corpus of 1,000 documents and report the statistics of this corpus.
UR - http://www.scopus.com/inward/record.url?scp=84883341328&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883341328&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84883341328
SN - 9789791421171
T3 - Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, PACLIC 2012
SP - 535
EP - 544
BT - Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, PACLIC 2012
T2 - 26th Pacific Asia Conference on Language, Information and Computation, PACLIC 2012
Y2 - 7 November 2012 through 7 November 2012
ER -