Person retrieval on XML documents by coreference analysis utilizing structural features

Yumi Yonei, Mizuho Iwaihara, Masatoshi Yoshikawa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Keyword retrieval of the present day exploits frequencies and positions of search keywords in target documents. As for retrieval by two or more keywords, semantic relation between keywords is important. For retrieving information about a person, it is common to search by a pair of keywords consisting of person's name and his/her attribute of the interest. By using dependency analysis and coreference analysis, correct occurrences of pairs of person and his/her attributes can be retrieved. However, existing natural language analysis does not consider the factor that logical structures of the documents strongly influence probabilistic patterns of coreference. In this paper, we propose a new way of person retrieval by computing a maximum entropy model from linguistic features and structural features, where structural features are learned from probabilistic distribution of coreference over XML document structures. Our method can utilize strong correlation between XML document structures and coreference, thus having superior accuracy than existing methods.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages552-565
Number of pages14
Volume5181 LNCS
DOIs
Publication statusPublished - 2008
Externally publishedYes
Event19th International Conference on Database and Expert Systems Applications, DEXA 2008 - Turin
Duration: 2008 Sep 12008 Sep 5

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5181 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other19th International Conference on Database and Expert Systems Applications, DEXA 2008
CityTurin
Period08/9/108/9/5

Fingerprint

Structural Analysis
Structural analysis
XML
Person
Retrieval
Linguistics
Entropy
Semantics
Attribute
Keyword Search
Maximum Entropy
Natural Language
Target
Computing
Model

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Yonei, Y., Iwaihara, M., & Yoshikawa, M. (2008). Person retrieval on XML documents by coreference analysis utilizing structural features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5181 LNCS, pp. 552-565). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5181 LNCS). https://doi.org/10.1007/978-3-540-85654-2_47

Person retrieval on XML documents by coreference analysis utilizing structural features. / Yonei, Yumi; Iwaihara, Mizuho; Yoshikawa, Masatoshi.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5181 LNCS 2008. p. 552-565 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5181 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Yonei, Y, Iwaihara, M & Yoshikawa, M 2008, Person retrieval on XML documents by coreference analysis utilizing structural features. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 5181 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5181 LNCS, pp. 552-565, 19th International Conference on Database and Expert Systems Applications, DEXA 2008, Turin, 08/9/1. https://doi.org/10.1007/978-3-540-85654-2_47
Yonei Y, Iwaihara M, Yoshikawa M. Person retrieval on XML documents by coreference analysis utilizing structural features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5181 LNCS. 2008. p. 552-565. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-540-85654-2_47
Yonei, Yumi ; Iwaihara, Mizuho ; Yoshikawa, Masatoshi. / Person retrieval on XML documents by coreference analysis utilizing structural features. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5181 LNCS 2008. pp. 552-565 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{06539fa6b98e44aa9f957860b8109c30,
title = "Person retrieval on XML documents by coreference analysis utilizing structural features",
abstract = "Keyword retrieval of the present day exploits frequencies and positions of search keywords in target documents. As for retrieval by two or more keywords, semantic relation between keywords is important. For retrieving information about a person, it is common to search by a pair of keywords consisting of person's name and his/her attribute of the interest. By using dependency analysis and coreference analysis, correct occurrences of pairs of person and his/her attributes can be retrieved. However, existing natural language analysis does not consider the factor that logical structures of the documents strongly influence probabilistic patterns of coreference. In this paper, we propose a new way of person retrieval by computing a maximum entropy model from linguistic features and structural features, where structural features are learned from probabilistic distribution of coreference over XML document structures. Our method can utilize strong correlation between XML document structures and coreference, thus having superior accuracy than existing methods.",
author = "Yumi Yonei and Mizuho Iwaihara and Masatoshi Yoshikawa",
year = "2008",
doi = "10.1007/978-3-540-85654-2_47",
language = "English",
isbn = "3540856536",
volume = "5181 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "552--565",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Person retrieval on XML documents by coreference analysis utilizing structural features

AU - Yonei, Yumi

AU - Iwaihara, Mizuho

AU - Yoshikawa, Masatoshi

PY - 2008

Y1 - 2008

N2 - Keyword retrieval of the present day exploits frequencies and positions of search keywords in target documents. As for retrieval by two or more keywords, semantic relation between keywords is important. For retrieving information about a person, it is common to search by a pair of keywords consisting of person's name and his/her attribute of the interest. By using dependency analysis and coreference analysis, correct occurrences of pairs of person and his/her attributes can be retrieved. However, existing natural language analysis does not consider the factor that logical structures of the documents strongly influence probabilistic patterns of coreference. In this paper, we propose a new way of person retrieval by computing a maximum entropy model from linguistic features and structural features, where structural features are learned from probabilistic distribution of coreference over XML document structures. Our method can utilize strong correlation between XML document structures and coreference, thus having superior accuracy than existing methods.

AB - Keyword retrieval of the present day exploits frequencies and positions of search keywords in target documents. As for retrieval by two or more keywords, semantic relation between keywords is important. For retrieving information about a person, it is common to search by a pair of keywords consisting of person's name and his/her attribute of the interest. By using dependency analysis and coreference analysis, correct occurrences of pairs of person and his/her attributes can be retrieved. However, existing natural language analysis does not consider the factor that logical structures of the documents strongly influence probabilistic patterns of coreference. In this paper, we propose a new way of person retrieval by computing a maximum entropy model from linguistic features and structural features, where structural features are learned from probabilistic distribution of coreference over XML document structures. Our method can utilize strong correlation between XML document structures and coreference, thus having superior accuracy than existing methods.

UR - http://www.scopus.com/inward/record.url?scp=53049089029&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=53049089029&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-85654-2_47

DO - 10.1007/978-3-540-85654-2_47

M3 - Conference contribution

SN - 3540856536

SN - 9783540856535

VL - 5181 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 552

EP - 565

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -