Representation method for a set of documents from the viewpoint of Bayesian statistics

Masayuki Goto, Takashi Ishida, Shigeichi Hirasawa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In this paper, we consider the Bayesian approach for representation of a set of documents. In the field of representation of a set of documents, many previous models, such as the latent semantic analysis (LSA), the probabilistic latent semantic analysis (PLSA), the Semantic Aggregate Model (SAM), the Bayesian Latent Semantic Analysis (BLSA), and so on, were proposed. In this paper, we formulate the Bayes optimal solutions for estimation of parameters and selection of the dimension of the hidden latent class in these models and analyze it's asymptotic properties.

Original languageEnglish
Title of host publicationProceedings of the IEEE International Conference on Systems, Man and Cybernetics
Pages4637-4642
Number of pages6
Volume5
Publication statusPublished - 2003
Externally publishedYes
EventSystem Security and Assurance - Washington, DC, United States
Duration: 2003 Oct 52003 Oct 8

Other

OtherSystem Security and Assurance
CountryUnited States
CityWashington, DC
Period03/10/503/10/8

Fingerprint

Semantics
Statistics

Keywords

  • Automated document indexing
  • Bayesian statistics
  • Information retrieval
  • Probabilistic latent semantic indexing

ASJC Scopus subject areas

  • Hardware and Architecture
  • Control and Systems Engineering

Cite this

Goto, M., Ishida, T., & Hirasawa, S. (2003). Representation method for a set of documents from the viewpoint of Bayesian statistics. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (Vol. 5, pp. 4637-4642)

Representation method for a set of documents from the viewpoint of Bayesian statistics. / Goto, Masayuki; Ishida, Takashi; Hirasawa, Shigeichi.

Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. Vol. 5 2003. p. 4637-4642.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Goto, M, Ishida, T & Hirasawa, S 2003, Representation method for a set of documents from the viewpoint of Bayesian statistics. in Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. vol. 5, pp. 4637-4642, System Security and Assurance, Washington, DC, United States, 03/10/5.
Goto M, Ishida T, Hirasawa S. Representation method for a set of documents from the viewpoint of Bayesian statistics. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. Vol. 5. 2003. p. 4637-4642
Goto, Masayuki ; Ishida, Takashi ; Hirasawa, Shigeichi. / Representation method for a set of documents from the viewpoint of Bayesian statistics. Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. Vol. 5 2003. pp. 4637-4642
@inproceedings{aa57f828101e46358f8163069bbb5ee1,
title = "Representation method for a set of documents from the viewpoint of Bayesian statistics",
abstract = "In this paper, we consider the Bayesian approach for representation of a set of documents. In the field of representation of a set of documents, many previous models, such as the latent semantic analysis (LSA), the probabilistic latent semantic analysis (PLSA), the Semantic Aggregate Model (SAM), the Bayesian Latent Semantic Analysis (BLSA), and so on, were proposed. In this paper, we formulate the Bayes optimal solutions for estimation of parameters and selection of the dimension of the hidden latent class in these models and analyze it's asymptotic properties.",
keywords = "Automated document indexing, Bayesian statistics, Information retrieval, Probabilistic latent semantic indexing",
author = "Masayuki Goto and Takashi Ishida and Shigeichi Hirasawa",
year = "2003",
language = "English",
volume = "5",
pages = "4637--4642",
booktitle = "Proceedings of the IEEE International Conference on Systems, Man and Cybernetics",

}

TY - GEN

T1 - Representation method for a set of documents from the viewpoint of Bayesian statistics

AU - Goto, Masayuki

AU - Ishida, Takashi

AU - Hirasawa, Shigeichi

PY - 2003

Y1 - 2003

N2 - In this paper, we consider the Bayesian approach for representation of a set of documents. In the field of representation of a set of documents, many previous models, such as the latent semantic analysis (LSA), the probabilistic latent semantic analysis (PLSA), the Semantic Aggregate Model (SAM), the Bayesian Latent Semantic Analysis (BLSA), and so on, were proposed. In this paper, we formulate the Bayes optimal solutions for estimation of parameters and selection of the dimension of the hidden latent class in these models and analyze it's asymptotic properties.

AB - In this paper, we consider the Bayesian approach for representation of a set of documents. In the field of representation of a set of documents, many previous models, such as the latent semantic analysis (LSA), the probabilistic latent semantic analysis (PLSA), the Semantic Aggregate Model (SAM), the Bayesian Latent Semantic Analysis (BLSA), and so on, were proposed. In this paper, we formulate the Bayes optimal solutions for estimation of parameters and selection of the dimension of the hidden latent class in these models and analyze it's asymptotic properties.

KW - Automated document indexing

KW - Bayesian statistics

KW - Information retrieval

KW - Probabilistic latent semantic indexing

UR - http://www.scopus.com/inward/record.url?scp=0242552144&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0242552144&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0242552144

VL - 5

SP - 4637

EP - 4642

BT - Proceedings of the IEEE International Conference on Systems, Man and Cybernetics

ER -