Emerging Topic Tracking System

Khoo Khyou Bun, M. Ishizuka

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

We designed a system that track the changes to a particular area of a user's interests on the World Wide Web and to generate a summary of emerging topics back to the user. This system consists of three main components, which are the Area View System, the Web Spider and the Summary Generator. The Area View System, as a meta-search engine, directs the user's keywords to a commercial search engine, obtains the hits, performs further analysis and derives a number of most relevant domain sites. Then, the Web Spider dispatches and scans all these domains at a certain time interval to collect all the modified and newly added HTML pages. Lastly, the Summary Generator extracts all the newly added sentences or changes from the collected HTML pages and then counts the term weights in the changes by adapting a newly innovated algorithm called TF∗PDF (Term Frequency ∗ Proportional Document Frequency). The terms that deem to explain the emerging topic are heavily weighted. The sentences with the highest average weight are extracted to form a summary of emerging topics. We refer to our system as the Emerging Topic Tracking System (ETTS).

Original languageEnglish
Title of host publicationProceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2-11
Number of pages10
ISBN (Print)0769512240, 9780769512242
DOIs
Publication statusPublished - 2001
Externally publishedYes
Event3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001 - San Juan, United States
Duration: 2001 Jun 212001 Jun 22

Other

Other3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001
CountryUnited States
CitySan Juan
Period01/6/2101/6/22

Fingerprint

HTML
Search engines
World Wide Web

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Bun, K. K., & Ishizuka, M. (2001). Emerging Topic Tracking System. In Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001 (pp. 2-11). [933900] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WECWIS.2001.933900

Emerging Topic Tracking System. / Bun, Khoo Khyou; Ishizuka, M.

Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001. Institute of Electrical and Electronics Engineers Inc., 2001. p. 2-11 933900.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Bun, KK & Ishizuka, M 2001, Emerging Topic Tracking System. in Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001., 933900, Institute of Electrical and Electronics Engineers Inc., pp. 2-11, 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001, San Juan, United States, 01/6/21. https://doi.org/10.1109/WECWIS.2001.933900
Bun KK, Ishizuka M. Emerging Topic Tracking System. In Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001. Institute of Electrical and Electronics Engineers Inc. 2001. p. 2-11. 933900 https://doi.org/10.1109/WECWIS.2001.933900
Bun, Khoo Khyou ; Ishizuka, M. / Emerging Topic Tracking System. Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001. Institute of Electrical and Electronics Engineers Inc., 2001. pp. 2-11
@inproceedings{4adedf9112304cd79ec485c3ea4a6633,
title = "Emerging Topic Tracking System",
abstract = "We designed a system that track the changes to a particular area of a user's interests on the World Wide Web and to generate a summary of emerging topics back to the user. This system consists of three main components, which are the Area View System, the Web Spider and the Summary Generator. The Area View System, as a meta-search engine, directs the user's keywords to a commercial search engine, obtains the hits, performs further analysis and derives a number of most relevant domain sites. Then, the Web Spider dispatches and scans all these domains at a certain time interval to collect all the modified and newly added HTML pages. Lastly, the Summary Generator extracts all the newly added sentences or changes from the collected HTML pages and then counts the term weights in the changes by adapting a newly innovated algorithm called TF∗PDF (Term Frequency ∗ Proportional Document Frequency). The terms that deem to explain the emerging topic are heavily weighted. The sentences with the highest average weight are extracted to form a summary of emerging topics. We refer to our system as the Emerging Topic Tracking System (ETTS).",
author = "Bun, {Khoo Khyou} and M. Ishizuka",
year = "2001",
doi = "10.1109/WECWIS.2001.933900",
language = "English",
isbn = "0769512240",
pages = "2--11",
booktitle = "Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Emerging Topic Tracking System

AU - Bun, Khoo Khyou

AU - Ishizuka, M.

PY - 2001

Y1 - 2001

N2 - We designed a system that track the changes to a particular area of a user's interests on the World Wide Web and to generate a summary of emerging topics back to the user. This system consists of three main components, which are the Area View System, the Web Spider and the Summary Generator. The Area View System, as a meta-search engine, directs the user's keywords to a commercial search engine, obtains the hits, performs further analysis and derives a number of most relevant domain sites. Then, the Web Spider dispatches and scans all these domains at a certain time interval to collect all the modified and newly added HTML pages. Lastly, the Summary Generator extracts all the newly added sentences or changes from the collected HTML pages and then counts the term weights in the changes by adapting a newly innovated algorithm called TF∗PDF (Term Frequency ∗ Proportional Document Frequency). The terms that deem to explain the emerging topic are heavily weighted. The sentences with the highest average weight are extracted to form a summary of emerging topics. We refer to our system as the Emerging Topic Tracking System (ETTS).

AB - We designed a system that track the changes to a particular area of a user's interests on the World Wide Web and to generate a summary of emerging topics back to the user. This system consists of three main components, which are the Area View System, the Web Spider and the Summary Generator. The Area View System, as a meta-search engine, directs the user's keywords to a commercial search engine, obtains the hits, performs further analysis and derives a number of most relevant domain sites. Then, the Web Spider dispatches and scans all these domains at a certain time interval to collect all the modified and newly added HTML pages. Lastly, the Summary Generator extracts all the newly added sentences or changes from the collected HTML pages and then counts the term weights in the changes by adapting a newly innovated algorithm called TF∗PDF (Term Frequency ∗ Proportional Document Frequency). The terms that deem to explain the emerging topic are heavily weighted. The sentences with the highest average weight are extracted to form a summary of emerging topics. We refer to our system as the Emerging Topic Tracking System (ETTS).

UR - http://www.scopus.com/inward/record.url?scp=24644440798&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=24644440798&partnerID=8YFLogxK

U2 - 10.1109/WECWIS.2001.933900

DO - 10.1109/WECWIS.2001.933900

M3 - Conference contribution

SN - 0769512240

SN - 9780769512242

SP - 2

EP - 11

BT - Proceedings - 3rd International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001

PB - Institute of Electrical and Electronics Engineers Inc.

ER -