フシミ タカヤス
Takayasu Fushimi
伏見 卓恭 所属 コンピュータサイエンス学部 コンピュータサイエンス学科 職種 専任講師 |
|
言語種別 | 日本語 |
発行・発表の年月 | 2018/03 |
形態種別 | 学術論文 |
査読 | 査読あり |
標題 | 時系列文書に対するトピックフォレストの構築と構造解析 |
執筆形態 | 共著 |
掲載誌名 | 日本データベース学会和文論文誌 |
掲載区分 | 国内 |
巻・号・頁 | 16-J(1),1-8頁 |
著者・共著者 | 伏見 卓恭,佐藤 哲司 |
概要 | A large amount of documents are posted on the Web
from moment to moment such as news articles, blog articles, web pages, academic literature. There are strong and weak relationships between related and similar documents. The relationships between strongly relevant documents clearly exhibit like citations of scientific literature, trackbacks of blog posts, hyperlinks of Wikipedia articles and web pages, but in the case of news articles, connections with related documents are often not clearly indicated. As a simplest method, there is a method of calculating similarity between news articles and constructing a similarity network by linking between similar documents, but it is difficult to consider the time axis. Therefore, in this paper, we propose a topic forest construction method consisting of multiple time-evolving tree structures based on semantic cohesiveness and temporal cohesion of documents. By visualizing this topic forest, it is considered that an effective access order to the document can be presented. Experimental evaluations using real data show that the topic forest has semantic and temporal cohesiveness, which helps us to improve accessibility to the documents. |