PENS Repository

Automatic Representative News Generation Using On-Line Clustering

Sigita, Marlisa and Barakbah, Ali Ridho and Martiana, Entin and Winarno, Idris (2013) Automatic Representative News Generation Using On-Line Clustering. EMITTER International Journal of Engineering Technology, 01 (01). pp. 107-114. ISSN 2355-391X

[img] PDF (EMITTER 2013) - Published Version
Restricted to Registered users only
Available under License Creative Commons Attribution No Derivatives.

Download (603Kb)

    Abstract

    The increasing number of online news provider has produced large volume of news every day. The large volume can bring drawback in consuming information efficiently because some news contain similar contents but they have different titles that may appear. This paper presents a new system for automatically generating representative news using on-line clustering. The system allows the clustering to be dynamic with the features of centroid update and new cluster creation. Text mining is implemented to extract the news contents. The representative news is obtained from the closest distance to each centroid that calculated using Euclidean distance. For experimental study, we implement our system to 460 news in Bahasa Indonesia. The experiment performed 70.9% of precision ratio. The error is mainly caused by imprecise results from keyword extraction that generates only one or two keywords for an article. The distribution of centroid’s keywords also affects the clustering results.

    Item Type: Article
    Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
    Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
    Depositing User: Dr. Ali Ridho Barakbah
    Date Deposited: 22 Mar 2015 12:16
    Last Modified: 22 Mar 2015 12:16
    URI: http://repo.pens.ac.id/id/eprint/2743

    Actions (login required)

    View Item