PENS Repository

Text Mining Untuk Pencarian Dokumen Bahasa Inggris Menggunakan Suffix Tree Clustering

Wicaksono, Tatas and K., Martiana Entin and Mubtada'i, Nur Rosyid (2010) Text Mining Untuk Pencarian Dokumen Bahasa Inggris Menggunakan Suffix Tree Clustering. eepis final project.

[img]
Preview
PDF - Published Version
Download (253Kb) | Preview

    Abstract

    A search of the collection of documents generally provide excerpts of the documents are arranged according to rank matches in a long list. Not infrequently a search result in tens and even hundreds of fragments of documents that caused a user to scroll the screen up and down (scrolling) to examine the documents snippet one by one. This situation causes a user is having difficulty in determining which documents relevant to the topic he wants. In this Final Project developed an application web-based document segmentation with suffix tree clustering method. The basic concept of this method is to classify documents in the search results to form groups or clusters based on words or phrases contained in these documents. The application requires the search input and output will result in clusters containing the corresponding documents. This cluster can be stratified depending on the word or phrase that might be distinguished on the same parent cluster. Clusters generated is displayed to the user. Then on the last cluster is selected will display a collection of documents, each consisting of the title and snippet of the document. With this method expected results would be easier to trace. Keywords : text mining, suffix tree, suffix tree clustering, the grouping of documents.

    Item Type: Article
    Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
    Q Science > QA Mathematics > QA76 Computer software
    Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
    Depositing User: Ms Annisa Nurul Faridha
    Date Deposited: 29 Jul 2011 15:21
    Last Modified: 29 Jul 2011 15:21
    URI: http://repo.pens.ac.id/id/eprint/365

    Actions (login required)

    View Item