A NOVEL APPROACH FOR TEXT SIMILARITY MEASURE AND CLASSIFICATION


Published Jan 20, 2018


Vol. 3 No. 2 (2018): IEJRD

Ms. Bhawna Gayakwad

MTech. Computer Science & Engineering

Dr. S. D. Choudhari

SBITM College of Engineering, Betul, India

Abstract

Measuring the similarity between documents is an important operation in text processing. In this paper, we
propose a new similarity measure for document clustering. To compute the similarity between two documents
with respect to a feature, the proposed measure takes the following three cases into account:
1) the feature appears in both documents, 2) the feature appears in only one document, and 3) the feature
appears in neither document. In the first case, the similarity increases as the difference between the two
feature values decreases, and the contribution of this difference is normally scaled by the feature values.
In the second case, a constant value contributes to the similarity. In the third case, the feature is absent
from both documents and therefore makes no contribution to the similarity. The proposed measure is also
extended to estimate the similarity between two sets of documents, with the aim of obtaining effective
results and better performance.
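
The three cases described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so everything specific here is an assumption made for illustration: the Gaussian-shaped term for features present in both documents, the scale `sigma`, the constant `penalty` for features present in only one document, the final rescaling to the range 0 to 1, and the averaging of pairwise scores for document sets. The function names are hypothetical, and documents are assumed to be equal-length lists of non-negative feature values such as term weights.

```python
import math


def feature_contribution(a, b, sigma=1.0, penalty=1.0):
    """Return (contribution, counted) for one feature of two documents.

    Assumed forms, not taken from the paper:
    - present in both: grows toward 1 as |a - b| shrinks, scaled by sigma
    - present in one:  a constant negative value (-penalty)
    - present in none: no contribution, and the feature is not counted
    """
    if a > 0 and b > 0:
        return 0.5 * (1.0 + math.exp(-((a - b) / sigma) ** 2)), True
    if a > 0 or b > 0:
        return -penalty, True
    return 0.0, False


def document_similarity(d1, d2, sigma=1.0, penalty=1.0):
    """Average the per-feature contributions and rescale to [0, 1]."""
    total = 0.0
    counted = 0
    for a, b in zip(d1, d2):
        value, used = feature_contribution(a, b, sigma, penalty)
        if used:
            total += value
            counted += 1
    if counted == 0:
        # No feature appears in either document (assumed convention).
        return 0.0
    raw = total / counted  # lies in [-penalty, 1]
    return (raw + penalty) / (1.0 + penalty)


def set_similarity(set_a, set_b, sigma=1.0, penalty=1.0):
    """Assumed set-level extension: mean similarity over all cross pairs."""
    pairs = [(x, y) for x in set_a for y in set_b]
    if not pairs:
        return 0.0
    scores = [document_similarity(x, y, sigma, penalty) for x, y in pairs]
    return sum(scores) / len(scores)
```

Under these assumptions, two identical documents score 1, two documents with no shared features score 0, and a feature that is absent from both documents leaves the score unchanged.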

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 Unported License.

How to Cite

[1]

Ms. Bhawna Gayakwad and Dr. S. D. Choudhari, “A NOVEL APPROACH FOR TEXT SIMILARITY MEASURE AND CLASSIFICATION”, IEJRD - International Multidisciplinary Journal, vol. 3, no. 2, p. 7, Jan. 2018.
