A Strategy for Near-Deduplication Web Documents Considering Both Domain &Size of the Document

Authors

Dr.K.Bhargavi , K.Gowtham Reddy, Dodla Navadeep Reddy ,S.Vaishnavi

Abstract

The alike and nearduplicate abstracts are breeding a boundless botheration for seek engines appropriately decelerate or access the amount of confined answers Elimination of nearduplicates save arrangement bandwidth and reduces the accumulator amount and advances the superior of seek indexes It aswell decreases the amount on the limited host that is confined such web documents Server applications are aswell benefited by identification of abreast duplicates

Downloads

Published

2023-02-23 11:15:16

How to Cite

Near-Duplicate,TF-IDF,NLTK

Issue

Vol. 71 No. 4 (2022)

Section

Articles

Mathematical Statistician and Engineering Applications

A Strategy for Near-Deduplication Web Documents Considering Both Domain &Size of the Document

Authors

Abstract

Downloads

Published

How to Cite

Issue

Section

Make a Submission

Downloads

Important Links

Information