Profile photo for Priyank Singh

Two simple reasons:-

Logarithm function slope decreases as N/df value increases. This means that beyond a point, increasing N dramatically will not affect TF-IDF score as much - which mimics real life here. Beyond a point, dissimilarity will not matter much.

Log of 1 is 0. Hence when “i” is contained in all documents, w will be zero. Which means documents are completely similar, and their inverse similarity is zero.

View 3 other answers to this question
About · Careers · Privacy · Terms · Contact · Languages · Your Ad Choices · Press ·
© Quora, Inc. 2025