Concept classification via Google page counts

By involution

Two years ago, I explored the possibility of using Google page counts to measure similarity between concepts. The results are contained in this article. Although the results were not as strong as I had hoped, they may serve as a starting point for further exploration.

2 Responses to “Concept classification via Google page counts”

  1. Carnival of Mathematics IX « JD2718 Says:

    [...] attempts to Classify Concepts by search engine page Counts. [...]

  2. Coconuts Says:

    Interesting work! It reminds me a lot of Latent Semantic Analysis, which similarly calculates similarity between terms by how often they co-occur in documents. The algorithm that LSA uses is a little more complicated and, I think, might address some of the weaknesses pointed out in the paper. I think it might be neat to try to combine the two approaches, maybe by using Google to find a more “interesting” document set to feed to LSA.
