Concept classification via Google page counts

Two years ago, I explored the possibility of using Google page counts to measure similarity between concepts. The results are contained in this article. Although the results were not as strong as I had hoped, they may serve as a starting point for further exploration.

2 Responses to “Concept classification via Google page counts”

  1. Carnival of Mathematics IX « JD2718 Says:

    [...] attempts to Classify Concepts by search engine page Counts. [...]

  2. Coconuts Says:

    Interesting work! It reminds me a lot of Latent Semantic Analysis, which similarly calculates similarity between terms by how often they co-occur in documents. The algorithm that LSA uses is a little more complicated, and, I think, might address some of the weaknesses pointed out in the paper. I think it might be neat to try to combine the two approaches, maybe by using google to find a more “interesting” document set to feed to LSA.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s


Follow

Get every new post delivered to your Inbox.