Google has gotten itself into quite the pickle. On one hand, as an advertising company, it benefits from more content, even worthless content, because more pages mean more places for ad placement and more revenue opportunities. On the other hand, content that was scraped or sourced from a content farm devalues search results and may lead to user defections. I can’t help but think that Google’s multiple projects (Apps, Android, TV, and Chrome OS) have come at the cost of the search index.
My simple solution would be a semantic index in which only the first/best result is shown, with the rest clustered behind a “similar results” link.
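
For what it's worth, here is a rough Python sketch of that clustering idea. It is only an illustration, not a claim about how Google's index works. The names `Result`, `Cluster`, `cluster_results`, the `score` field, and the `threshold` value are all made up for the example. It also cheats on the "semantic" part: it compares pages by Jaccard overlap of three-word shingles, which is a purely lexical stand-in that catches scraped copies but would miss paraphrased ones.

```python
from dataclasses import dataclass, field


@dataclass
class Result:
    url: str
    text: str
    score: float  # hypothetical relevance score, higher is better


@dataclass
class Cluster:
    best: Result                                  # the one result shown
    similar: list = field(default_factory=list)   # hidden behind "similar results"


def shingles(text, k=3):
    """Return the set of k-word sequences in text (lowercased)."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: |a & b| / |a | b|."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_results(results, threshold=0.5):
    """Greedy clustering: visit results from best to worst score and
    attach each one to the first cluster whose best result it resembles."""
    clusters = []
    for r in sorted(results, key=lambda r: r.score, reverse=True):
        sig = shingles(r.text)
        for c in clusters:
            if jaccard(sig, shingles(c.best.text)) >= threshold:
                c.similar.append(r)
                break
        else:
            # No existing cluster is close enough, so r leads a new one.
            clusters.append(Cluster(best=r))
    return clusters
```

Because results are visited in descending score order, the highest-scoring page in each group of near-duplicates becomes the one displayed, and scraped copies land in its `similar` list. Whether the original actually outscores the scraper is a separate ranking problem that this sketch assumes away.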