Ambiguity-aware document similarity