Question A

The low-scoring phrases share a common pattern: they all contain a preposition or conjunction, for example [for, of, in, and]. A phrase containing one of these words is generally not a noun phrase and should not be in the list. This suggests that AutoPhrase may be giving too little weight to the POS labels.
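
The pattern above can be checked mechanically. The following is a minimal sketch, not part of AutoPhrase itself: it flags any phrase that contains one of the four words listed above as a whole token. The function names and the example phrases are hypothetical and are not taken from the actual AutoPhrase output.

```python
# Function words observed in the low-scoring phrases (from the list above).
FUNCTION_WORDS = {"for", "of", "in", "and"}


def contains_function_word(phrase, function_words=FUNCTION_WORDS):
    """Return True if any whitespace-separated token is a function word."""
    tokens = phrase.lower().split()
    return any(token in function_words for token in tokens)


def split_phrases(phrases):
    """Split phrases into (kept, flagged) by the function-word check."""
    kept = [p for p in phrases if not contains_function_word(p)]
    flagged = [p for p in phrases if contains_function_word(p)]
    return kept, flagged


if __name__ == "__main__":
    # Hypothetical examples, for illustration only.
    examples = ["information retrieval", "state of the", "used for"]
    kept, flagged = split_phrases(examples)
    print("kept:", kept)
    print("flagged:", flagged)
```

The check works on whole tokens, so a word such as "information" is not flagged just because it starts with "in". A real filter would more likely rely on POS tags than on a fixed word list.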

'universally composable' has a very high score in the DBLP corpus even though it is not a noun phrase. This is probably because 'universally composable' almost always appears together with a following word to form a computer science term, and AutoPhrase's distant supervision wrongly chose only the first two words as the phrase: the phrase was not extended far enough.

Question B

Question C

Discussion

One interesting observation is that the most similar word to a phrase is often its own abbreviation, for example artificial intelligence (ai) and natural language processing (nlp).

Some words such as pedagogical have a slightly higher score than their root word, here pedagogy; that is, the adjective scores slightly higher than the noun. Furthermore, the most_similar result for performance evaluation is performance analysis, which looks reasonable since the two are synonyms.
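
To make the similarity ranking concrete, here is a minimal sketch of a most_similar-style lookup, assuming the embeddings are compared by cosine similarity. The three-dimensional vectors are made-up toy values chosen for illustration, not embeddings learned from the corpus, and the helper names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(query, vectors, topn=1):
    """Rank every other entry by cosine similarity to the query."""
    scores = [
        (word, cosine(vectors[query], vec))
        for word, vec in vectors.items()
        if word != query
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:topn]


# Toy vectors (assumption): NOT real embeddings from the corpus.
toy_vectors = {
    "performance_evaluation": [0.9, 0.1, 0.2],
    "performance_analysis": [0.8, 0.2, 0.2],
    "pedagogy": [0.1, 0.9, 0.3],
}

if __name__ == "__main__":
    print(most_similar("performance_evaluation", toy_vectors))
```

With these toy values the nearest neighbour of performance_evaluation is performance_analysis, because their vectors point in almost the same direction.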