Word Name

The NIPS vocabulary contains 13,649 word types extracted from the full papers.

The CiteSeer vocabulary contains 30,799 word types. These were extracted automatically from PDF or PostScript files and are therefore much noisier than those in the NIPS database.

Standard stop words are removed from the vocabulary in both cases.
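
As a rough illustration of what removing stop words from a vocabulary involves, here is a minimal Python sketch. The stop list, the tokenizer, and the name build_vocabulary are all assumptions made for this example. The page does not say which stop list or tokenization was used for the NIPS or CiteSeer data.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop list for illustration only;
# the actual list used for these data sets is not given in the source.
STOP_WORDS = frozenset({"a", "an", "and", "are", "in", "is", "of", "the", "to"})


def build_vocabulary(documents, stop_words=STOP_WORDS):
    """Count occurrences of each word type, skipping stop words."""
    counts = Counter()
    for text in documents:
        # Assumed tokenizer: lowercase the text and keep alphabetic runs.
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in stop_words)
    return counts


docs = [
    "The model learns topics.",
    "Topics are distributions of words in the model.",
]
vocab = build_vocabulary(docs)
print(len(vocab))  # number of distinct word types left after filtering
```

The number of keys in the returned counter is the number of word types, which is the kind of figure quoted above for each collection.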

 
