Training Data Sets

Su Wang discusses the data sets used in training, including feature norms and probabilistic labels. The representative data set incorporates human common sense knowledge, while the distributional corpus takes advantage of distributional patterns in labeled concepts.