Publication:

Statistical Filtering and Subcategorization Frame Acquisition

A. Korhonen, G. Gorrell, Diana McCarthy • Workshop on Very Large Corpora • 07 October 2000

TLDR: Three approaches to filtering out spurious hypotheses are compared. Two hypothesis tests perform poorly compared to filtering frames on the basis of relative frequency; reasons for this and directions for future research are considered.

Citations: 57
Abstract: Research into the automatic acquisition of subcategorization frames (SCFs) from corpora is starting to produce large-scale computational lexicons which include valuable frequency information. However, the accuracy of the resulting lexicons shows room for improvement. One significant source of error lies in the statistical filtering used by some researchers to remove noise from automatically acquired subcategorization frames. In this paper, we compare three different approaches to filtering out spurious hypotheses. Two hypothesis tests perform poorly, compared to filtering frames on the basis of relative frequency. We discuss reasons for this and consider directions for future research.
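
As an illustration of the relative-frequency filtering mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes that each verb has a table of hypothesized SCF counts and that a frame is kept when its share of the verb's total count reaches a threshold. The function name, the frame labels, the counts, and the threshold value of 0.05 are all hypothetical and chosen only for the example.

```python
from collections import Counter


def filter_by_relative_frequency(scf_counts, threshold=0.05):
    """Keep frames whose relative frequency for one verb is >= threshold.

    scf_counts: mapping from frame label to hypothesized count (illustrative).
    threshold:  assumed cutoff; the abstract does not state a value.
    Returns a dict mapping each kept frame to its relative frequency.
    """
    total = sum(scf_counts.values())
    if total == 0:
        return {}
    kept = {}
    for frame, count in scf_counts.items():
        rel_freq = count / total
        if rel_freq >= threshold:
            kept[frame] = rel_freq
    return kept


# Hypothetical counts for a single verb; not data from the paper.
counts = Counter({"NP": 60, "NP_PP": 30, "SCOMP": 8, "NP_NP": 2})
kept = filter_by_relative_frequency(counts, threshold=0.05)
```

With these made-up counts, the rare frame `NP_NP` (2 of 100 occurrences) falls below the assumed threshold and is discarded, while the other three frames are retained. Unlike a hypothesis test, this filter needs no estimate of an error probability per frame, only the observed counts.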
