NLP-KG
Semantic Search

Publication:

Reflections on the Penn Discourse TreeBank, Comparable Corpora, and Complementary Annotation

R. PrasadBonnie L. WebberAravind K. Joshi • @Computational Linguistics • 01 December 2014

TLDR: A comprehensive introduction to the Penn Discourse Treebank is provided to correct some wrong (or perhaps inadvertent) assumptions about the PDTB and its annotation and to explain variations seen in the annotation of comparable resources in other languages and genres to allow developers of future comparable resources to recognize whether the variations are relevant to them.

Citations: 100
Abstract: The Penn Discourse Treebank (PDTB) was released to the public in 2008. It remains the largest manually annotated corpus of discourse relations to date. Its focus on discourse relations that are either lexically-grounded in explicit discourse connectives or associated with sentential adjacency has not only facilitated its use in language technology and psycholinguistics but also has spawned the annotation of comparable corpora in other languages and genres.Given this situation, this paper has four aims: (1) to provide a comprehensive introduction to the PDTB for those who are unfamiliar with it; (2) to correct some wrong (or perhaps inadvertent) assumptions about the PDTB and its annotation that may have weakened previous results or the performance of decision procedures induced from the data; (3) to explain variations seen in the annotation of comparable resources in other languages and genres, which should allow developers of future comparable resources to recognize whether the variations are relevant to them; and (4) to enumerate and explain relationships between PDTB annotation and complementary annotation of other linguistic phenomena. The paper draws on work done by ourselves and others since the corpus was released.

Related Fields of Study

loading

Citations

Sort by
Previous
Next

Showing results 1 to 0 of 0

Previous
Next

References

Sort by
Previous
Next

Showing results 1 to 0 of 0

Previous
Next