The compilation of a sample PFR Chinese corpus of Skeleton-parsed sentences

Published 10-04-2005
May Lai-Yin Wong

Abstract

The approach taken in this paper to constructing a treebank is inspired by skeleton parsing. A sample text of some 100,000 word tokens was chosen from the PFR Chinese Corpus for the production of the treebank. The parsing scheme gives a clear account of the 17 non-terminal constituents that are defined and instantiated in the corpus texts. A set of parsing guidelines covers the practical issues that arise in mapping parses onto sentences when the parsing scheme is applied. The major difficulties encountered in the course of skeleton parsing are also discussed, as they illuminate some of the peculiarities of the Chinese language. The paper concludes with an evaluation of the success of the treebank compilation.

How to cite

Wong, May Lai-Yin. 2005. «The Compilation of a Sample PFR Chinese Corpus of Skeleton-Parsed Sentences». Anuario Del Seminario De Filología Vasca "Julio De Urquijo" 39 (2):271-87. https://doi.org/10.1387/asju.4358.

Section
Articles