
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-Beginner: POS Tags: PRON

There are 26 PRON lemmas (1%), 26 PRON types (1%) and 2157 PRON tokens (11%). Out of 15 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 5 in number of tokens.
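The three counts above can be reproduced from a treebank's CoNLL-U file. The following is a minimal sketch, assuming that "types" means distinct word forms and "lemmas" means distinct values of the LEMMA column. The function name `pos_counts` and the three-token sample are hypothetical and are not taken from the treebank.

```python
from collections import defaultdict

# Hypothetical three-token sample in CoNLL-U format (illustration only).
SAMPLE = """\
# text = 我喜欢你
1\t我\t我\tPRON\t_\tPerson=1\t2\tnsubj\t_\t_
2\t喜欢\t喜欢\tVERB\t_\t_\t0\troot\t_\t_
3\t你\t你\tPRON\t_\tPerson=2\t2\tobj\t_\t_
"""

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}

counts = pos_counts(SAMPLE)
print(counts["PRON"])  # (lemmas, types, tokens) for PRON in the sample
```

Ranking the per-tag results by each of the three numbers would give the rank figures reported for PRON.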

The 10 most frequent PRON lemmas: 我、 你、 他、 我们、 她、 他们、 你们、 什么、 怎么、 谁

The 10 most frequent PRON types: 我、 你、 他、 我们、 她、 他们、 你们、 什么、 怎么、 谁

The 10 most frequent ambiguous lemmas: 什么 (PRON 58, ADV 4, NOUN 1), 怎么 (PRON 56, ADV 19), 自己 (PRON 9, NOUN 3, ADV 2), 干吗 (PRON 6, VERB 1), 它 (NOUN 3, PRON 2, PART 1), 其 (DET 1, PRON 1), 几 (NUM 41, DET 1, PRON 1), 别 (VERB 41, ADJ 8, PRON 1), 这么 (ADV 38, PRON 1)

The 10 most frequent ambiguous types: 什么 (PRON 58, ADV 4, NOUN 1), 怎么 (PRON 56, ADV 19), 自己 (PRON 9, NOUN 3, ADV 2), 干吗 (PRON 6, VERB 1), 它 (NOUN 3, PRON 2, PART 1), 其 (DET 1, PRON 1), 几 (NUM 41, DET 1, PRON 1), 别 (VERB 41, ADJ 8, PRON 1), 这么 (ADV 38, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.000000 (the average of all parts of speech is 1.000000).
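The page does not define the form / lemma ratio. A plausible reading, assumed here, is the average number of distinct word forms per lemma. Under that assumption, 26 types over 26 lemmas gives exactly 1.0, which matches the figure above. The helper `form_lemma_ratio` and both sample lists are hypothetical.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    This definition is an assumption, not taken from the page.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)

# Uninflected pronouns: every lemma has exactly one form.
pron_pairs = [("我", "我"), ("你", "你"), ("我", "我"), ("什么", "什么")]
pron_ratio = form_lemma_ratio(pron_pairs)

# Contrast with an inflecting language, where one lemma has several forms.
verb_pairs = [("is", "be"), ("was", "be"), ("go", "go")]
verb_ratio = form_lemma_ratio(verb_pairs)

print(f"{pron_ratio:.6f} {verb_ratio:.6f}")
```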

The 1st highest number of forms (1) was observed with the lemma “为什么”: 为什么.

The 2nd highest number of forms (1) was observed with the lemma “什么”: 什么.

The 3rd highest number of forms (1) was observed with the lemma “他”: 他.

PRON occurs with 3 features: Person (1959; 91% instances), Number (360; 17% instances), PronType (167; 8% instances)

PRON occurs with 6 feature-value pairs: Number=Plur, Person=1, Person=2, Person=3, PronType=Ind, PronType=Int

PRON occurs with 9 feature combinations. The most frequent feature combination is Person=1 (634 tokens). Examples: 我、 我们、 咱们

Relations

PRON nodes are attached to their parents using 13 different relations: nsubj (1390; 64% instances), nmod (397; 18% instances), obj (185; 9% instances), obl (77; 4% instances), obl:arg (73; 3% instances), root (12; 1% instances), det (10; 0% instances), ccomp (5; 0% instances), conj (3; 0% instances), parataxis (2; 0% instances), appos (1; 0% instances), iobj (1; 0% instances), nsubj:outer (1; 0% instances)
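As a consistency check, the relation counts above can be summed and converted back to percentages. This sketch uses only the numbers listed on this page and assumes the percentages are rounded to the nearest integer.

```python
# Relation counts for PRON nodes, copied from the list above.
relations = {
    "nsubj": 1390, "nmod": 397, "obj": 185, "obl": 77, "obl:arg": 73,
    "root": 12, "det": 10, "ccomp": 5, "conj": 3, "parataxis": 2,
    "appos": 1, "iobj": 1, "nsubj:outer": 1,
}

# Every PRON token has exactly one incoming relation, so the counts
# should add up to the number of PRON tokens.
total = sum(relations.values())

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
print(shares["nsubj"], shares["nmod"], shares["obj"])
```

The total equals the 2157 PRON tokens reported at the top of the page, and relations with fewer than about 11 instances round down to 0%.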

Parents of PRON nodes belong to 9 different parts of speech: VERB (1498; 69% instances), NOUN (491; 23% instances), ADJ (125; 6% instances), AUX (19; 1% instances), no parent, i.e. the PRON node is the root (12; 1% instances), PRON (7; 0% instances), NUM (2; 0% instances), PROPN (2; 0% instances), PART (1; 0% instances)

1898 (88%) PRON nodes are leaves.

234 (11%) PRON nodes have one child.

11 (1%) PRON nodes have two children.

14 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 7.
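The child-degree figures above (1898 + 234 + 11 + 14 = 2157) can be derived from the HEAD column by counting how many nodes point at each PRON node. Below is a minimal sketch; the function `child_degrees`, the tuple layout, and the five-token sentence are hypothetical illustrations rather than treebank data.

```python
from collections import Counter

def child_degrees(sentence, upos):
    """Return the number of children of every node tagged `upos`.

    `sentence` is a list of (id, upos, head) tuples, where head 0 marks the root.
    """
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[node_id] for node_id, tag, _ in sentence if tag == upos]

# Hypothetical analysis of "他 喜欢 我 的 书":
# 他 and 书 depend on 喜欢, 我 depends on 书, and 的 depends on 我.
sentence = [
    (1, "PRON", 2),  # 他
    (2, "VERB", 0),  # 喜欢 (root)
    (3, "PRON", 5),  # 我
    (4, "PART", 3),  # 的
    (5, "NOUN", 2),  # 书
]

degrees = child_degrees(sentence, "PRON")

# Bucket as on this page: 0, 1, 2, and "three or more" (stored under key 3).
buckets = Counter(min(d, 3) for d in degrees)

print(degrees, dict(buckets), max(degrees))
```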

Children of PRON nodes are attached using 23 different relations: case (211; 67% instances), conj (21; 7% instances), cop (16; 5% instances), punct (15; 5% instances), nsubj (13; 4% instances), appos (8; 3% instances), advmod (6; 2% instances), cc (3; 1% instances), obl:tmod (3; 1% instances), acl (2; 1% instances), nmod (2; 1% instances), obl:arg (2; 1% instances), parataxis (2; 1% instances), advcl (1; 0% instances), amod (1; 0% instances), aux (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), fixed (1; 0% instances), mark (1; 0% instances), nsubj:outer (1; 0% instances), obj (1; 0% instances), vocative (1; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: PART (137; 44% instances), ADP (74; 24% instances), NOUN (40; 13% instances), AUX (17; 5% instances), PUNCT (15; 5% instances), ADV (9; 3% instances), VERB (8; 3% instances), PRON (7; 2% instances), CCONJ (3; 1% instances), ADJ (1; 0% instances), DET (1; 0% instances), PROPN (1; 0% instances), SCONJ (1; 0% instances)