
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-Beginner: POS Tags: PROPN

There are 85 PROPN lemmas (5%), 85 PROPN types (5%) and 252 PROPN tokens (1%). Out of 15 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.
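The three counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is only a minimal sketch: the function names `upos_counts` and `rank` are hypothetical, the sample rows are hand-made rather than taken from the treebank, and it assumes that "types" means distinct case-sensitive word forms and that ties in rank are broken arbitrarily.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}

def rank(counts, tag, index):
    """1-based rank of `tag` by column `index` (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1

# Hand-made sample rows (assumption, not treebank data): ID, FORM, LEMMA, UPOS.
rows = [
    ("1", "John", "john", "PROPN"),
    ("2", "去", "去", "VERB"),
    ("3", "中国", "中国", "PROPN"),
    ("4", "。", "。", "PUNCT"),
    ("1", "我", "我", "PRON"),
    ("2", "在", "在", "ADP"),
    ("3", "中国", "中国", "PROPN"),
]
# Pad each row with six placeholder columns to reach the 10 CoNLL-U fields.
sample = "\n".join("\t".join(r + ("_",) * 6) for r in rows)
counts = upos_counts(sample)
```

On the sample, `counts["PROPN"]` holds two lemmas, two types and three tokens. Run on the full treebank file, the same function should give figures comparable to those reported above, provided the counting conventions match.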

The 10 most frequent PROPN lemmas: 中国、 上海、 北京、 美国、 iphone、 小李、 john、 ktv、 四川、 小张

The 10 most frequent PROPN types: 中国、 上海、 北京、 美国、 iPhone、 小李、 John、 KTV、 四川、 小张

The 9 ambiguous lemmas: 苹果 (PROPN 3, NOUN 2), 严格 (PROPN 2, ADJ 1), 南京路 (NOUN 1, PROPN 1), 可乐 (NOUN 1, PROPN 1), 周 (NOUN 6, PROPN 1), 国 (NOUN 3, PROPN 1), 张 (NOUN 5, PROPN 1), 毛 (NOUN 5, PROPN 1), 钱 (NOUN 48, PROPN 1)

The 9 ambiguous types: 苹果 (PROPN 3, NOUN 2), 严格 (PROPN 2, ADJ 1), 南京路 (NOUN 1, PROPN 1), 可乐 (NOUN 1, PROPN 1), 周 (NOUN 6, PROPN 1), 国 (NOUN 3, PROPN 1), 张 (NOUN 5, PROPN 1), 毛 (NOUN 5, PROPN 1), 钱 (NOUN 48, PROPN 1)
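A lemma (or type) is ambiguous here when it occurs with PROPN and with at least one other tag. The sketch below shows one way such a list could be derived. The function name `ambiguous_with` is hypothetical, the counts for 苹果 and 钱 are copied from the lists above, and the count for 中国 is an invented placeholder that only serves to show an unambiguous entry being filtered out.

```python
from collections import Counter, defaultdict

def ambiguous_with(pairs, tag):
    """Return {item: {upos: count}} for items seen with `tag` and another tag.

    `pairs` is an iterable of (item, upos), where item is a lemma or a form.
    """
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    return {
        item: dict(tags)
        for item, tags in by_item.items()
        if tag in tags and len(tags) > 1
    }

pairs = (
    [("苹果", "PROPN")] * 3 + [("苹果", "NOUN")] * 2
    + [("钱", "NOUN")] * 48 + [("钱", "PROPN")]
    + [("中国", "PROPN")] * 4  # placeholder count, not from the treebank
)
result = ambiguous_with(pairs, "PROPN")
```

Here `result` keeps 苹果 and 钱 with their per-tag counts and drops 中国, which occurs with a single tag only.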

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.000000).
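The page does not spell out how the ratio is computed. One plausible reading, used in the sketch below as an assumption, is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, so a value of 1.0 means that every lemma has exactly one form. The function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct (lemma, form) pairs per distinct lemma; 0.0 for empty input."""
    distinct = set(pairs)
    lemmas = {lemma for lemma, _ in distinct}
    return len(distinct) / len(lemmas) if lemmas else 0.0

# Lemma/form pairs as they appear in the frequency lists above.
observed = [("iphone", "iPhone"), ("john", "John"), ("中国", "中国"), ("中国", "中国")]
ratio_observed = form_lemma_ratio(observed)

# Hypothetical contrast: a lemma with two spellings raises the ratio.
contrast = observed + [("iphone", "IPHONE")]
ratio_contrast = form_lemma_ratio(contrast)
```

Repeated tokens do not affect the result, since only distinct pairs are counted.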

No PROPN lemma has more than one form. The first three lemmas listed, each with a single form, are “alana” (Alana), “app” (APP) and “apple” (Apple).

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 10 different relations: obj (92; 37% instances), nmod (70; 28% instances), obl:arg (33; 13% instances), nsubj (26; 10% instances), conj (17; 7% instances), obl:lmod (6; 2% instances), root (5; 2% instances), advcl (1; 0% instances), compound (1; 0% instances), nsubj:outer (1; 0% instances)
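The percentages in this list are each relation's share of all 252 PROPN tokens, rounded to the nearest whole number. A short check, using only the counts printed above (the variable names are illustrative):

```python
# Relation counts copied from the list above.
counts = {
    "obj": 92, "nmod": 70, "obl:arg": 33, "nsubj": 26, "conj": 17,
    "obl:lmod": 6, "root": 5, "advcl": 1, "compound": 1, "nsubj:outer": 1,
}
total = sum(counts.values())  # should equal the number of PROPN tokens
shares = {rel: round(100 * n / total) for rel, n in counts.items()}
```

Relations that occur only once round down to 0%, which is why advcl, compound and nsubj:outer appear with "0% instances".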

Parents of PROPN nodes fall into 8 categories (7 parts of speech plus the artificial root, which has no tag): VERB (129; 51% instances), NOUN (83; 33% instances), PROPN (15; 6% instances), ADJ (13; 5% instances), root (5; 2% instances), ADV (3; 1% instances), AUX (3; 1% instances), PRON (1; 0% instances)

155 (62%) PROPN nodes are leaves.

77 (31%) PROPN nodes have one child.

10 (4%) PROPN nodes have two children.

10 (4%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.
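The child-degree figures above can be obtained by counting, for every token, how many other tokens name it as their head. The sketch below assumes one sentence given as parallel lists of 1-based head indices (0 for root) and UPOS tags. The function name `child_degree_histogram` and the five-token example (在 上海 和 北京 工作, with a plausible but hand-made analysis) are illustrative and not taken from the treebank.

```python
from collections import Counter

def child_degree_histogram(heads, upos, tag):
    """Histogram of child counts for tokens tagged `tag`.

    heads[i] is the 1-based head of token i+1 (0 = root).
    Degrees of three or more are pooled into bucket 3.
    """
    degree = Counter(h for h in heads if h > 0)
    hist = Counter()
    for index, token_tag in enumerate(upos, start=1):
        if token_tag == tag:
            hist[min(degree[index], 3)] += 1
    return hist

# 在 上海 和 北京 工作: 在 -> 上海, 上海 -> 工作, 和 -> 北京, 北京 -> 上海, 工作 = root
heads = [2, 5, 4, 2, 0]
upos = ["ADP", "PROPN", "CCONJ", "PROPN", "VERB"]
hist = child_degree_histogram(heads, upos, "PROPN")
```

In this example 上海 has two children (在 and 北京) and 北京 has one (和), so the histogram has one entry in bucket 2 and one in bucket 1. Summing such histograms over all sentences would give the leaf, one-child, two-children and three-or-more counts reported above.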

Children of PROPN nodes are attached using 15 different relations: case (57; 43% instances), punct (21; 16% instances), conj (20; 15% instances), cc (8; 6% instances), cop (6; 5% instances), nsubj (6; 5% instances), advmod (3; 2% instances), clf (3; 2% instances), discourse:sp (2; 2% instances), acl (1; 1% instances), amod (1; 1% instances), appos (1; 1% instances), discourse (1; 1% instances), nmod (1; 1% instances), parataxis (1; 1% instances)

Children of PROPN nodes belong to 12 different parts of speech: ADP (36; 27% instances), PART (26; 20% instances), PUNCT (21; 16% instances), PROPN (15; 11% instances), NOUN (10; 8% instances), CCONJ (8; 6% instances), AUX (6; 5% instances), ADV (3; 2% instances), ADJ (2; 2% instances), PRON (2; 2% instances), VERB (2; 2% instances), INTJ (1; 1% instances)