
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-Beginner: POS Tags: VERB

There are 372 VERB lemmas (21%), 372 VERB types (21%) and 2948 VERB tokens (15%). Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: 去、 有、 来、 吃、 说、 做、 喜欢、 到、 看、 买

The 10 most frequent VERB types: 去、 有、 来、 吃、 说、 做、 喜欢、 到、 看、 买

The 10 most frequent ambiguous lemmas: 有 (VERB 180, AUX 17, ADJ 1, ADP 1, SCONJ 1), 来 (VERB 137, ADP 7, ADV 1, SCONJ 1), 做 (VERB 84, NOUN 1), 到 (VERB 73, ADP 15), 上 (VERB 48, NOUN 28, ADP 5), 是 (AUX 277, VERB 48, CCONJ 2, ADV 1), 在 (ADP 88, VERB 42, ADV 38), 别 (VERB 41, ADJ 8, PRON 1), 下 (VERB 34, NOUN 9, ADV 1), 学 (VERB 27, NOUN 4)

The 10 most frequent ambiguous types: 有 (VERB 180, AUX 17, ADJ 1, ADP 1, SCONJ 1), 来 (VERB 137, ADP 7, ADV 1, SCONJ 1), 做 (VERB 84, NOUN 1), 到 (VERB 73, ADP 15), 上 (VERB 48, NOUN 28, ADP 5), 是 (AUX 277, VERB 48, CCONJ 2, ADV 1), 在 (ADP 88, VERB 42, ADV 38), 别 (VERB 41, ADJ 8, PRON 1), 下 (VERB 34, NOUN 9, ADV 1), 学 (VERB 27, NOUN 4)

Morphology

The form / lemma ratio of VERB is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “一直吃”: 一直吃.

The 2nd highest number of forms (1) was observed with the lemma “上”: 上.

The 3rd highest number of forms (1) was observed with the lemma “下”: 下.
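The figures above can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes that the form / lemma ratio is the number of distinct word forms divided by the number of distinct lemmas for the tag, which is consistent with 372 / 372 = 1.000000 here. The function names, the `row` helper and the two-sentence `SAMPLE` are hypothetical toy material and are not taken from UD_Chinese-Beginner.

```python
from collections import defaultdict


def forms_per_lemma(conllu_text, upos="VERB"):
    """Map each lemma tagged `upos` to the set of word forms seen with it."""
    forms = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Regular token lines have 10 columns and a plain integer ID; this
        # skips comments, blank lines, multiword ranges and empty nodes.
        if len(cols) == 10 and cols[0].isdigit() and cols[3] == upos:
            forms[cols[2]].add(cols[1])
    return forms


def form_lemma_ratio(forms):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    distinct_forms = set().union(*forms.values())
    return len(distinct_forms) / len(forms) if forms else 0.0


def row(*cols):
    """Build one tab-separated CoNLL-U token line (toy helper)."""
    return "\t".join(cols)


# Hypothetical toy data, annotated only for illustration.
SAMPLE = "\n".join([
    "# text = 我去学校。",
    row("1", "我", "我", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "去", "去", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "学校", "学校", "NOUN", "_", "_", "2", "obj", "_", "_"),
    row("4", "。", "。", "PUNCT", "_", "_", "2", "punct", "_", "_"),
    "",
    "# text = 他喜欢去。",
    row("1", "他", "他", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "喜欢", "喜欢", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "去", "去", "VERB", "_", "_", "2", "ccomp", "_", "_"),
    row("4", "。", "。", "PUNCT", "_", "_", "2", "punct", "_", "_"),
    "",
])

forms = forms_per_lemma(SAMPLE)
print(form_lemma_ratio(forms))  # 1.0 for the toy sample
```

Sorting the returned mapping by the size of each form set would give the "highest number of forms" ranking. In a treebank where every lemma has exactly one form, as reported here, that ranking is arbitrary among ties.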

VERB occurs with 1 feature: Polarity (3; 0% instances)

VERB occurs with 1 feature-value pair: Polarity=Neg

VERB occurs with 2 feature combinations. The most frequent feature combination is _, i.e. no features (2945 tokens). Examples: 去、 有、 来、 吃、 说、 做、 喜欢、 到、 看、 买

Relations

VERB nodes are attached to their parents using 16 different relations: root (1564; 53% instances), parataxis (255; 9% instances), ccomp (252; 9% instances), conj (160; 5% instances), flat (122; 4% instances), acl (109; 4% instances), dep (109; 4% instances), compound:svc (103; 3% instances), compound:vv (96; 3% instances), advcl (92; 3% instances), csubj (42; 1% instances), obl:tmod (34; 1% instances), appos (4; 0% instances), obl:lmod (3; 0% instances), fixed (2; 0% instances), discourse (1; 0% instances)

Parents of VERB nodes belong to 13 different categories, counting the artificial root (which has no part of speech) as one: root (1564; 53% instances), VERB (986; 33% instances), ADJ (183; 6% instances), NOUN (177; 6% instances), AUX (13; 0% instances), PRON (8; 0% instances), PART (7; 0% instances), ADV (3; 0% instances), CCONJ (2; 0% instances), PROPN (2; 0% instances), ADP (1; 0% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)
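As a rough illustration of how the two parent-side tallies relate, the sketch below counts, for every VERB token, the relation to its parent (the DEPREL column) and the parent's part of speech (looked up through the HEAD column). It is an assumption-laden sketch rather than the script behind this page: `parent_stats`, the `row` helper, the `(root)` placeholder for tokens whose HEAD is 0, and the toy `SAMPLE` are all hypothetical.

```python
from collections import Counter


def parent_stats(conllu_text, upos="VERB"):
    """Count incoming relations and parent UPOS tags for tokens tagged `upos`."""
    deprels, parent_pos = Counter(), Counter()
    sentence = {}  # token ID -> (UPOS, HEAD, DEPREL) for the current sentence
    # The extra "" guarantees that the last sentence is processed too.
    for line in conllu_text.splitlines() + [""]:
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            sentence[cols[0]] = (cols[3], cols[6], cols[7])
        elif not line.strip():  # a blank line ends a sentence
            for tag, head, rel in sentence.values():
                if tag == upos:
                    deprels[rel] += 1
                    # HEAD 0 means the token hangs off the artificial root.
                    parent_pos[sentence[head][0] if head != "0" else "(root)"] += 1
            sentence = {}
    return deprels, parent_pos


def row(*cols):
    """Build one tab-separated CoNLL-U token line (toy helper)."""
    return "\t".join(cols)


# Hypothetical toy data, annotated only for illustration.
SAMPLE = "\n".join([
    "# text = 我去学校。",
    row("1", "我", "我", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "去", "去", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "学校", "学校", "NOUN", "_", "_", "2", "obj", "_", "_"),
    row("4", "。", "。", "PUNCT", "_", "_", "2", "punct", "_", "_"),
    "",
    "# text = 他喜欢去。",
    row("1", "他", "他", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "喜欢", "喜欢", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "去", "去", "VERB", "_", "_", "2", "ccomp", "_", "_"),
    row("4", "。", "。", "PUNCT", "_", "_", "2", "punct", "_", "_"),
])

deprels, parent_pos = parent_stats(SAMPLE)
print(deprels.most_common())
print(parent_pos.most_common())
```

Because every token has exactly one parent, both counters sum to the number of VERB tokens, and the count for the `root` relation equals the count of parentless nodes. The figures on this page show the same pattern (1564 in both lists).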

264 (9%) VERB nodes are leaves.

383 (13%) VERB nodes have one child.

251 (9%) VERB nodes have two children.

2050 (70%) VERB nodes have three or more children.

The highest child degree of a VERB node is 9.
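The child-degree breakdown above (leaves, one child, two children, three or more, maximum) can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the generating script: `child_degrees`, the `row` helper and the toy `SAMPLE` are hypothetical, and punctuation is counted as a child like any other dependent.

```python
from collections import Counter


def child_degrees(conllu_text, upos="VERB"):
    """Return the number of children of every token tagged `upos`."""
    degrees = []
    tags, heads = {}, []  # per-sentence: token ID -> UPOS, and all HEAD values
    # The extra "" guarantees that the last sentence is processed too.
    for line in conllu_text.splitlines() + [""]:
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            tags[cols[0]] = cols[3]
            heads.append(cols[6])
        elif not line.strip():  # a blank line ends a sentence
            n_children = Counter(heads)  # missing IDs count as 0 children
            degrees += [n_children[i] for i, tag in tags.items() if tag == upos]
            tags, heads = {}, []
    return degrees


def row(*cols):
    """Build one tab-separated CoNLL-U token line (toy helper)."""
    return "\t".join(cols)


# Hypothetical toy data, annotated only for illustration.
SAMPLE = "\n".join([
    "# text = 我去学校。",
    row("1", "我", "我", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "去", "去", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "学校", "学校", "NOUN", "_", "_", "2", "obj", "_", "_"),
    row("4", "。", "。", "PUNCT", "_", "_", "2", "punct", "_", "_"),
    "",
    "# text = 他喜欢去。",
    row("1", "他", "他", "PRON", "_", "_", "2", "nsubj", "_", "_"),
    row("2", "喜欢", "喜欢", "VERB", "_", "_", "0", "root", "_", "_"),
    row("3", "去", "去", "VERB", "_", "_", "2", "ccomp", "_", "_"),
    row("4", "。", "。", "PUNCT", "_", "_", "2", "punct", "_", "_"),
])

degrees = child_degrees(SAMPLE)
# Bucket degrees as 0, 1, 2 and "3 or more" (stored under key 3).
buckets = Counter(min(d, 3) for d in degrees)
print(buckets, max(degrees))
```

The four buckets partition the VERB tokens, so they should add up to the token count, as they do on this page (264 + 383 + 251 + 2050 = 2948).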

Children of VERB nodes are attached using 29 different relations: punct (2097; 20% instances), obj (1626; 16% instances), nsubj (1576; 15% instances), advmod (1327; 13% instances), aux (689; 7% instances), obl:tmod (354; 3% instances), discourse (339; 3% instances), obl (288; 3% instances), discourse:sp (284; 3% instances), ccomp (221; 2% instances), parataxis (218; 2% instances), mark (213; 2% instances), obl:arg (212; 2% instances), conj (142; 1% instances), advcl (132; 1% instances), cc (126; 1% instances), dep (124; 1% instances), compound:svc (107; 1% instances), compound:vv (103; 1% instances), flat (96; 1% instances), cop (52; 0% instances), obl:lmod (42; 0% instances), csubj (14; 0% instances), vocative (9; 0% instances), nsubj:outer (6; 0% instances), clf (5; 0% instances), xcomp (5; 0% instances), appos (1; 0% instances), iobj (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (2241; 22% instances), PUNCT (2097; 20% instances), PRON (1498; 14% instances), ADV (1434; 14% instances), VERB (986; 9% instances), PART (758; 7% instances), AUX (749; 7% instances), ADJ (238; 2% instances), PROPN (129; 1% instances), SCONJ (100; 1% instances), CCONJ (60; 1% instances), ADP (54; 1% instances), INTJ (30; 0% instances), DET (23; 0% instances), NUM (12; 0% instances)