home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Mbya_Guarani-Dooley: POS Tags: NOUN

There are 27 NOUN lemmas (22%), 28 NOUN types (21%) and 1278 NOUN tokens (11%). Out of 16 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent NOUN lemmas: _, ava, me, yke’y, a’yxy, akã, bicicleta, couve, enda, jeruxi

The 10 most frequent NOUN types: _, ava, tyke’y, Neme, Vexa’i, Xakã, bicicleta, couve, ime, jeruxi

The 10 most frequent ambiguous lemmas: _ (PART 2380, VERB 2363, PUNCT 1843, NOUN 1248, SCONJ 1129, PRON 1094, ADP 924, ADV 242, NUM 104, PROPN 99, ADJ 41, CCONJ 32, INTJ 15, DET 11, X 8, AUX 3)

The 10 most frequent ambiguous types: _ (PART 2380, VERB 2363, PUNCT 1843, NOUN 1248, SCONJ 1129, PRON 1094, ADP 924, ADV 242, NUM 104, PROPN 99, ADJ 41, CCONJ 32, INTJ 15, DET 11, X 8, AUX 3)

Morphology

The form / lemma ratio of NOUN is 1.037037 (the average of all parts of speech is 1.056000).

The 1st highest number of forms (2) was observed with the lemma “me”: Neme, ime.

The 2nd highest number of forms (1) was observed with the lemma “_”: _.

The 3rd highest number of forms (1) was observed with the lemma “a’yxy”: ta’yxy.

NOUN occurs with 3 features: Number[psor] (71; 6% instances), Number (17; 1% instances), Clusivity[psor] (12; 1% instances)

NOUN occurs with 5 feature-value pairs: Clusivity[psor]=Ex, Clusivity[psor]=In, Number=Plur, Number[psor]=Plur, Number[psor]=Sing

NOUN occurs with 6 feature combinations. The most frequent feature combination is _ (1190 tokens). Examples: _, ava, tyke’y, Vexa’i, Xakã, bicicleta, couve, ime, jeruxi, ka’aguy

Relations

NOUN nodes are attached to their parents using 15 different relations: obl (430; 34% instances), nsubj (391; 31% instances), obj (243; 19% instances), nmod (136; 11% instances), conj (20; 2% instances), parataxis:rep (15; 1% instances), compound (14; 1% instances), appos (9; 1% instances), vocative (7; 1% instances), acl (3; 0% instances), dislocated (3; 0% instances), parataxis (3; 0% instances), obl:sentcon (2; 0% instances), ccomp (1; 0% instances), root (1; 0% instances)

Parents of NOUN nodes belong to 5 different parts of speech: VERB (1089; 85% instances), NOUN (184; 14% instances), PRON (3; 0% instances), ADV (1; 0% instances), (1; 0% instances)

484 (38%) NOUN nodes are leaves.

494 (39%) NOUN nodes have one child.

204 (16%) NOUN nodes have two children.

96 (8%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.

Children of NOUN nodes are attached using 22 different relations: case (394; 32% instances), dep:mod (283; 23% instances), nmod (164; 13% instances), nummod (85; 7% instances), mark (56; 4% instances), acl (55; 4% instances), punct (54; 4% instances), amod (30; 2% instances), det (28; 2% instances), compound (22; 2% instances), conj (18; 1% instances), appos (17; 1% instances), cc (13; 1% instances), parataxis (7; 1% instances), advcl (4; 0% instances), flat (4; 0% instances), nsubj (4; 0% instances), obl (3; 0% instances), obl:sentcon (2; 0% instances), advmod (1; 0% instances), aux (1; 0% instances), compound:svc (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: ADP (394; 32% instances), PART (283; 23% instances), NOUN (184; 15% instances), NUM (86; 7% instances), VERB (71; 6% instances), SCONJ (56; 4% instances), PUNCT (54; 4% instances), PRON (46; 4% instances), ADJ (30; 2% instances), CCONJ (13; 1% instances), DET (13; 1% instances), PROPN (10; 1% instances), INTJ (4; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)