
This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X

There are 120 X lemmas (13%), 123 X types (8%) and 217 X tokens (3%). Out of 16 observed tags, the rank of X is: 3 in number of lemmas, 3 in number of types and 11 in number of tokens.
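The three counts above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the toy rows are hypothetical, and it assumes that a "type" is a distinct surface form compared case-sensitively.

```python
def pos_counts(conllu_text, upos="X"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag.

    Assumption: types are distinct FORM values, lemmas are distinct
    LEMMA values, and tokens are all syntactic words with that tag.
    """
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


# Toy sentence (hypothetical, not taken from the treebank).
rows = [
    ["1", "OK", "OK", "X", "_", "Foreign=Yes", "0", "root", "_", "_"],
    ["2", "ya", "ya", "X", "_", "_", "1", "flat:foreign", "_", "_"],
    ["3", "yáː", "ya", "X", "_", "_", "1", "flat:foreign", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
]
sample = "# sent_id = toy\n" + "\n".join("\t".join(r) for r in rows)
n_lemmas, n_types, n_tokens = pos_counts(sample)
```

On the toy sentence this yields 2 lemmas, 3 types and 3 tokens for X; the PUNCT row is ignored.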

The 10 most frequent X lemmas: nan, XX, shi, ba, a, kafin, OK, ke, tunda, wannan

The 10 most frequent X types: nan, XX, shi, ba, a, kafin, OK, ke, tunda, wannan

The 10 most frequent ambiguous lemmas: XX (X 8, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 4, SCONJ 1), wannan (X 4, DET 1), lóːkaʧí (X 3, NOUN 1), wéy (PART 29, X 3), dàː (PART 7, X 2, NOUN 1), káɗá (X 2, PART 1), ɗa (ADP 24, PART 11, SCONJ 4, X 2, ADV 1), dóːlêː (ADV 5, X 1), faːrá (VERB 2, X 1)

The 10 most frequent ambiguous types: XX (X 8, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 4, SCONJ 1), wannan (X 4, DET 1), wéy (PART 29, X 3), dàː (PART 7, X 2, NOUN 1), káɗá (X 2, PART 1), ɗa (ADP 20, PART 9, SCONJ 9, ADV 3, X 2), gàː (VERB 10, X 1), ka (AUX 24, X 1), ki (PRON 1, X 1)

Morphology

The form / lemma ratio of X is 1.025000 (the average of all parts of speech is 1.640000).
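The reported ratio agrees with the counts given earlier on this page: 123 types divided by 120 lemmas is 1.025. A small sketch of that calculation follows, assuming the ratio is the number of distinct forms summed over lemmas, divided by the number of lemmas; the helper name `form_lemma_ratio` and the toy pairs are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) for all tokens with one UPOS tag."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_lemmas = len(forms_by_lemma)
    # Assumption: a form shared by two lemmas is counted once per lemma.
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / n_lemmas if n_lemmas else 0.0


# Toy data (hypothetical): one lemma with two forms, one with a single form.
toy_pairs = [("ya", "ya"), ("yáː", "ya"), ("nan", "nan"), ("nan", "nan")]
toy_ratio = form_lemma_ratio(toy_pairs)

# Cross-check against the counts reported for X on this page.
reported_ratio = 123 / 120
```

With 117 lemmas that have one form and 3 lemmas that have two forms, the sum is 123 forms over 120 lemmas, which matches the three two-form lemmas listed in this section.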

The 1st highest number of forms (2) was observed with the lemma “X”: ki, kira.

The 2nd highest number of forms (2) was observed with the lemma “dàːmuwa”: dàːmuwa, dàːmuwá.

The 3rd highest number of forms (2) was observed with the lemma “ya”: ya, yáː.

X occurs with 1 feature: Foreign (199; 92% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (199 tokens). Examples: nan, shi, ba, a, kafin, OK, ke, tunda, wannan, ɗaya

Relations

X nodes are attached to their parents using 22 different relations: flat:foreign (74; 34% instances), root (29; 13% instances), obl (25; 12% instances), discourse (14; 6% instances), obj (13; 6% instances), dep (10; 5% instances), nmod (10; 5% instances), reparandum (9; 4% instances), nsubj (6; 3% instances), parataxis (6; 3% instances), dislocated (5; 2% instances), xcomp (3; 1% instances), advcl (2; 1% instances), appos (2; 1% instances), conj (2; 1% instances), cc (1; 0% instances), cc:preconj (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), flat:name (1; 0% instances), iobj (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: X (99; 46% instances), VERB (56; 26% instances), [no parent part of speech; these match the 29 root attachments] (29; 13% instances), AUX (9; 4% instances), PART (8; 4% instances), NOUN (6; 3% instances), PROPN (3; 1% instances), INTJ (2; 1% instances), NUM (2; 1% instances), ADV (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances)

119 (55%) X nodes are leaves.

28 (13%) X nodes have one child.

31 (14%) X nodes have two children.

39 (18%) X nodes have three or more children.

The highest child degree of an X node is 7.
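The leaf and child-count figures above partition the 217 X tokens (119 + 28 + 31 + 39 = 217). The sketch below shows one way such a distribution could be computed from the ID, UPOS and HEAD columns of a sentence. The function name `child_degree_buckets` and the toy rows are hypothetical, and the bucket key 3 stands for "three or more children".

```python
from collections import Counter


def child_degree_buckets(rows, upos="X"):
    """rows: list of (node_id, upos, head_id) tuples for one sentence.

    Returns a Counter mapping 0, 1, 2 or 3 (meaning three or more)
    to the number of nodes with the given UPOS that have that many children.
    """
    n_children = Counter(head for _, _, head in rows)
    buckets = Counter()
    for node_id, tag, _ in rows:
        if tag == upos:
            buckets[min(n_children[node_id], 3)] += 1
    return buckets


# Toy sentence (hypothetical): node 1 is the root and has two children.
toy_rows = [(1, "X", 0), (2, "X", 1), (3, "PUNCT", 1), (4, "X", 2)]
toy_buckets = child_degree_buckets(toy_rows)

# Percentages for the counts reported on this page.
reported = {"leaf": 119, "one": 28, "two": 31, "three_plus": 39}
total = sum(reported.values())
percentages = {k: round(100 * v / total) for k, v in reported.items()}
```

Applied per sentence and summed over the treebank, the buckets would give the four counts listed above; the rounded percentages come out as 55, 13, 14 and 18.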

Children of X nodes are attached using 26 different relations: flat:foreign (81; 31% instances), punct (67; 26% instances), discourse (28; 11% instances), nmod (10; 4% instances), case (8; 3% instances), acl (7; 3% instances), ccomp (6; 2% instances), advmod (5; 2% instances), parataxis (5; 2% instances), reparandum (5; 2% instances), aux (4; 2% instances), nmod:poss (4; 2% instances), obj (4; 2% instances), dep (3; 1% instances), det (3; 1% instances), nsubj (3; 1% instances), xcomp (3; 1% instances), appos (2; 1% instances), dislocated (2; 1% instances), obl (2; 1% instances), acl:relcl (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), flat:name (1; 0% instances), mark (1; 0% instances), obl:arg (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: X (99; 38% instances), PUNCT (67; 26% instances), PART (18; 7% instances), VERB (16; 6% instances), INTJ (11; 4% instances), PROPN (11; 4% instances), NOUN (10; 4% instances), PRON (6; 2% instances), ADP (5; 2% instances), AUX (4; 2% instances), SCONJ (4; 2% instances), ADV (3; 1% instances), DET (3; 1% instances), CCONJ (1; 0% instances)