Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X
There are 120 X
lemmas (13%), 123 X
types (8%) and 217 X
tokens (3%).
Out of 16 observed tags, the rank of X
is: 3 in number of lemmas, 3 in number of types and 11 in number of tokens.
The 10 most frequent X
lemmas: nan, XX, shi, ba, a, kafin, OK, ke, tunda, wannan
The 10 most frequent X
types: nan, XX, shi, ba, a, kafin, OK, ke, tunda, wannan
The 10 most frequent ambiguous lemmas: XX (X 8, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 4, SCONJ 1), wannan (X 4, DET 1), lóːkaʧí (X 3, NOUN 1), wéy (PART 29, X 3), dàː (PART 7, X 2, NOUN 1), káɗá (X 2, PART 1), ɗa (ADP 24, PART 11, SCONJ 4, X 2, ADV 1), dóːlêː (ADV 5, X 1), faːrá (VERB 2, X 1)
The 10 most frequent ambiguous types: XX (X 8, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 4, SCONJ 1), wannan (X 4, DET 1), wéy (PART 29, X 3), dàː (PART 7, X 2, NOUN 1), káɗá (X 2, PART 1), ɗa (ADP 20, PART 9, SCONJ 9, ADV 3, X 2), gàː (VERB 10, X 1), ka (AUX 24, X 1), ki (PRON 1, X 1)
- XX
- tunda
- wannan
- wéy
- dàː
- káɗá
- ɗa
- gàː
- ka
- ki
Morphology
The form / lemma ratio of X
is 1.025000 (the average of all parts of speech is 1.640000).
The 1st highest number of forms (2) was observed with the lemma “X”: ki, kira.
The 2nd highest number of forms (2) was observed with the lemma “dàːmuwa”: dàːmuwa, dàːmuwá.
The 3rd highest number of forms (2) was observed with the lemma “ya”: ya, yáː.
X
occurs with 1 features: Foreign (199; 92% instances)
X
occurs with 1 feature-value pairs: Foreign=Yes
X
occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes
(199 tokens).
Examples: nan, shi, ba, a, kafin, OK, ke, tunda, wannan, ɗaya
Relations
X
nodes are attached to their parents using 22 different relations: flat:foreign (74; 34% instances), root (29; 13% instances), obl (25; 12% instances), discourse (14; 6% instances), obj (13; 6% instances), dep (10; 5% instances), nmod (10; 5% instances), reparandum (9; 4% instances), nsubj (6; 3% instances), parataxis (6; 3% instances), dislocated (5; 2% instances), xcomp (3; 1% instances), advcl (2; 1% instances), appos (2; 1% instances), conj (2; 1% instances), cc (1; 0% instances), cc:preconj (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), flat:name (1; 0% instances), iobj (1; 0% instances), vocative (1; 0% instances)
Parents of X
nodes belong to 12 different parts of speech: X (99; 46% instances), VERB (56; 26% instances), (29; 13% instances), AUX (9; 4% instances), PART (8; 4% instances), NOUN (6; 3% instances), PROPN (3; 1% instances), INTJ (2; 1% instances), NUM (2; 1% instances), ADV (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances)
119 (55%) X
nodes are leaves.
28 (13%) X
nodes have one child.
31 (14%) X
nodes have two children.
39 (18%) X
nodes have three or more children.
The highest child degree of a X
node is 7.
Children of X
nodes are attached using 26 different relations: flat:foreign (81; 31% instances), punct (67; 26% instances), discourse (28; 11% instances), nmod (10; 4% instances), case (8; 3% instances), acl (7; 3% instances), ccomp (6; 2% instances), advmod (5; 2% instances), parataxis (5; 2% instances), reparandum (5; 2% instances), aux (4; 2% instances), nmod:poss (4; 2% instances), obj (4; 2% instances), dep (3; 1% instances), det (3; 1% instances), nsubj (3; 1% instances), xcomp (3; 1% instances), appos (2; 1% instances), dislocated (2; 1% instances), obl (2; 1% instances), acl:relcl (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), flat:name (1; 0% instances), mark (1; 0% instances), obl:arg (1; 0% instances)
Children of X
nodes belong to 14 different parts of speech: X (99; 38% instances), PUNCT (67; 26% instances), PART (18; 7% instances), VERB (16; 6% instances), INTJ (11; 4% instances), PROPN (11; 4% instances), NOUN (10; 4% instances), PRON (6; 2% instances), ADP (5; 2% instances), AUX (4; 2% instances), SCONJ (4; 2% instances), ADV (3; 1% instances), DET (3; 1% instances), CCONJ (1; 0% instances)