Statistics of X in UD_Zaar-Autogramm

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: `X`

There are 120 X lemmas (13%), 123 X types (8%) and 217 X tokens (3%). Out of 16 observed tags, the rank of X is: 3 in number of lemmas, 3 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: nan, XX, shi, ba, a, kafin, OK, ke, tunda, wannan

The 10 most frequent X types: nan, XX, shi, ba, a, kafin, OK, ke, tunda, wannan

The 10 most frequent ambiguous lemmas: XX (X 8, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 4, SCONJ 1), wannan (X 4, DET 1), lóːkaʧí (X 3, NOUN 1), wéy (PART 29, X 3), dàː (PART 7, X 2, NOUN 1), káɗá (X 2, PART 1), ɗa (ADP 24, PART 11, SCONJ 4, X 2, ADV 1), dóːlêː (ADV 5, X 1), faːrá (VERB 2, X 1)

The 10 most frequent ambiguous types: XX (X 8, ADV 2, DET 1, INTJ 1, PART 1, VERB 1), tunda (X 4, SCONJ 1), wannan (X 4, DET 1), wéy (PART 29, X 3), dàː (PART 7, X 2, NOUN 1), káɗá (X 2, PART 1), ɗa (ADP 20, PART 9, SCONJ 9, ADV 3, X 2), gàː (VERB 10, X 1), ka (AUX 24, X 1), ki (PRON 1, X 1)

XX
- X 8: but XX &//
- ADV 2: XX &//
- DET 1: XX //
- INTJ 1: XX &//
- PART 1: XX &//
- VERB 1: XX &//
tunda
- X 4: tunda Zəgì àː kap gə̀ɗíː kàm < ma kap gə̀ɗíː //
- SCONJ 1: tunda káː ngúp wúlɣə̂n vìː tə́ yáːníː káwây < to shi ke nan //
wannan
- X 4: wéy wannan Lim ne ?//
- DET 1: dzàn gíː mə̀tàyiɣá ə̀ː táːɗi èː gyaː mókʃi =wòpm < tə́ gyaː m̀ː wannan Làːdí |a ŋaː mə́n Bawʧí ɗa átâ mâníː //
wéy
- PART 29: wéy á < éy yâːn tá fî ni maːndə tə́ kúnê =tn < bâː dàːmuwa //
- X 3: tə́ ʒìɗ =ə̀m tu wéy gàː || wéy kú# //&
dàː
- PART 7: bàː ʧík hŋ́ < dàː mìː və̀r =tə̀ //
- X 2: kúmá yâːn míyí tu ɬəɣə̂níː yáːwón < dàː || dàː ʧíyí mòɓ òː > gáskíyáː //
- NOUN 1: yâːn nə ŋaː mwâːn < dàː gòs wù tu káɗá tə̀ fí ngə́tn wón ə̌n //
káɗá
- X 2: yâːn nə ŋaː mwâːn < dàː gòs wù tu káɗá tə̀ fí ngə́tn wón ə̌n //
- PART 1: káɗá mə̀ tsə̌tn də̀n ɗan kapkə́ dzàŋ Kímsə !//
ɗa
- ADP 20: gáːrá mə̀ ngeláŋ áː sòːséy mə̀ yè ngə́r wón ɗan ʧìɣá ríːngə̂n ɗa ɗə́ki //
- PART 9: yâːn hali ɗa kàm < má ɗìːɓíː //
- SCONJ 9: Ànês < ɗa áyǎː ɓan wul tu ngə̌tn hawʃi < əndá yi ɓan fi //
- ADV 3: á wul tu yáːwón wò ɬə́ ɗu =ʃí wáya mə́n ɗa //
- X 2: ɗa áː tə́ ɗa èː &//
gàː
- VERB 10: tóː < gìːr =wàːsə̀n gəní < àː gàː =ʃí báː ʒà hŋ́ oː //
- X 1: tə́ ʒìɗ =ə̀m tu wéy gàː || wéy kú# //&
ka
- AUX 24: ka haɗá lûː tə́ sə̀kéːɗíː máː //
- X 1: ka san daga èː ə̀ːm dàtə̂pm Gèʤì |c tə́ || tə́ || tə́ gìp Bawʧi < bàː á fǔpm sòːséy ǐn //
ki
- PRON 1: tòː < ki tə́ ngə́tn hŋ́ oː < tá níː ɬə́ːr =ɣə nə́ &//
- X 1: ìdán || yâːn ki &//

Morphology

The form / lemma ratio of X is 1.025000 (the average of all parts of speech is 1.640000).

The 1st highest number of forms (2) was observed with the lemma “X”: ki, kira.

The 2nd highest number of forms (2) was observed with the lemma “dàːmuwa”: dàːmuwa, dàːmuwá.

The 3rd highest number of forms (2) was observed with the lemma “ya”: ya, yáː.

X occurs with 1 features: Foreign (199; 92% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (199 tokens). Examples: nan, shi, ba, a, kafin, OK, ke, tunda, wannan, ɗaya

Relations

X nodes are attached to their parents using 22 different relations: flat:foreign (74; 34% instances), root (29; 13% instances), obl (25; 12% instances), discourse (14; 6% instances), obj (13; 6% instances), dep (10; 5% instances), nmod (10; 5% instances), reparandum (9; 4% instances), nsubj (6; 3% instances), parataxis (6; 3% instances), dislocated (5; 2% instances), xcomp (3; 1% instances), advcl (2; 1% instances), appos (2; 1% instances), conj (2; 1% instances), cc (1; 0% instances), cc:preconj (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), flat:name (1; 0% instances), iobj (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: X (99; 46% instances), VERB (56; 26% instances), (29; 13% instances), AUX (9; 4% instances), PART (8; 4% instances), NOUN (6; 3% instances), PROPN (3; 1% instances), INTJ (2; 1% instances), NUM (2; 1% instances), ADV (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances)

119 (55%) X nodes are leaves.

28 (13%) X nodes have one child.

31 (14%) X nodes have two children.

39 (18%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 26 different relations: flat:foreign (81; 31% instances), punct (67; 26% instances), discourse (28; 11% instances), nmod (10; 4% instances), case (8; 3% instances), acl (7; 3% instances), ccomp (6; 2% instances), advmod (5; 2% instances), parataxis (5; 2% instances), reparandum (5; 2% instances), aux (4; 2% instances), nmod:poss (4; 2% instances), obj (4; 2% instances), dep (3; 1% instances), det (3; 1% instances), nsubj (3; 1% instances), xcomp (3; 1% instances), appos (2; 1% instances), dislocated (2; 1% instances), obl (2; 1% instances), acl:relcl (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), flat:name (1; 0% instances), mark (1; 0% instances), obl:arg (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: X (99; 38% instances), PUNCT (67; 26% instances), PART (18; 7% instances), VERB (16; 6% instances), INTJ (11; 4% instances), PROPN (11; 4% instances), NOUN (10; 4% instances), PRON (6; 2% instances), ADP (5; 2% instances), AUX (4; 2% instances), SCONJ (4; 2% instances), ADV (3; 1% instances), DET (3; 1% instances), CCONJ (1; 0% instances)

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X

Morphology

Relations

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: `X`