home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-IcePaHC: POS Tags: X

There are 1195 X lemmas (3%), 1220 X types (2%) and 2275 X tokens (0%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: anno, dominus, item, in, sankti, et, majst, trankival, etc, darius

The 10 most frequent X types: anno, item, in, domini, et, Dominus, Majst, Trankival, sankti, etc

The 10 most frequent ambiguous lemmas: anno (X 142, NOUN 17, PROPN 5), dominus (X 78, PROPN 42), item (X 61, ADV 3), in (X 53, ADV 1), sankti (PROPN 55, X 40, ADJ 2, NOUN 2), et (X 30, CCONJ 1, NOUN 1, SCONJ 1), majst (X 25, PROPN 12), trankival (X 25, PROPN 16), darius (PROPN 105, X 15, ADJ 2, NOUN 1), ektor (PROPN 48, X 14, NOUN 2)

The 10 most frequent ambiguous types: anno (NOUN 5, X 5), item (X 24, ADV 3), in (X 50, DET 38, ADV 1, NOUN 1), domini (X 9, PROPN 1), et (X 24, VERB 3, CCONJ 1, NOUN 1, SCONJ 1), Dominus (PROPN 30, X 28), Majst (X 25, PROPN 12), Trankival (X 25, PROPN 13), sankti (PROPN 48, X 23), sanktus (PROPN 15, X 14, ADJ 2)

Morphology

The form / lemma ratio of X is 1.020921 (the average of all parts of speech is 1.842490).

The 1st highest number of forms (8) was observed with the lemma “kristur”: Christi, Christum, Christus, Kristo, Kristum, Kristus, Kristí, kristi.

The 2nd highest number of forms (6) was observed with the lemma “jesús”: Iesu, Iesus, Jesu, Jesus, Jesú, Jesúm.

The 3rd highest number of forms (5) was observed with the lemma “dominus”: Domini, Domino, Dominus, domine, dominum.

X occurs with 13 features: Foreign (1542; 68% instances), Number (634; 28% instances), Case (614; 27% instances), Definite (596; 26% instances), Gender (574; 25% instances), Degree (56; 2% instances), VerbForm (29; 1% instances), Voice (29; 1% instances), Person (21; 1% instances), Mood (20; 1% instances), Tense (20; 1% instances), NumType (6; 0% instances), PronType (6; 0% instances)

X occurs with 29 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Ind, Mood=Sub, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=3, PronType=Ind, PronType=Prs, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid

X occurs with 79 feature combinations. The most frequent feature combination is Foreign=Yes (1542 tokens). Examples: anno, in, item, domini, et, Dominus, etc, de, Achior, corpus

Relations

X nodes are attached to their parents using 17 different relations: flat:foreign (941; 41% instances), dep (458; 20% instances), obl (299; 13% instances), root (99; 4% instances), nmod:poss (96; 4% instances), nsubj (87; 4% instances), appos (85; 4% instances), conj (70; 3% instances), obj (53; 2% instances), xcomp (36; 2% instances), ccomp (14; 1% instances), amod (11; 0% instances), acl:relcl (9; 0% instances), iobj (7; 0% instances), advcl (6; 0% instances), acl (2; 0% instances), vocative (2; 0% instances)

Parents of X nodes belong to 14 different parts of speech: VERB (851; 37% instances), X (779; 34% instances), NOUN (334; 15% instances), (99; 4% instances), PROPN (74; 3% instances), PRON (31; 1% instances), ADJ (26; 1% instances), DET (25; 1% instances), ADV (13; 1% instances), AUX (13; 1% instances), NUM (13; 1% instances), CCONJ (11; 0% instances), ADP (5; 0% instances), PART (1; 0% instances)

1022 (45%) X nodes are leaves.

736 (32%) X nodes have one child.

259 (11%) X nodes have two children.

258 (11%) X nodes have three or more children.

The highest child degree of a X node is 17.

Children of X nodes are attached using 26 different relations: flat:foreign (793; 32% instances), punct (522; 21% instances), dep (209; 8% instances), nummod (149; 6% instances), conj (129; 5% instances), appos (102; 4% instances), amod (88; 4% instances), cc (82; 3% instances), nmod:poss (59; 2% instances), case (55; 2% instances), det (55; 2% instances), obl (55; 2% instances), acl:relcl (34; 1% instances), cop (28; 1% instances), advmod (24; 1% instances), nsubj (21; 1% instances), mark (17; 1% instances), parataxis (9; 0% instances), xcomp (8; 0% instances), ccomp (6; 0% instances), nmod (4; 0% instances), acl (3; 0% instances), advcl (3; 0% instances), compound:prt (3; 0% instances), discourse (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (779; 32% instances), PUNCT (522; 21% instances), ADP (244; 10% instances), NOUN (226; 9% instances), NUM (155; 6% instances), ADJ (93; 4% instances), PROPN (85; 3% instances), CCONJ (82; 3% instances), VERB (77; 3% instances), DET (62; 3% instances), PRON (58; 2% instances), AUX (31; 1% instances), ADV (28; 1% instances), SCONJ (17; 1% instances), INTJ (1; 0% instances)