home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Poetry: POS Tags: X

There are 47 X lemmas (0%), 47 X types (0%) and 55 X tokens (0%). Out of 17 observed tags, the rank of X is: 8 in number of lemmas, 10 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: е, и, сань, цю, Dànse, EIFEL, FANDANGO, IMPROMPTU, In, Jesekiila

The 10 most frequent X types: е, и, сань, цю, Dànse, EIFEL, FANDANGO, IMPROMPTU, In, Jesekiila

The 10 most frequent ambiguous lemmas: и (CCONJ 2273, PART 96, X 3), ни (CCONJ 64, PART 35, X 1), се (PART 1, X 1)

The 10 most frequent ambiguous types: и (CCONJ 1319, PART 96, X 1), Во (ADP 10, X 1), ни (CCONJ 42, PART 28, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.831021).

The 1st highest number of forms (1) was observed with the lemma “Dànse”: Dànse.

The 2nd highest number of forms (1) was observed with the lemma “EIFEL”: EIFEL.

The 3rd highest number of forms (1) was observed with the lemma “FANDANGO”: FANDANGO.

X occurs with 1 features: Foreign (53; 96% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (53 tokens). Examples: е, и, сань, цю, Dànse, EIFEL, FANDANGO, IMPROMPTU, In, Jesekiila

Relations

X nodes are attached to their parents using 10 different relations: flat:foreign (34; 62% instances), root (10; 18% instances), appos (2; 4% instances), conj (2; 4% instances), parataxis (2; 4% instances), dep (1; 2% instances), goeswith (1; 2% instances), nsubj (1; 2% instances), obj (1; 2% instances), obl (1; 2% instances)

Parents of X nodes belong to 5 different parts of speech: X (36; 65% instances), (10; 18% instances), VERB (5; 9% instances), ADV (2; 4% instances), NOUN (2; 4% instances)

31 (56%) X nodes are leaves.

11 (20%) X nodes have one child.

2 (4%) X nodes have two children.

11 (20%) X nodes have three or more children.

The highest child degree of a X node is 8.

Children of X nodes are attached using 7 different relations: flat:foreign (34; 51% instances), punct (26; 39% instances), conj (2; 3% instances), det (2; 3% instances), case (1; 1% instances), nmod (1; 1% instances), parataxis (1; 1% instances)

Children of X nodes belong to 5 different parts of speech: X (36; 54% instances), PUNCT (26; 39% instances), DET (2; 3% instances), NOUN (2; 3% instances), ADP (1; 1% instances)