home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Armenian-CAVaL: POS Tags: NUM

There are 17 NUM lemmas (1%), 28 NUM types (1%) and 72 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: երկու, մի, երկոտասան, երեք, հինգ, երկոքին, եւթն, յիսուն, եւթանասուն, ութ

The 10 most frequent NUM types: երկուս, մի, հինգ, երիս, երկոքին, երկոտասան, երկու, եւթն, յիսուն, երից

The 10 most frequent ambiguous lemmas: մի (PART 61, DET 44, NUM 11)

The 10 most frequent ambiguous types: մի (PART 57, DET 40, NUM 8), միոյ (DET 1, NUM 1), միում (DET 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.647059 (the average of all parts of speech is 1.960180).

The 1st highest number of forms (4) was observed with the lemma “երկոտասան”: երկոտասան, երկոտասանից, երկոտասանս, երկոտասանք.

The 2nd highest number of forms (4) was observed with the lemma “երկու”: երկու, երկուս, երկուց, երկուք.

The 3rd highest number of forms (4) was observed with the lemma “մի”: մի, միոյ, միով, միում.

NUM occurs with 3 features: NumType (72; 100% instances), Case (70; 97% instances), Number (70; 97% instances)

NUM occurs with 10 feature-value pairs: Case=Abl, Case=Acc, Case=Gen, Case=Ins, Case=Loc, Case=Nom, NumType=Card, NumType=Sets, Number=Plur, Number=Sing

NUM occurs with 11 feature combinations. The most frequent feature combination is Case=Acc|Number=Plur|NumType=Card (25 tokens). Examples: երկուս, երիս, հինգ, յիսուն, երկոտասան, երկոտասանս, վեց, քառասուն

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (42; 58% instances), nsubj (9; 13% instances), conj (8; 11% instances), orphan (3; 4% instances), compound (2; 3% instances), compound:redup (2; 3% instances), obj (2; 3% instances), obl (2; 3% instances), advcl (1; 1% instances), appos (1; 1% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (37; 51% instances), VERB (19; 26% instances), NUM (9; 13% instances), ADJ (4; 6% instances), PRON (3; 4% instances)

41 (57%) NUM nodes are leaves.

17 (24%) NUM nodes have one child.

8 (11%) NUM nodes have two children.

6 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 16 different relations: det (8; 15% instances), conj (7; 13% instances), orphan (7; 13% instances), case (6; 11% instances), cc (6; 11% instances), punct (6; 11% instances), advmod (2; 4% instances), compound (2; 4% instances), compound:redup (2; 4% instances), nmod (2; 4% instances), obl (2; 4% instances), acl (1; 2% instances), appos (1; 2% instances), cop (1; 2% instances), mark (1; 2% instances), nsubj (1; 2% instances)

Children of NUM nodes belong to 13 different parts of speech: DET (9; 16% instances), NUM (9; 16% instances), CCONJ (7; 13% instances), PUNCT (6; 11% instances), ADV (4; 7% instances), VERB (4; 7% instances), ADP (3; 5% instances), NOUN (3; 5% instances), PRON (3; 5% instances), PROPN (3; 5% instances), SCONJ (2; 4% instances), ADJ (1; 2% instances), AUX (1; 2% instances)