home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Faroese-FarPaHC: POS Tags: NUM

There are 1 NUM lemmas (5%), 24 NUM types (1%) and 104 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: tveir, fýra, fjöruti, fimm, hundrað, tríggjar, tólv, seks, fimmti, trimum

The 10 most frequent ambiguous lemmas: _ (PRON 6053, VERB 5349, ADP 4320, NOUN 4132, CCONJ 2670, ADV 2369, AUX 2277, PROPN 1830, SCONJ 1778, DET 1598, PUNCT 1100, ADJ 741, PART 282, NUM 104, INTJ 25, X 3)

The 10 most frequent ambiguous types: hundrað (NUM 8, NOUN 1)

Morphology

The form / lemma ratio of NUM is 24.000000 (the average of all parts of speech is 168.681818).

The 1st highest number of forms (24) was observed with the lemma “_”: ellivu, fimm, fimmti, fimtan, fjöruti, fýra, hundrað, seks, sjey, sjúti, tjúgu, trimum, tríati, tríggir, tríggjar, trý, tveggja, tveimum, tveir, tvey, tvær, tólv, túsund, átta.

NUM occurs with 1 features: Case (104; 100% instances)

NUM occurs with 4 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom

NUM occurs with 4 feature combinations. The most frequent feature combination is Case=Acc (60 tokens). Examples: tveir, tríggjar, fjöruti, hundrað, fýra, tólv, fimm, seks, tríati, tvey

Relations

NUM nodes are attached to their parents using 4 different relations: nummod (89; 86% instances), obl (10; 10% instances), conj (3; 3% instances), obj (2; 2% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (70; 67% instances), NUM (17; 16% instances), VERB (8; 8% instances), DET (5; 5% instances), PRON (3; 3% instances), PROPN (1; 1% instances)

71 (68%) NUM nodes are leaves.

14 (13%) NUM nodes have one child.

11 (11%) NUM nodes have two children.

8 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 9 different relations: nummod (16; 25% instances), cc (11; 17% instances), advmod (8; 13% instances), punct (7; 11% instances), case (5; 8% instances), conj (5; 8% instances), det (4; 6% instances), obl (4; 6% instances), nmod (3; 5% instances)

Children of NUM nodes belong to 9 different parts of speech: NUM (17; 27% instances), CCONJ (11; 17% instances), ADV (8; 13% instances), NOUN (7; 11% instances), PUNCT (7; 11% instances), ADP (5; 8% instances), PRON (5; 8% instances), DET (2; 3% instances), PROPN (1; 2% instances)