Treebank Statistics: UD_Dutch-LassySmall: POS Tags: NUM
There are 583 NUM
lemmas (4%), 596 NUM
types (4%) and 3400 NUM
tokens (3%).
Out of 16 observed tags, the rank of NUM
is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.
The 10 most frequent NUM
lemmas: één, twee, 2004, 2006, 2005, 1, 2003, drie, 2, 2002
The 10 most frequent NUM
types: twee, 2004, 2006, 2005, 1, 2003, één, drie, 2, een
The 10 most frequent ambiguous lemmas: één (ADJ 225, NUM 128, PROPN 6), twee (NUM 101, ADJ 45), 1 (NUM 84, ADJ 5, PROPN 1), drie (NUM 62, ADJ 21), 2 (NUM 55, ADJ 6, PROPN 1, SYM 1), 3 (NUM 37, ADJ 8), 2000 (NUM 36, PROPN 1), 5 (NUM 36, ADJ 1), 4 (NUM 35, ADJ 2), vijf (NUM 28, ADJ 4)
The 10 most frequent ambiguous types: 1 (NUM 84, PROPN 1), één (NUM 72, PROPN 6), 2 (NUM 55, PROPN 1, SYM 1), een (DET 1598, NUM 45), 2000 (NUM 36, PROPN 1), 10 (NUM 27, PROPN 1), 20 (NUM 25, SYM 1), 7 (NUM 24, SYM 1), vier (NUM 21, VERB 1), 8 (NUM 23, SYM 1)
- 1
- één
- 2
- NUM 55: 2 bestuurlijke arrondissementen : Halle-Vilvoorde , Leuven
- PROPN 1: In 2006 is Urbanus ook te zien op de Vlaamse zender één en het Nederlandse Nederland 2 met een real-life-soap genaamd Urbain .
- SYM 1: Door een actieve ledenzorg stijgt dit stelselmatig bij elke verkiezing tot een niveau van 2 à 3.000 omstreeks 1989 ( in 2005 is dat ongeveer 6.500 ) .
- een
- 2000
- 10
- 20
- 7
- vier
- 8
Morphology
The form / lemma ratio of NUM
is 1.022298 (the average of all parts of speech is 1.168496).
The 1st highest number of forms (4) was observed with the lemma “één”: Eén, een, eentje, één.
The 2nd highest number of forms (2) was observed with the lemma “150”: 125-150, 150.
The 3rd highest number of forms (2) was observed with the lemma “1975”: 1955-1975, 1975.
NUM
does not occur with any features.
Relations
NUM
nodes are attached to their parents using 20 different relations: nummod (1062; 31% instances), obl (650; 19% instances), root (536; 16% instances), nmod (320; 9% instances), flat (250; 7% instances), parataxis (175; 5% instances), appos (131; 4% instances), conj (118; 3% instances), acl (62; 2% instances), fixed (30; 1% instances), nsubj (22; 1% instances), advcl (11; 0% instances), orphan (7; 0% instances), obj (6; 0% instances), acl:relcl (5; 0% instances), det (5; 0% instances), amod (4; 0% instances), xcomp (3; 0% instances), nsubj:pass (2; 0% instances), ccomp (1; 0% instances)
Parents of NUM
nodes belong to 13 different parts of speech: NOUN (1191; 35% instances), VERB (672; 20% instances), (536; 16% instances), PROPN (391; 12% instances), NUM (347; 10% instances), SYM (100; 3% instances), DET (67; 2% instances), ADJ (56; 2% instances), X (17; 1% instances), ADP (9; 0% instances), PRON (7; 0% instances), ADV (6; 0% instances), INTJ (1; 0% instances)
1212 (36%) NUM
nodes are leaves.
1241 (37%) NUM
nodes have one child.
400 (12%) NUM
nodes have two children.
547 (16%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 8.
Children of NUM
nodes are attached using 23 different relations: punct (1273; 33% instances), case (953; 24% instances), parataxis (572; 15% instances), flat (512; 13% instances), nmod (159; 4% instances), conj (102; 3% instances), cc (63; 2% instances), amod (55; 1% instances), cop (49; 1% instances), nsubj (44; 1% instances), fixed (32; 1% instances), mark (14; 0% instances), advmod (13; 0% instances), appos (11; 0% instances), det (10; 0% instances), obl (10; 0% instances), acl:relcl (9; 0% instances), acl (3; 0% instances), advcl (2; 0% instances), cc:preconj (2; 0% instances), orphan (2; 0% instances), nmod:poss (1; 0% instances), nummod (1; 0% instances)
Children of NUM
nodes belong to 15 different parts of speech: PUNCT (1273; 33% instances), ADP (957; 25% instances), PROPN (742; 19% instances), NUM (347; 9% instances), NOUN (264; 7% instances), CCONJ (71; 2% instances), AUX (49; 1% instances), ADV (39; 1% instances), PRON (39; 1% instances), ADJ (26; 1% instances), DET (22; 1% instances), VERB (22; 1% instances), X (18; 0% instances), SCONJ (12; 0% instances), SYM (11; 0% instances)