This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: ADP

There are 41 ADP lemmas (0%), 44 ADP types (0%) and 20078 ADP tokens (14%). Out of 16 observed tags, the rank of ADP is: 8 in number of lemmas, 10 in number of types and 2 in number of tokens.
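As a rough illustration of how counts like these can be obtained, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The function name `upos_counts` and the toy `SAMPLE` data are invented for this example, and forms are compared exactly as written; the script that generated this page may normalize forms differently.

```python
def upos_counts(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            lemmas.add(cols[2])  # column 3 is LEMMA
            types.add(cols[1])   # column 2 is FORM
    return len(lemmas), len(types), tokens


# Toy data (hypothetical, not taken from the treebank): form, lemma, UPOS.
rows = [
    ("Dupa", "după", "ADP"),
    ("masă", "masă", "NOUN"),
    ("merg", "merge", "VERB"),
    ("în", "în", "ADP"),
    ("parc", "parc", "NOUN"),
    ("după", "după", "ADP"),
    ("ploaie", "ploaie", "NOUN"),
]
SAMPLE = "\n".join(
    "\t".join([str(i), form, lemma, upos, "_", "_", "_", "_", "_", "_"])
    for i, (form, lemma, upos) in enumerate(rows, 1)
)

print(upos_counts(SAMPLE, "ADP"))
```

In the toy data the lemma "după" appears with two spellings, so the type count exceeds the lemma count, which is the same effect reported for ADP on this page.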

The 10 most frequent ADP lemmas: de, în, la, cu, din, pentru, prin, pe, dintre, după

The 10 most frequent ADP types: de, în, la, cu, din, pentru, prin, pe, dintre, după

The 10 most frequent ambiguous lemmas: de (ADP 6722, X 2), în (ADP 3627, NOUN 2), pentru (ADP 797, VERB 1), fără (ADP 135, SCONJ 3), sub (ADP 126, X 2, ADV 1), peste (ADP 125, ADV 32), până (ADP 92, SCONJ 28), versus (ADP 36, ADV 3, PROPN 2, X 1), drept (ADJ 23, ADP 20, NOUN 2, ADV 1), a (PART 302, DET 13, ADP 12, NOUN 8, X 4)

The 10 most frequent ambiguous types: de (ADP 6605, ADV 1, NOUN 1, X 1), în (ADP 3145, NOUN 1), pentru (ADP 747, VERB 1), fără (ADP 135, SCONJ 3), sub (ADP 124, X 2, ADV 1), peste (ADP 123, ADV 32), până (ADP 86, SCONJ 27), versus (ADP 36, ADV 3, PROPN 2, X 1), drept (ADP 19, ADJ 12, ADV 1), a (DET 1826, AUX 793, PART 302, ADP 12, NOUN 8, X 5)

Morphology

The form / lemma ratio of ADP is 1.073171 (the average of all parts of speech is 1.666637).
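The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas given at the top of the page. A minimal check, assuming that is how the figure is derived:

```python
adp_types = 44   # ADP types reported on this page
adp_lemmas = 41  # ADP lemmas reported on this page

ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")  # 1.073171
```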

Three lemmas share the highest number of forms (2): “de” (de, de-), “după” (Dupa, după) and “întru” (într, într-).

ADP occurs with 4 features: AdpType (20075; 100% instances), Case (20075; 100% instances), Variant (157; 1% instances), Abbr (3; 0% instances)

ADP occurs with 6 feature-value pairs: Abbr=Yes, AdpType=Prep, Case=Acc, Case=Dat, Case=Gen, Variant=Short

ADP occurs with 5 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Acc (19617 tokens). Examples: de, în, la, cu, din, pentru, prin, pe, dintre, după
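The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file. The sketch below shows one way to do so; `parse_feats` and the `feats_column` list are hypothetical names with made-up data, and treating the empty value `_` as its own combination is an assumption about how this page counts.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string such as 'AdpType=Prep|Case=Acc' into a dict."""
    if feats == "_":
        return {}  # "_" means the token carries no features
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values for a handful of ADP tokens.
feats_column = [
    "AdpType=Prep|Case=Acc",
    "AdpType=Prep|Case=Acc",
    "AdpType=Prep|Case=Gen",
    "AdpType=Prep|Case=Acc|Variant=Short",
    "_",
]

features = Counter()  # e.g. Case
pairs = Counter()     # e.g. Case=Acc
combos = Counter()    # the full FEATS string

for feats in feats_column:
    combos[feats] += 1
    for name, value in parse_feats(feats).items():
        features[name] += 1
        pairs[f"{name}={value}"] += 1

print(len(features), "features;", len(pairs), "feature-value pairs;", len(combos), "combinations")
print("most frequent combination:", combos.most_common(1)[0])
```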

Relations

ADP nodes are attached to their parents using 18 different relations: case (18099; 90% instances), fixed (824; 4% instances), advmod (669; 3% instances), mark (408; 2% instances), amod (34; 0% instances), conj (16; 0% instances), obl (8; 0% instances), nmod (7; 0% instances), appos (3; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), dep (1; 0% instances), det (1; 0% instances), flat (1; 0% instances), obj (1; 0% instances), root (1; 0% instances)

Parents of ADP nodes belong to 13 different parts of speech: NOUN (16537; 82% instances), NUM (868; 4% instances), VERB (862; 4% instances), ADP (550; 3% instances), PRON (510; 3% instances), ADV (294; 1% instances), PROPN (152; 1% instances), ADJ (144; 1% instances), X (118; 1% instances), SCONJ (27; 0% instances), DET (13; 0% instances), CCONJ (2; 0% instances), [no part of speech; presumably the artificial root] (1; 0% instances)

18385 (92%) ADP nodes are leaves.

1040 (5%) ADP nodes have one child.

462 (2%) ADP nodes have two children.

191 (1%) ADP nodes have three or more children.

The highest child degree of an ADP node is 8.
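The child-degree figures can be cross-checked against the ADP token total: the four buckets should add up to 20078, and each percentage should follow from rounding to the nearest whole number. A small sketch, assuming plain rounding is what the page uses:

```python
adp_tokens = 20078  # total ADP tokens reported on this page

child_degree_counts = {
    "leaves": 18385,
    "one child": 1040,
    "two children": 462,
    "three or more children": 191,
}

# The buckets are mutually exclusive, so they should cover every ADP token.
total = sum(child_degree_counts.values())
print("buckets cover all tokens:", total == adp_tokens)

# Percentage of ADP tokens in each bucket, rounded to a whole number.
percentages = {
    label: round(100 * count / adp_tokens)
    for label, count in child_degree_counts.items()
}
for label, pct in percentages.items():
    print(f"{label}: {pct}%")
```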

Children of ADP nodes are attached using 15 different relations: fixed (2178; 84% instances), punct (332; 13% instances), conj (21; 1% instances), cc (20; 1% instances), advmod (18; 1% instances), nummod (16; 1% instances), amod (3; 0% instances), cop (3; 0% instances), mark (3; 0% instances), nsubj (3; 0% instances), nmod (2; 0% instances), obj (2; 0% instances), appos (1; 0% instances), iobj (1; 0% instances), obl (1; 0% instances)

Children of ADP nodes belong to 13 different parts of speech: NOUN (919; 35% instances), ADP (550; 21% instances), PUNCT (332; 13% instances), PRON (299; 11% instances), ADV (227; 9% instances), ADJ (84; 3% instances), VERB (70; 3% instances), DET (39; 1% instances), NUM (32; 1% instances), CCONJ (29; 1% instances), SCONJ (14; 1% instances), PART (6; 0% instances), AUX (3; 0% instances)