Treebank Statistics: UD_Estonian-EDT: POS Tags: CCONJ
There are 18 CCONJ
lemmas (0%), 21 CCONJ
types (0%) and 16087 CCONJ
tokens (4%).
Out of 16 observed tags, the rank of CCONJ
is: 14 in number of lemmas, 15 in number of types and 9 in number of tokens.
The 10 most frequent CCONJ
lemmas: ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent
The 10 most frequent CCONJ
types: ja, ning, või, aga, kuid, kui, ega, vaid, ehk, ent
The 10 most frequent ambiguous lemmas: või (CCONJ 1250, ADV 63, NOUN 3), aga (CCONJ 1015, ADV 708), kui (SCONJ 2652, CCONJ 348, ADV 205), ega (CCONJ 275, ADV 83), vaid (ADV 409, CCONJ 226), ehk (CCONJ 204, ADV 70), kuni (ADP 84, ADV 72, SCONJ 69, CCONJ 55), & (SYM 22, CCONJ 8), e (NOUN 20, CCONJ 5), nii (ADV 1214, CCONJ 4)
The 10 most frequent ambiguous types: või (CCONJ 1181, ADV 62, AUX 11, VERB 2, NOUN 1), aga (ADV 708, CCONJ 656), kuid (CCONJ 609, NOUN 4), kui (SCONJ 1927, CCONJ 348, ADV 165), ega (CCONJ 270, ADV 46), vaid (ADV 388, CCONJ 223), ehk (CCONJ 198, ADV 61), kuni (ADP 75, ADV 71, SCONJ 59, CCONJ 55), & (SYM 15, CCONJ 8), e (CCONJ 4, X 1)
- või
- CCONJ 1181: Uskuge või mitte , maalide autoriks on päris noor mees .
- ADV 62: “ Oli sihuke plevna , et oi-oi-joo , pane kas või metsa . ”
- AUX 11: Aga vanameistreid ei või kunagi teada .
- VERB 2: Mis Eesti olusid arvestades võivad või ei või sattuda tööandja ja/või kindlustusfirmade kätte .
- NOUN 1: Apteegid elavad nagu või sees .
- aga
- kuid
- kui
- ega
- vaid
- ehk
- kuni
- &
- e
Morphology
The form / lemma ratio of CCONJ
is 1.166667 (the average of all parts of speech is 1.914127).
The 1st highest number of forms (2) was observed with the lemma “aga”: A, aga.
The 2nd highest number of forms (2) was observed with the lemma “e”: e, e..
The 3rd highest number of forms (2) was observed with the lemma “ja”: -ja, ja.
CCONJ
occurs with 3 features: Polarity (274; 2% instances), Abbr (13; 0% instances), Foreign (1; 0% instances)
CCONJ
occurs with 3 feature-value pairs: Abbr=Yes
, Foreign=Yes
, Polarity=Neg
CCONJ
occurs with 4 feature combinations.
The most frequent feature combination is _
(15799 tokens).
Examples: ja, ning, või, aga, kuid, kui, vaid, ehk, ent, kuni
Relations
CCONJ
nodes are attached to their parents using 8 different relations: cc (16074; 100% instances), cc:preconj (4; 0% instances), fixed (2; 0% instances), mark (2; 0% instances), root (2; 0% instances), advmod (1; 0% instances), case (1; 0% instances), nsubj:cop (1; 0% instances)
Parents of CCONJ
nodes belong to 14 different parts of speech: NOUN (6557; 41% instances), VERB (5603; 35% instances), ADJ (1773; 11% instances), PROPN (1137; 7% instances), ADV (496; 3% instances), PRON (268; 2% instances), NUM (163; 1% instances), SYM (37; 0% instances), DET (21; 0% instances), X (11; 0% instances), ADP (9; 0% instances), INTJ (6; 0% instances), SCONJ (4; 0% instances), (2; 0% instances)
16083 (100%) CCONJ
nodes are leaves.
2 (0%) CCONJ
nodes have one child.
1 (0%) CCONJ
nodes have two children.
1 (0%) CCONJ
nodes have three or more children.
The highest child degree of a CCONJ
node is 5.
Children of CCONJ
nodes are attached using 4 different relations: punct (6; 67% instances), amod (1; 11% instances), appos (1; 11% instances), det (1; 11% instances)
Children of CCONJ
nodes belong to 4 different parts of speech: PUNCT (6; 67% instances), ADJ (1; 11% instances), DET (1; 11% instances), NOUN (1; 11% instances)