Treebank Statistics: UD_Icelandic-Modern: POS Tags: PROPN
There are 883 PROPN lemmas (14%), 1135 PROPN types (11%) and 2743 PROPN tokens (3%).
Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 4 in number of types and 12 in number of tokens.
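Counts and ranks of this kind can be derived from the FORM, LEMMA and UPOS columns of a CoNLL-U file. The following is a minimal sketch, not the script that produced this page; the sample rows and the helper names `tag_counts` and `rank` are hypothetical, and forms are compared case-sensitively as an assumption.

```python
from collections import defaultdict

# Tiny hypothetical CoNLL-U fragment (not taken from the treebank).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = (
    "1\tÍsland\tísland\tPROPN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tvann\tvinna\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\tÍslands\tísland\tPROPN\t_\t_\t2\tobl\t_\t_\n"
    "4\tleik\tleikur\tNOUN\t_\t_\t2\tobj\t_\t_\n"
)


def tag_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}


def rank(counts, tag, index):
    """Rank of `tag` (1 = highest) by counts[tag][index]."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1


counts = tag_counts(SAMPLE)
print(counts["PROPN"])           # (lemmas, types, tokens) for PROPN
print(rank(counts, "PROPN", 2))  # rank by number of tokens
```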
The 10 most frequent PROPN lemmas: ísland, þingmaður, ólympíuleikar, hrafnhildur, alþingi, rúv, ríó, em, íslendingur, evrópusamband
The 10 most frequent PROPN types: þm., Íslands, RÚV, Hrafnhildur, Ríó, EM, Ísland, Ólympíuleikunum, Alþingi, Íslandi
The 10 most frequent ambiguous lemmas: þingmaður (NOUN 299, PROPN 82, ADV 1), em (PROPN 32, X 1), evrópumót (PROPN 21, NOUN 1), london (PROPN 20, ADV 1), hm (PROPN 17, ADV 1), forseti (NOUN 409, PROPN 11), fram (ADP 223, PROPN 11, ADV 4), de (PROPN 10, ADV 1, X 1), helgi (PROPN 10, NOUN 7), stjarna (PROPN 9, NOUN 2)
The 10 most frequent ambiguous types: þm. (PROPN 80, NOUN 1), EM (PROPN 32, X 1), London (PROPN 20, ADV 1), HM (PROPN 17, ADV 1), Forseti (NOUN 132, PROPN 11), Fram (PROPN 11, ADP 1), de (PROPN 10, ADV 1, X 1), Millar (PROPN 6, ADV 1), Djokovic (PROPN 5, ADV 1), pírata (NOUN 2, PROPN 1)
- þm.
  - PROPN 80: Virðulegi forseti . Ég þakka hv. þm. Pétri H. Blöndal fyrir svarið .
  - NOUN 1: Herra forseti . Ég vil spyrja hv. 4. þm. Reykv. svo , Pétur H. Blöndal , út í eitt má segja lagatæknilegt eða þinglegt atriði sem varðar atriði þessa máls og það er sú staðreynd að inn í þetta frumvarp eiga nú að bætast samkvæmt breytingartillögum meiri hlutans þrír nýir kaflar eða það á að opna upp löggjöf á þremur nýjum sviðum hér við 2. umr. málsins .
- EM
- London
- HM
- Forseti
- Fram
  - PROPN 11: Fram hefur 2 stig eftir fyrstu þrjár umferðirnar en Haukar hafa 4 stig .
  - ADP 1: Fram kom í formála ráðherra að þarna væru í rauninni þrjár sviðsmyndir : Það er að hægja á verkefninu , fara strax í könnunarviðræður eða halda áfram að safna þessum upplýsingum , eins og ráðgjafarhópurinn gerir grein fyrir í skýrslunni , og sú varð niðurstaðan .
- de
- Millar
- Djokovic
- pírata
  - NOUN 2: Ég hef sérstaklega nefnt 8. gr. og líka 11. gr. sem eru í stefnu okkar pírata og ég sé ekki hvers vegna ekki mátti hafa þær með .
  - PROPN 1: Ég sé að hér er samstaða milli hv. þingmanns Samfylkingar , hv. þingmanns Sjálfstæðisflokks Forseti hringir . og vissulega okkar pírata og því vona ég að þetta starf muni bara halda áfram og styrkjast .
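The ambiguity lists above can be reproduced by grouping tokens by lemma (or by word form) and keeping the entries that occur with more than one tag. The sketch below is an illustration under assumptions, not the page's own code: `ambiguous_lemmas` is a hypothetical helper, the counts for helgi and stjarna are the ones reported above, and the count for ísland is made up to stand in for an unambiguous lemma.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(pairs, tag):
    """pairs: iterable of (lemma, upos). Returns lemmas seen with `tag`
    and at least one other tag, most frequent under `tag` first."""
    by_lemma = defaultdict(Counter)
    for lemma, upos in pairs:
        by_lemma[lemma][upos] += 1
    hits = [
        (lemma, tags.most_common())  # tags ordered by descending count
        for lemma, tags in by_lemma.items()
        if tag in tags and len(tags) > 1
    ]
    return sorted(hits, key=lambda item: dict(item[1])[tag], reverse=True)


pairs = (
    [("stjarna", "PROPN")] * 9 + [("stjarna", "NOUN")] * 2
    + [("helgi", "PROPN")] * 10 + [("helgi", "NOUN")] * 7
    + [("ísland", "PROPN")] * 3  # hypothetical count, unambiguous lemma
)

for lemma, tags in ambiguous_lemmas(pairs, "PROPN"):
    print(lemma, tags)
```

Passing (form, upos) pairs instead of (lemma, upos) pairs would give the corresponding list of ambiguous types.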
Morphology
The form / lemma ratio of PROPN is 1.285391 (the average of all parts of speech is 1.734405).
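The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page. The short check below assumes that this is how the figure was derived; the variable names are illustrative only.

```python
# Figures taken from the summary at the top of this page.
propn_types = 1135   # distinct word forms tagged PROPN
propn_lemmas = 883   # distinct lemmas tagged PROPN

# Assumed definition: forms per lemma = types / lemmas.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")
```

A ratio below the treebank-wide average suggests that proper nouns are seen in fewer inflected forms per lemma than other parts of speech in this sample.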
The highest number of forms (7) was observed with the lemma “ólympíuleikar”: Ólympíuleika, Ólympíuleikana, Ólympíuleikanna, Ólympíuleikar, Ólympíuleikarnir, Ólympíuleikum, Ólympíuleikunum.
The 2nd highest number of forms (6) was observed with the lemma “evrópumót”: Evrópumót, Evrópumóti, Evrópumótinu, Evrópumótið, Evrópumóts, Evrópumótsins.
The 3rd highest number of forms (6) was observed with the lemma “framsóknarflokkur”: Framsóknarflokki, Framsóknarflokkinn, Framsóknarflokknum, Framsóknarflokksins, Framsóknarflokkur, Framsóknarflokkurinn.
PROPN occurs with 5 features: Case (2036; 74% instances), Definite (2036; 74% instances), Number (2036; 74% instances), Gender (1994; 73% instances), Foreign (88; 3% instances)
PROPN occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing
PROPN occurs with 50 feature combinations.
The most frequent feature combination is _ (619 tokens).
Examples: þm., RÚV, EM, H., HM, London, Collins, KSÍ, United, KR
Relations
PROPN nodes are attached to their parents using 18 different relations: obl (694; 25% instances), nsubj (516; 19% instances), flat:name (481; 18% instances), dep (325; 12% instances), nmod:poss (313; 11% instances), conj (149; 5% instances), appos (94; 3% instances), obj (75; 3% instances), root (29; 1% instances), iobj (27; 1% instances), xcomp (17; 1% instances), advcl (6; 0% instances), acl:relcl (5; 0% instances), amod (5; 0% instances), ccomp (3; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), discourse (1; 0% instances)
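The percentages in the relation list are shares of the 2743 PROPN tokens, and the 18 relation counts sum to exactly that total. The sketch below recomputes the first three shares; rounding to the nearest whole percent is an assumption about how the page was generated, and `share` is a hypothetical helper.

```python
def share(count, total):
    """Percentage of `total`, rounded to the nearest integer (assumed rule)."""
    return round(100 * count / total)


PROPN_TOKENS = 2743  # token count from the top of this page

# First three relation counts from the list above.
relations = {"obl": 694, "nsubj": 516, "flat:name": 481}

shares = {rel: share(n, PROPN_TOKENS) for rel, n in relations.items()}
print(shares)
```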
Parents of PROPN nodes belong to 12 different parts of speech: PROPN (1030; 38% instances), VERB (865; 32% instances), NOUN (668; 24% instances), ADJ (56; 2% instances), PRON (40; 1% instances), (29; 1% instances), ADV (22; 1% instances), AUX (13; 0% instances), DET (8; 0% instances), ADP (6; 0% instances), NUM (3; 0% instances), PART (3; 0% instances)
1175 (43%) PROPN nodes are leaves.
758 (28%) PROPN nodes have one child.
472 (17%) PROPN nodes have two children.
338 (12%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 18.
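The four child-degree counts above add up to the 2743 PROPN tokens (1175 + 758 + 472 + 338). A breakdown of this kind can be computed from the ID, UPOS and HEAD columns of each sentence, as in the following sketch. The example sentence and the helper `child_degree_buckets` are hypothetical and do not come from the treebank.

```python
from collections import Counter


def child_degree_buckets(sentence, tag):
    """sentence: list of (id, upos, head) tuples for one sentence.
    Returns a Counter of child-degree buckets for nodes tagged `tag`."""
    # Each occurrence of an id in the HEAD column is one child of that node.
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, upos, _ in sentence:
        if upos != tag:
            continue
        degree = n_children[node_id]
        if degree == 0:
            buckets["leaf"] += 1
        elif degree == 1:
            buckets["one child"] += 1
        elif degree == 2:
            buckets["two children"] += 1
        else:
            buckets["three or more children"] += 1
    return buckets


# Hypothetical five-token sentence: (id, upos, head); head 0 is the root.
sentence = [
    (1, "PROPN", 2),
    (2, "VERB", 0),
    (3, "ADP", 4),
    (4, "PROPN", 2),
    (5, "PROPN", 4),
]

print(child_degree_buckets(sentence, "PROPN"))
```

Summing the returned counters over all sentences of a treebank would give corpus-level figures.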
Children of PROPN nodes are attached using 28 different relations: case (699; 23% instances), flat:name (481; 16% instances), punct (426; 14% instances), dep (322; 11% instances), obl (272; 9% instances), conj (152; 5% instances), cc (134; 4% instances), amod (133; 4% instances), nmod:poss (68; 2% instances), acl:relcl (65; 2% instances), appos (48; 2% instances), cop (46; 2% instances), advmod (44; 1% instances), nsubj (22; 1% instances), nummod (15; 1% instances), mark (13; 0% instances), compound:prt (12; 0% instances), det (11; 0% instances), acl (7; 0% instances), xcomp (7; 0% instances), obj (4; 0% instances), nmod (3; 0% instances), parataxis (3; 0% instances), aux (2; 0% instances), expl (2; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances)
Children of PROPN nodes belong to 16 different parts of speech: PROPN (1030; 34% instances), ADP (712; 24% instances), PUNCT (426; 14% instances), NOUN (217; 7% instances), ADJ (142; 5% instances), CCONJ (134; 4% instances), NUM (92; 3% instances), VERB (67; 2% instances), AUX (53; 2% instances), ADV (48; 2% instances), PRON (32; 1% instances), DET (24; 1% instances), SCONJ (12; 0% instances), X (3; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)