home edit page issue tracker

This page pertains to UD version 2.

UD Greek GDT

Language: Greek (code: el)
Family: Indo-European, Greek

This treebank has been part of Universal Dependencies since the UD v1.1 release.

The following people have contributed to making this treebank part of UD: Prokopis Prokopidis.

Repository: UD_Greek-GDT
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.2

License: CC BY-NC-SA 3.0

Genre: news, wiki, spoken

Questions, comments? General annotation questions (either Greek-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [prokopis (æt) ilsp • gr]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually, natively in UD style
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually in non-UD style, automatically converted to UD
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion

Description

The Greek UD treebank (UD_Greek-GDT) is derived from the Greek Dependency Treebank (http://gdt.ilsp.gr), a resource developed and maintained by researchers at the Institute for Language and Speech Processing/Athena R.C. (http://www.ilsp.gr).

The Greek UD treebank consists of 2,521 sentences (61,673 tokens). The data in the current release derive from primary texts that are in the public domain, including wikinews articles and european parliament sessions. The treebank is licensed under the terms of Creative Commons Attribution-NonCommercial-ShareAlike, CC BY-NC-SA 3.0.

The morphological and syntactic annotation of the Greek UD treebank was originally created through a semi-automatic conversion of PDT-style annotations in GDT data. The syntactic annotation of the 2.1 release was generated by manual corrections of several constructions of the UD annotation, which is now the only manual syntactic annotation used for new data added to the resource. The harmonization with UD v2 is work in progress.

Acknowledgments

We wish to thank all contributors to the original annotation efforts. A large part of those annotations was work by students of the postgraduate programme Technoglossia IV, organised by the Institute for Language and Speech Processing, the University of Athens and the National Technical University of Athens.

Statistics of UD Greek GDT

POS Tags

ADJADPADVAUXCCONJDETNOUNNUMPARTPRONPROPNPUNCTSCONJSYMVERBX

Features

AbbrAspectCaseDefiniteDegreeForeignGenderMoodNumberNumTypePersonPossPronTypeTenseVerbFormVoice

Relations

aclacl:relcladvcladvmodamodapposauxcaseccccompcompoundconjcopcsubjcsubj:passdetdiscourseexplfixedflatiobjmarknmodnsubjnsubj:passnummodobjoblobl:agentorphanparataxispunctrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview