This page pertains to UD version 2.

UD Indonesian GSD

Language: Indonesian (code: id)
Family: Austronesian, Malayo-Sumbawan

This treebank has been part of Universal Dependencies since the UD v1.1 release.

The following people have contributed to making this treebank part of UD: Ryan McDonald, Joakim Nivre, Daniel Zeman, Septina Dian Larasati.

Repository: UD_Indonesian-GSD
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.2

License: CC BY-NC-SA 3.0 US

Genre: news, blog

Questions, comments? General annotation questions (either Indonesian-specific or cross-linguistic) can be raised in the main UD issue tracker. Bugs in this treebank can be reported in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [zeman (æt) ufal • mff • cuni • cz]. The UD version of this treebank currently has no maintainer. If you know the language and want to help, please consider adopting the treebank.

Annotation Source
Lemmas: assigned by a program, not checked manually
UPOS: annotated manually in non-UD style, automatically converted to UD
XPOS: assigned by a program, not checked manually
Features: assigned by a program, not checked manually
Relations: annotated manually in non-UD style, automatically converted to UD


The Indonesian UD treebank was converted from the content-head version of the Universal Dependency Treebank v2.0 (legacy).

Lemmas, XPOS tags and morphological features were added by MorphInd (created by Septina Dian Larasati; run and converted by Dan Zeman; http://septinalarasati.com/morphind/).


Statistics of UD Indonesian GSD

POS Tags






Tokenization and Word Segmentation



Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features


Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
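
As a rough illustration of the restriction just described, the sketch below counts dependency relations in CoNLL-U data where the parent is tagged VERB and the child is a noun or pronoun. It is not the script used to produce the statistics on this page. The function names, the hand-made sample sentence, and the choice to count PROPN alongside NOUN and PRON are all assumptions made for the example.

```python
from collections import Counter

# Hand-made sample in CoNLL-U format (NOT taken from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
# text = Dia membaca buku.
1\tDia\t_\tPRON\t_\t_\t2\tnsubj\t_\t_
2\tmembaca\t_\tVERB\t_\t_\t0\troot\t_\t_
3\tbuku\t_\tNOUN\t_\t_\t2\tobj\t_\t_
4\t.\t_\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def read_sentences(text):
    """Yield each sentence as a list of token rows (lists of columns)."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if rows:
                yield rows
                rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Keep plain word lines only; skip multiword token ranges
            # (e.g. "1-2") and empty nodes (e.g. "1.1").
            if cols[0].isdigit():
                rows.append(cols)
    if rows:
        yield rows


def verb_nominal_relations(text, child_tags=("NOUN", "PROPN", "PRON")):
    """Count DEPREL labels on edges from a VERB parent to a nominal child.

    Including PROPN in child_tags is an assumption of this sketch.
    """
    counts = Counter()
    for rows in read_sentences(text):
        upos_by_id = {cols[0]: cols[3] for cols in rows}
        for cols in rows:
            upos, head, deprel = cols[3], cols[6], cols[7]
            if upos in child_tags and upos_by_id.get(head) == "VERB":
                counts[deprel] += 1
    return counts


counts = verb_nominal_relations(SAMPLE)
```

To apply the same count to a treebank file, read its contents into a string and pass that string to verb_nominal_relations in place of SAMPLE.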

Relations Overview