Resources for Dutch
From ACL Wiki
Corpora
- Araneum Nederlandicum, Gigaword Dutch web corpus
- Dutch plain text and co-occurrences at LCC
- Europarl corpus - sentence-aligned with English
- CLiPS Stylometry Investigation (CSI) corpus - multi-purpose text corpus, mainly used in stylometry
- HamleDT - harmonized dependency treebanks of many languages in a common annotation style
Tools
- Dutch HPSG-based parser - includes the Alpino treebank (7137 sentences of newspaper text, manually corrected)