02_browsing:01_sub_corpora
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | Last revisionBoth sides next revision | ||
02_browsing:01_sub_corpora [2022/01/05 17:30] – Simone Ueberwasser | 02_browsing:01_sub_corpora [2022/01/26 09:37] – [Sub-corpora] Simone Ueberwasser | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Sub-corpora ====== | ====== Sub-corpora ====== | ||
- | The following sub-corpora are available: | + | The corpus all-tagged contains all SMS in all languages. Data for all languages except Romansh are tagged with TreeTagger. |
+ | |||
+ | Next to that, the following sub-corpora | ||
* deu-rftagged: | * deu-rftagged: | ||
* deu-tagged: non-dialectal German data tagged with TreeTagger | * deu-tagged: non-dialectal German data tagged with TreeTagger |
02_browsing/01_sub_corpora.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1