02_browsing:01_sub_corpora
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
02_browsing:01_sub_corpora [2022/01/05 17:30] – Simone Ueberwasser | 02_browsing:01_sub_corpora [2022/06/27 09:21] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Sub-corpora ====== | ====== Sub-corpora ====== | ||
- | The following sub-corpora are available: | + | The corpus all-tagged contains all SMS in all languages. Data for all languages except Romansh are tagged with TreeTagger. |
+ | |||
+ | Next to that, the following sub-corpora | ||
* deu-rftagged: | * deu-rftagged: | ||
* deu-tagged: non-dialectal German data tagged with TreeTagger | * deu-tagged: non-dialectal German data tagged with TreeTagger |
02_browsing/01_sub_corpora.1641400249.txt.gz · Last modified: 2022/06/27 09:21 (external edit)