02_browsing:01_sub_corpora
Differences
This shows you the differences between two versions of the page.
| Previous revision | Current revision | ||
| 02_browsing:01_sub_corpora [2022/01/05 16:30] – simone.ueberwasser.ds.uzh.ch | 02_browsing:01_sub_corpora [2022/06/27 07:21] (current) – external edit 127.0.0.1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Sub-corpora ====== | ====== Sub-corpora ====== | ||
| - | The following sub-corpora are available: | + | The corpus all-tagged contains all SMS in all languages. Data for all languages except Romansh are tagged with TreeTagger. |
| + | |||
| + | In addition, the following sub-corpora | ||
| * deu-rftagged: | * deu-rftagged: | ||
| * deu-tagged: non-dialectal German data tagged with TreeTagger | * deu-tagged: non-dialectal German data tagged with TreeTagger | ||
