start

Differences

This shows you the differences between two versions of the page.

start [2022/01/04 15:43] – ↷ Links adapted because of a move operation Simone Ueberwasser
start [2022/01/05 12:34] – ↷ Links adapted because of a move operation Simone Ueberwasser
Line 4:
 
 ===== The corpus =====
-The Swiss SMS corpus consists of 25'947 SMS (~650'000 tokens), which were sent in by the Swiss public in 2009/2010. Of all SMS, 41% are in Swiss German (dialect), 28% in non-dialectal German, 18% in French, 6% in Italian, and 4% in Romansh. More information about the corpus can be found in the section [[05_facts_and_figures:00_facts_and_figures|facts and figures]].
+The Swiss SMS corpus consists of 25'947 SMS (~650'000 tokens), which were sent in by the Swiss public in 2009/2010. Of all SMS, 41% are in Swiss German (dialect), 28% in non-dialectal German, 18% in French, 6% in Italian, and 4% in Romansh. More information about the corpus can be found in the section [[05_facts_and_figures|facts and figures]].
 
 ===== Using the corpus =====
start.txt · Last modified: 2022/09/12 19:18 by Stefan Bircher
