skip to content
User Tools
Log In
Site Tools
Search
Tools
Show pagesource
Old revisions
Export to PDF
Fold/unfold all
Backlinks
Recent Changes
Media Manager
Sitemap
Log In
>
Recent Changes
Media Manager
Sitemap
You are here:
start
»
03_processing
Sidebar
FACTS AND FIGURES
SMS in the corpus
Mother Tongues
Languages in the Corpus
Participants
The Corpus
THE QUESTIONNAIRE
DATA COLLECTION
DATA PROCESSING
Tokenizing
Language tagging
Part of speech tagging
Cleaning the data up
Anonymization
Normalization
BROWSING THE CORPUS
Search options
Sub-corpora
Additional functions
Frequency Analysis
Export
Queries
Combined queries
Regular Expressions
Simple queries
Meta data
Layers of information
03_processing
DATA PROCESSING
The data collected was processed in the following steps:
General cleaning up
Anonymization
Language tagging
Tokenizing
Normalization
PoS tagging
03_processing.txt
· Last modified: 2022/06/27 09:21 (external edit)
Page Tools
Show pagesource
Old revisions
Backlinks
Export to PDF
Fold/unfold all
Back to top