01_collection

Differences

This shows you the differences between two versions of the page.

01_corpus [2022/01/04 07:09] – ↷ Page moved and renamed from 01_corpus:start to 01_corpus Simone Ueberwasser
01_collection [2022/06/27 09:21] (current) – external edit 127.0.0.1
Line 1: Line 1:
-====== The collection ======
-===== Data collection =====
+====== DATA COLLECTION ======
+
  
-The data were collected in two stages. A first collection took place Nov 2009 - Feb 2010. Because this collection did not produce enough SMS for research in Italian and Romansh, a second collection took place between May and July 2011 and produced SMS mainly in those two languages. The two collections are now fully integrated and appear as one single corpus. In your everyday work with the corpus, you will not know whether a specific SMS was collected in the first or in the second round. A small difference can still be seen in the [[02_questionnaire:|questionnaire]], where additional questions were asked in the second collection.
+The data were collected in two stages. A first collection took place Nov 2009 - Feb 2010. Because this collection did not produce enough SMS for research in Italian and Romansh, a second collection took place between May and July 2011 and produced SMS mainly in those two languages. The two collections are now fully integrated and appear as one single corpus. In your everyday work with the corpus, you will not know whether a specific SMS was collected in the first or in the second round. A small difference can still be seen in the [[02_questionnaire:|questionnaire]], where additional questions were asked in the second collection. 
 ===== The informants =====
  
Line 12: Line 11:
 ===== Privacy =====
  
-No member of the team ever saw a phone number of the informants. People and their SMS can therefore not be traced back. Furthermore, the first step after the data collection was to [[Anonymisation|remove]] any type of personal information from the corpus. These steps were performed by means of computational linguistics. They show a reliability of more than 90% so data can be assumed to comply with Swiss and international regulations about data privacy.
+No member of the team ever saw a phone number of the informants. People and their SMS can therefore not be traced back. Furthermore, the first step after the data collection was to [[01_corpus:anonymisation|remove]] any type of personal information from the corpus. These steps were performed by means of computational linguistics. They show a reliability of more than 90% so data can be assumed to comply with Swiss and international regulations about data privacy.
 If you still recognize authors of specific SMS based on the topics that they write about, you are asked to comply with common [[https://en.wikipedia.org/wiki/Research#Research_ethics|research ethics]] and keep that knowledge to yourself.
01_collection.1641276586.txt.gz · Last modified: 2022/06/27 09:21 (external edit)
