Abstract
Copyright and personal data protection are two of the most important legal aspects of collecting
data for a learner corpus. The paper explains the challenges in data collection for the learner
corpus of Latvian “LaVA” and describes the procedure used to protect the rights of the texts’
authors. An agreement / metadata questionnaire form was created to inform the
authors of how their texts are used and to obtain their permission for that use.
The information, permission, and metadata questionnaire are printed on one side
of an A4 sheet, and the author writes the text by hand on the other side,
thus eliminating the need to identify the author of the text separately. After the texts are scanned and added
to the corpus, the originals are returned to the authors.
Original language | English |
---|---|
Title of host publication | Selected Papers from the CLARIN Annual Conference 2019 |
Editors | Kiril Simov, Maria Eskevich |
Pages | 41-47 |
Number of pages | 7 |
Volume | 172 |
DOIs | |
Publication status | Published - 3 Jul 2020 |
Event | CLARIN Annual Conference 2019 - Leipzig, Germany; Duration: 30 Sept 2019 → 2 Oct 2019 |
Publication series
Name | Linköping Electronic Conference Proceedings |
---|---|
ISSN (Print) | 1650-3740 |
Conference
Conference | CLARIN Annual Conference 2019 |
---|---|
Country/Territory | Germany |
City | Leipzig |
Period | 30/09/19 → 2/10/19 |
Keywords*
- copyright
- personal data protection
- learner corpus
- Latvian
Field of Science*
- 6.2 Languages and Literature
- 1.1 Mathematics
Publication Type*
- 3.2. Articles or chapters in other proceedings other than those included in 3.1., with an ISBN or ISSN code