In: Corpus Dialectology
Edited by Elissa Pustka, Carmen Quijada Van den Berghe and Verena Weiland
[Studies in Corpus Linguistics 110] 2023
pp. 10–33
On the validity of crowdsourced data
Published online: 14 August 2023
https://doi.org/10.1075/scl.110.01kru
This chapter demonstrates the validity of
crowdsourced data by comparing the crowdsourced data from the VinKo
project with traditionally collected data from the AThEME project.
Both datasets target non-standard language varieties of the South
Tyrol, Trentino, and Veneto regions in north-eastern Italy. Three
different morphosyntactic phenomena are discussed, each relating to
a particular language variety, providing evidence that the
crowdsourced data is of comparable quality to the traditionally
gathered data and has the added advantage of yielding a larger
overall dataset covering a denser location network.
Article outline
- 1. Introduction
- 2. The AThEME and VinKo projects
- 3. VinKo platform design and data collection methods
- 3.1 Data collection
- 3.2 Representation
- 3.3 Technical aspects
- 4. VinKo and AThEME data in comparison: three case studies
- 4.1 Tyrolean dialects: pronominal case patterns
- 4.2 Trentino dialects: agreement with a postverbal subject
- 4.3 Venetan dialects: obligatory and optional subject proclitics
- 5. Conclusions
- Abbreviations
- Notes
- References
