Article published In: English World-Wide
Vol. 44:3 (2023) ► pp.403–434
Broadening horizons in the diachronic and sociolinguistic study of Philippine English with the Twitter Corpus of Philippine Englishes (TCOPE)
Published online: 16 May 2023
https://doi.org/10.1075/eww.22047.gon
https://doi.org/10.1075/eww.22047.gon
Abstract
This paper presents the Twitter Corpus of Philippine Englishes (TCOPE): a dataset of 27 million tweets amounting
to 135 million words collected from 29 cities across the Philippines. It provides an overview of the dataset, and then shows how
it can be employed to examine Philippine English (PhilE) and its relationship with extralinguistic factors (e.g. ethno-geographic
region, time, sex). The focus is on the patterns of variation involving four PhilE features: (1) irregular past tense morpheme
-t, (2) double comparatives, (3) subjunctive were, and (4) phrasal verb base
from. My analyses corroborate previous work and further demonstrate structured heterogeneity within PhilE, indicating
that it is a multifaceted and dynamic variety. TCOPE has shown itself to be useful for exploring both the “general” features of
contemporary PhilE and the different forms of variation within it. It contributes to a deeper understanding of Philippine
English(es) over time and in different social contexts.
Article outline
- 1.Introduction
- 2.Considerations in corpus compilation
- 3.Procedure
- 4.Composition and distribution
- 4.1Island group
- 4.2City
- 4.3Year
- 5.Format, sub-corpora, and accessibility
- 6.TCOPE in relation to other corpora
- 7.TCOPE in action: Exploratory sociolinguistic analyses
- 7.1Preliminaries
- 7.2Past tense morphology: Use of irregular -t
- 7.3Comparison of adjectives: The use of double comparatives
- 7.4Subordinate counterfactual clauses: Subjunctive were
- 7.5Phrasal verbs: The use of base from
- 8.Conclusion
- Notes
Sources References
References (50)
Gonzales, Wilkinson Daniel Wong. 2021. TweetCorp. 〈[URL]〉
Anthony, Laurence. 2022. AntConc. Version
4.0.5. 〈[URL]〉 (accessed May 21, 2022).
Araneta, Anica, Bruno Carrasco, Hanif Rahemtulla, Stella Balgos, and Stephanie Sy. 2021. “Mapping
Digital Poverty in PH”. Inquirer February 22, 2021. 〈[URL]〉 (accessed February 22, 2022).
Ardehaly, Ehsan Mohammady, and Aron Culotta. 2017. “Co-training
for Demographic Classification Using Deep Learning from Label Proportions”. Paper presented at
the 2017 IEEE International Conference on Data Mining Workshops (ICDMW), The Roosevelt New Orleans
Waldorf Astoria Hotel.
Bamman, David, Jacob Eisenstein, and Tyler Schnoebelen. 2014. “Gender
Identity and Lexical Variation in Social Media”. Journal of
Sociolinguistics 181: 135–160.
Bautista, Ma. Lourdes S. 2000. Defining Standard Philippine English:
Its Status and Grammatical Features. Manila: De La Salle University Press.
2004a. “An Overview of the Philippine
Component of the International Corpus of English (ICE-PHI)”. Asian
Englishes 71: 8–26.
2004b. “Researching English in the
Philippines: Bibliographical Resources”. World
Englishes 231: 199–210.
Bohmann, Axel. 2016. “Language
Change Because Twitter? Factors Motivating Innovative Uses of Because Across the English-Speaking
Twittersphere”. In Lauren Squires, ed. English
in Computer-Mediated Communication. Berlin: De Gruyter, 149–178.
Borlongan, Ariane Macalinga. 2011. “Some Aspects of the
Morphosyntax of Philippine English”. In Ma. Lourdes S. Bautista, ed. Studies
of Philippine English: Exploring the Philippine component of the International Corpus of
English. Manila: Anvil Publishing, 187–200.
Borlongan, Ariane Macalinga, and Shirley N. Dita. 2015. “Taking
a look at Expanded Predicates in Philippine English Across Time”. Asian
Englishes 171: 240–247.
Collins, Peter, Ariane Macalinga Borlongan, Joo-Hyuk Lim, and Xinyue Yao. 2014. “The
Subjunctive Mood in Philippine English: A Diachronic
Analysis”. In Simone E. Pfenninger, Olga Timofeeva, Anne-Christine Gardner, Alpo Honkapohja, Marianne Hundt, and Daniel Schreier, eds. Studies
in Language Companion Series. Amsterdam: John Benjamins, 259–280.
Collins, Peter, Ariane Macalinga Borlongan, and Xinyue Yao. 2014. “Modality
in Philippine English: A Diachronic Study”. Journal of English
Linguistics 421: 68–88.
Coupland, Nikolas. 2016. “Labov,
Vernacularity and Sociolinguistic Change”. Journal of
Sociolinguistics 201: 409–430.
Davies, Mark, and Robert Fuchs. 2015. “Expanding
Horizons in the Study of World Englishes with the 1.9 Billion Word Global Web-Based English Corpus
(GloWbE)”. English
World-Wide 361: 1–28.
Doyle, Gabriel. 2014. “Mapping
Dialectal Variation by Querying Social Media”. In Shuly Wintner, Sharon Goldwater, and Stefan Riezler, eds. Proceedings
of the 14th Conference of the European Chapter of the Association for Computational
Linguistics. Gothenburg: Association for Computational Linguistics, 98–106.
Eisenstein, Jacob, Brendan O’Connor, Noah Smith, and Eric Xing. 2010. “A
Latent Variable Model for Geographic Lexical Variation”. In Hang Li, and Lluís Màrquez, eds. Proceedings
of the 2010 Conference on Empirical Methods in Natural Language Processing. Cambridge, MA: Association for Computational Linguistics, 1277–1287.
Evans, Stephen. 2015. “Testing
the Dynamic Model: The Evolution of the Hong Kong English Lexicon (1858–2012)”. Journal of
English Linguistics 431: 175–200.
. 2022a. “Hybridization”. In Ariane Macalinga Borlongan, ed. Philippine
English: Development, Structure, and Sociology of English in the
Philippines. London: Routledge, 170–183.
. 2022b. “Interactions of Sinitic
Languages in the Philippines: Sinicization, Filipinization, and Sino-Philippine Language
Creation”. In Zhengdao Ye, ed. The
Palgrave Handbook of Chinese Language
Studies. Singapore: Springer Nature Singapore, 369–408.
Gonzales, Wilkinson Daniel Wong, and Mie Hiramoto. 2020. “Two
Englishes Diverged in the Philippines? A Substratist Account of Manila Chinese
English”. Journal of Pidgin and Creole
Languages 351: 125–159.
Gustilo, Leah, and Chenee M. Dino. 2017. “Old
Speak or Young Speak: An Analysis of Netspeak Features in Filipino Netspeak”. Advanced Science
Letters 231: 1099–1103.
Gustilo, Leah, Abdul Wahid Ibrahim Tocalo, and King Arman Calingasan. 2019. “The
Intelligibility and Acceptability of Internet Philippine English (IPE): Their Implications for English Language Teaching in
the New English Varieties”. Asian EFL
Journal 211: 83–104.
Honnibal, Matthew, Ines Montani, Sofie Van Landeghem, and Adriane Boyd. 2020. spaCy:
Industrial-strength Natural Language Processing in Python. Version 3.5.1. 〈 〉 (accessed May 1, 2022).
Imao, Yasu. 2022. CasualConc. Version
2.1.7. 〈[URL]〉 (accessed May 1, 2022).
Jenkins, Jennifer. 2015. Global
Englishes: A Resource Book for Students (3rd
ed.). London: Routledge.
Lee, Aldrin P., and Ariane Macalinga Borlongan. 2022. “Internal
Variation”. In Ariane Macalinga Borlongan, ed. Philippine
English: Development, Structure, and Sociology of English in the Philippines (1st
ed.). London: Routledge, 125–134.
Maclagan, Margaret, Elizabeth Gordon, and Gillian Lewis. 1999. “Women
and Sound Change: Conservative and Innovative Behavior by the Same Speakers”. Language
Variation and Change 111: 19–41.
Martin, Isabel Pefianco. 2014. “Beyond nativization?:
Philippine English in Schneider’s Dynamic Model”. In Sarah Buschfeld, Thomas Hoffman, Magnus Huber, and Alexander Kautzsch, eds. The
evolution of Englishes: The Dynamic Model and
Beyond. Amsterdam: John Benjamins, 70–85.
Pietikäinen, Sari, Alexandra Jaffe, Helen Kelly-Holmes, and Nikolas Coupland. 2016. Sociolinguistics
from the Periphery: Small Languages in New Circumstances (1st
ed.). Cambridge: Cambridge University Press.
R Core Team. 2015. R: A language and
environment for statistical computing. 〈[URL]〉 (accessed May
1, 2022).
Rocklin, Matthew. 2015. “Dask:
Parallel Computation with Blocked Algorithms and Task
Scheduling”. In Kathryn Huff, and James Bergstra, eds. Proceedings
of the 14th Python in Science
Conference. Austin: SciPy, 130–136.
Rudis, Bob. 2020. Package
“Pluralize”: Pluralize and “Singularize” Any (English) Word. R package version
0.2.0. 〈[URL]〉 (accessed May 1, 2022).
Samejon, Kevin. 2022. “Acoustic
and Social Dimensions of Word-Final /z/ Production: Data from Acrolectal Speakers of Philippine
English”. Paper presented at the 7th International Conference on
English Pronunciation: Issues & Practices (EPIP7), Université Grenoble-Alpes.
Sankoff, Gillian. 2006. “Age:
Apparent Time and Real Time”. In Keith Brown, ed. The
Encyclopedia of Language and Linguistics (2nd
ed.). Oxford: Elsevier, 110–116.
Schneider, Edgar. 2003. “The
Dynamics of New Englishes: From Identity Construction to Dialect
Birth”. Language 791: 233–281.
Tagliamonte, Sali A., and Derek Denis. 2008. “Linguistic
Ruin? LOL! Instant Messaging and Teen Language”. American
Speech 831: 3–34.
Tayao, Ma. Lourdes G. 2004. “The Evolving Study of
Philippine English Phonology”. World
Englishes 231: 77–90.
Thompson, Roger Mark. 2003. Filipino English and Taglish:
Language Switching from Multiple
Perspectives. Amsterdam: Benjamins.
Villanueva, Rey John Castro. 2016. “The Features of Philippine
English Across Regions”. Ph.D. dissertation, De La Salle University.
Wang, Zijian, Scott A. Hale, David Adelani, Przemyslaw A. Grabowicz, Timo Hartmann, Fabian Flöck, and David Jurgens. 2019. “Demographic
Inference and Representative Population Estimates from Multilingual Social Media
Data”. In Ling Liu, and Ryen White, eds. Proceedings
of the 2019 World Wide Web Conference. New York: Association for Computing Machinery, 2056–2067.
Wijffels, Jan. 2022. Package
“UDPipe”: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the “UDPipe” “NLP”
Toolkit. R package version 0.8.11. 〈[URL]〉 (accessed May 1, 2022).
Cited by (11)
Cited by 11 other publications
Chatrine Bachtiar, Youna & Devie Rahmawati
Huang, Nick, Li Lin, Kunmei Han, Jia Wen Hing, Luwen Cao, Vincent Ooi & Zhiming Bao
2025. Treebanks and World Englishes. English World-Wide. A Journal of Varieties of English 46:1 ► pp. 93 ff.
McKeown, Jamie, Amber Wanwen Wang & Meng Ye
Bolton, Kingsley, Priscilla Angela T. Cruz & Isabel Pefianco Martin
Gonzales, Wilkinson Daniel Wong
GONZALES, WILKINSON DANIEL WONG
Gonzales, Wilkinson Daniel Wong
2024. Advancing Sino-Philippine linguistics and sociolinguistics using the Lannang Corpus (LanCorp). International Journal of Corpus Linguistics 29:2 ► pp. 213 ff.
Gonzales, Wilkinson Daniel Wong
Gonzales, Wilkinson Daniel Wong
Gonzales, Wilkinson Daniel Wong
This list is based on CrossRef data as of 9 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
