Article published In: International Journal of Corpus Linguistics
Vol. 21:2 (2016) ► pp.192–218
Profiling verb complementation constructions across New Englishes
A two-step random forests analysis of ing vs. to complements
Published online: 8 September 2016
https://doi.org/10.1075/ijcl.21.2.03des
https://doi.org/10.1075/ijcl.21.2.03des
In this paper, we explore verb complementation patterns with to and ing in native English (British and American English) as compared to three Asian Englishes (Hong Kong, Indian, and Singaporean English). Based on data from the International Corpus of English annotated for variables describing the matrix verb and the complement, we run two random forests analyses to determine where the Asian Englishes have developed complementation preferences different from the two native speaker varieties. We find not only a variety of differences between the Asian and the native Englishes, but also that the Asian Englishes are more similar (i.e. ‘better predicted by’) the American English data. Further, as the first study of its kind to extend the MuPDAR approach from the now frequent regression analyses to random forests analysis, this study adds a potentially useful analytical tool to the often messy and skewed observational data corpus linguists need to deal with.
Keywords: random forests, MuPDAR, New (Asian) Englishes, ing vs. to, verb complementation
References (55)
Bernaisch, T., Gries, St. Th., & Mukherjee, J. (2014). The dative alternation in South Asian English(es): Modelling predictors and predicting prototypes. English World-Wide, 35(1), 7–31.
Bolton, K. (2008). Varieties of World Englishes. In B.B. Kachru, Y. Kachru & C.L. Nelson (Eds.), The Handbook of World Englishes (pp. 289–312). Singapore: Wiley-Blackwell.
Bresnan, J., Cueni A, Nikitina, T., & Baayen, R.H. (2007). Predicting the dative alternation. In G. Bourne, I. Kraemer & J. Zwarts (Eds.), Cognitive Foundations of Interpretation (pp. 69–94). Amsterdam: Royal Netherlands Academy of Science.
Burnham, K.P., & Anderson, D.R. (2002). Model Selection and Multimodel Inference: A Practical Information-theoretic Approach (2nd ed.). New York, NY: Springer.
Collins, P. (1995). The indirect object construction in English: An informational approach. Linguistics, 33(1), 35–49.
Cuyckens, H., Frauke D., & Szmrecsanyi, B. (2014). Variability in verb complemention in late modern English: Finite vs. non-finite patterns. In M. Hundt (Ed.), Late Modern English Syntax (pp. 182–204). Cambridge: Cambridge University Press.
De Smet, H. (2013). Spreading Patterns: Diffusional Change in the English System of Complementation. Oxford: Oxford University Press.
Deshors, S.C. (2014a). Towards an identification of prototypical non-native modal constructions in EFL: A corpus-based approach. Corpus Linguistics and Linguistic Theory, 11(1), 19–50.
. (2014b). A case for a unified treatment of EFL and ESL: A multifactorial approach. English World Wide, 35(3), 279–307.
. (2015). A constructionist approach to gerundial and infinitival verb-complementation patterns in native and Hong Kong English varieties. English Text Construction, 8(2), 207–235.
Divjak, D.S., & Arppe, A. (2013). Extracting prototypes from exemplars: What can corpus data tell us about concept representation? Cognitive Linguistics, 24(2), 221–274.
Duffley, P. (1999). The use of the infinitive and the -ing after verbs denoting the beginning, middle and end of an event. Folio Linguistica, 23(3), 295–331.
Edwards, A. (2014). The progressive aspect in the Netherlands and the ESL/EFL continuum. World Englishes, 33(2), 173–94.
Greenbaum, S. (Ed.) (1996). Comparing English Worldwide: The International Corpus of English. Oxford: Clarendon Press.
Gries, St. Th. (2003). Multifactorial Analysis in Corpus Linguistics: A Study of Particle Placement. London: Continuum Press.
. (2015a). Quantitative linguistics. In J.D. Wright (Ed.), International Encyclopedia of the Social and Behavioral Sciences (2nd ed.) (pp. 725–732). Oxford: Elsevier.
. (2015b). The role of quantitative methods in Cognitive Linguistics: Corpus and experimental data on (relative) frequency and contingency of words and constructions. In J. Daems, E. Zenner, K. Heylen, D. Speelman, & H. Cuyckens (Eds.), Change of Paradigms - New Paradoxes: Recontextualizing Language and Linguistics (pp. 311–325). Berlin: De Gruyter Mouton.
Gries, St. Th., & Adelman, A.S. (2014). Subject realization in Japanese conversation by native and non-native speakers: Exemplifying a new paradigm for learner corpus research. In J. Romero-Trillo (Ed.), Yearbook of Corpus Linguistics and Pragmatics 2014: New Empirical and Theoretical Paradigms (pp. 35–54). Cham: Springer.
Gries, St. Th., & Bernaisch, T. (2016). Exploring epicenters empirically: Focus on South Asian Englishes. English World-Wide, 37(1), 1–25.
Gries, St. Th., & Deshors, S.C. (2014). Using regressions to explore deviations between corpus data and a standard/target: Two suggestions. Corpora, 9(1), 109–136.
. (2015). EFL and/vs. ESL? A multi-level regression modeling perspective on bridging the paradigm gap. International Journal of Learner Corpus Research, 1(1), 130–159.
Gries, St. Th., & Wulff, S. (2013). The genitive alternation in Chinese and German ESL learners: Towards a multifactorial notion of context in learner corpus research. International Journal of Corpus Linguistics, 18(3), 327–356.
Hoffmann, T. (2014). The cognitive evolution of Englishes: The role of constructions in the Dynamic Model. In S. Buschfeld, T. Hoffmann, M. Huber & A. Kautzsch (Eds.), The Evolution of Englishes: The Dynamic Model and Beyond (pp. 160–180). Amsterdam: John Benjamins.
Kaleta, A. (2012). The English gerund vs. the to-infinitive: The case of aspectual constructions. Selected papers from
UK-CLA Meetings
. Retrieved from [URL] (last accessed June 2014).
Khamis, A. (2015, July). Cross-varietal variation in English verb complementation: A multivariate corpus analysis. Paper presented at the
International Cognitive Linguistics Conference 2015
, Newcastle upon Tyne.
Koch, C. (2015). Routines in lexis and grammar: A ‘gravity’ approach within the International Corpus of English. Paper presented at the
ICAME 36 conference
, Universität Trier, 27-29 May 2015.
Kuperman, V., & Bresnan, J. (2012). The effects of construction probability on word durations during spontaneous incremental sentence production. Journal of Memory and Language, 66(4), 588–611.
Liaw, A., & Wiener, M. (2015). randomForest. Version 4.6-12. A package for R. Retrieved from [URL] (last accessed February 2016).
Mair, C. (2002). Three changing patterns of verb complementation in Late Modern English: A real-time study based on matching text corpora. English Language and Linguistics, 6(1), 105–131.
. (2013). The world system of English. English World-Wide, 34(3), 253–278.
Martínez-García, M.T., & Wulff, S. (2012). Not wrong, yet not quite right: Spanish ESL students’ use of gerundial and infinitival complementation. International Journal of Applied Linguistics, 22(2), 225–244.
Matsuki, K., Kuperman, V., & Van Dyke, J.A. (2016). The Random Forests statistical technique: An examination of its value for the study of reading. Scientific Studies of Reading, 20(1), 20–33.
Mukherjee, J., & Gries, St. Th. (2009). Collostructional nativisation in New Englishes: Verb-construction associations in the International Corpus of English. English World-Wide, 30(1), 27–51.
Mukherjee, J., & Hoffmann, S. (2006). Describing verb-complementational profiles of New Englishes: A pilot study of Indian English. English World-Wide, 27(2), 147–173.
Mukherjee, J., & Schilk, M. (2008). Verb-complementational profiles across varieties of English: Comparing verb classes in Indian English and British English. In T. Nevalainen, I Taavitsainen, P. Pahta & M. Korhonen (Eds.), The Dynamics of Linguistic Variation: Corpus Evidence on English Past and Present (pp. 163–181). Amsterdam: John Benjamins.
Nam, C., Mukherjee, S., Schilk, M., & Mukherjee, J. (2013). Statistical analysis of varieties of English. Journal of the Royal Statistical Society, 176(3), 777–793.
Noël, D. (2003). Is there semantics in all syntax? The case of accusative and infinitive constructions vs. that-clauses. In G. Rohdenburg & B. Mondorf (Eds.), Language Typology and Syntactic Description: Vol.2, Complex Constructions (pp. 52–150). Cambridge: Cambridge University Press.
Noonan, M. (1985). Complementation. In T. Shopen (Ed.), Language Typology and Syntactic Description. Vol. 2. Complex Constructions (pp. 42–110). Cambridge: Cambridge University Press.
Olavarría de Ersson, E., & Shaw, P. (2003)). Verb complementation patterns in Indian standard English. English World-Wide, 24(2), 137–161.
R Development Core Team. 2012. R: A Language and Environment for Statistical Computing. Foundation for Statistical Computing. Vienna, Austria. <[URL]> (last accessed July 2012)
Ransom, E. (1979). Definiteness and animacy constraints on passives and double object constructions in English. Glossa, 13(2), 215–240.
Rohdenburg, G. (1995). On the replacement of finite complement clauses by infinitives in English. English Studies, 76(4), 367–388.
Schilk, M., Bernaisch, T., & Mukherjee, J. (2012). Mapping unity and diversity in South Asian English lexicogrammar. In M. Hundt & U. Gut (Eds.), Mapping Unity and Diversity World-wide: Corpus-based Studies of New Englishes (pp. 137–166). Amsterdam: John Benjamins.
Schilk, M., Mukherjee, J., Nam, C., & Mukherjee, S. (2013). Complementation of ditransitive verbs in south Asian Englishes: A multifactorial analysis. Corpus Linguistics and Linguistic Theory, 9(2), 187–225.
Shastri, S.V. (1996). Using computer corpora in the description of language with special reference to complementation in Indian English. In R.J. Baumgardner (Ed.), South Asian English: Structure, Use, and Users (pp. 70–81). Urbana & Chicago, IL: University of Illinois Press.
Schneider, E. (2007). Postcolonial English: Varieties around the World. Cambridge: Cambridge University Press.
Smith, M.B., & Escobedo, J. (2002). The semantics of to-infinitival vs. -ing verb complement constructions in English. In M. Andronis, C. Ball, H. Elston & S. Neuvel (Eds.), Proceedings from the Main Session in the Chicago Linguistics Society’s Thirty-Seventh Meeting (pp. 549–564). Chicago, IL: Chicago Linguistic Society.
Szmrecsanyi, B., & Kortmann, B. (2011). Typological profiling: Learner Englishes versus L2 varieties of English. In J. Mukherjee & M. Hundt (Eds.), Exploring Second-language Varieties of English and Learner Englishes: Bridging the Paradigm Gap (pp. 167–207). Amsterdam: John Benjamins.
Vosberg, U. (2003). The role of extractions and horror aequi in the evolution of -ing complements in modern English. In G. Rohdenburg & B. Mondorf (Eds.), Determinants of Grammatical Variation in English (pp. 329–345). Berlin: Mouton de Gruyter.
Wierzbicka, A. (1988). The Semantics of Grammar. Amsterdam: John Benjamins.
Wulff, S., & Gries, St. Th. (2015). Prenominal adjective order preferences in Chinese and German L2 English: A multifactorial corpus study. Linguistic Approaches to Bilingualism, 5(1), 122–150.
Cited by (45)
Cited by 45 other publications
Collins, Peter Craig
Ivaska, Ilmari, Mirva Johnson & Tommi Kurki
Olguín Martínez, Jesús & Stefan Th. Gries
2025. Similative-pretence constructions in language contact situations. Cognitive Linguistic Studies 12:2 ► pp. 292 ff.
P. Romasanta, Raquel
Romasanta, Raquel P.
Romasanta, Raquel P.
Ruohonen, Juho & Juhani Rudanko
Tao, Yating & Gaëtanelle Gilquin
Adolf, Philippa
Bernaisch, Tobias & Nina Funke
Ivaska, Ilmari & Anne Tamm
Meakins, Felicity, Lindell Bromham & Xia Hua
Partridge, Maristi
2024. Investigating the complementiser that
in the verb complementation of Black South African English. In Constraints on Language Variation and Change in Complex Multilingual Contact Settings [Contact Language Library, 60], ► pp. 87 ff.
Laliberté, Catherine, Melanie Keller & Diana Wengler
Moretti, Lorenzo
Youssef, Chadi Ben & Stefan Th. Gries
Kekki, Niina & Ilmari Ivaska
2022. The use of synonymous adjectives by learners of Finnish as a second language. International Journal of Learner Corpus Research 8:1 ► pp. 67 ff.
Gardner, Matt Hunt, Eva Uffing, Nicholas Van Vaeck, Benedikt Szmrecsanyi & Stefan Th. Gries
Shadrova, Anna, Pia Linscheid, Julia Lukassek, Anke Lüdeling & Sarah Schneider
Béchet, Christophe
Deshors, Sandra C. & Stefan Th. Gries
Deshors, Sandra C. & Sandra Götz
Egbert, Jesse & Luke Plonsky
García‐Castro, Laura
Gries, Stefan Th., Santa Barbara, Justus Liebig & Sandra C. Deshors
Wulff, Stefanie & Stefan Th. Gries
Kaunisto, Mark & Juhani Rudanko
Kaunisto, Mark & Juhani Rudanko
Kaunisto, Mark & Juhani Rudanko
Lester, Nicholas A.
Deshors, Sandra C. & Gaëtanelle Gilquin
2018. Modeling World Englishes in the 21st century. In Modeling World Englishes [Varieties of English Around the World, G61], ► pp. 281 ff.
Gilquin, Gaëtanelle
2018. American and/or British influence on L2 Englishes – Does context tip the scale(s)?. In Modeling World Englishes [Varieties of English Around the World, G61], ► pp. 187 ff.
Grafmiller, Jason & Benedikt Szmrecsanyi
Gries, Stefan Th., Tobias Bernaisch & Benedikt Heller
2018. A corpus-linguistic account of the history of the genitive alternation in Singapore English. In Modeling World Englishes [Varieties of English Around the World, G61], ► pp. 245 ff.
Kruger, Haidee & Gert De Sutter
2018. Alternations in contact and non-contact varieties. Translation, Cognition & Behavior 1:2 ► pp. 251 ff.
Deshors, Sandra C.
Deshors, Sandra C.
Deshors, Sandra C.
2018. Does the passé composé influence L2 learners’ use of English past tenses?. International Journal of Learner Corpus Research 4:1 ► pp. 23 ff.
Gries, Stefan Th.
Gries, Stefan Th.
2022. MuPDAR for corpus-based learner and variety studies. In Broadening the Spectrum of Corpus Linguistics [Studies in Corpus Linguistics, 105], ► pp. 257 ff.
Heller, Benedikt, Tobias Bernaisch & Stefan Th. Gries
Lee,Yong-hun & 유지희
MERILÄINEN, LEA
[no author supplied]
[no author supplied]
This list is based on CrossRef data as of 12 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
