Edited by Adam Pawłowski, Sheila Embleton, Jan Mačutek and Aris Xanthos
This book is a panorama of contemporary quantitative linguistics, as developed over decades. It highlights the main topics of QL: statistical laws of language, taxonomy of linguistic phenomena, authorial attribution, quantitative analysis of syntax (e.g., dependency grammar), measurement of text… read more
Edited by Adam Pawłowski, Jan Mačutek, Sheila Embleton and George Mikros
Specialists in quantitative linguistics the world over have recourse to a solid and universal methodology. These days, their methods and mathematical models must also respond to new communication phenomena and the flood of data produced daily. While various disciplines (computer science, media… read more
The aim of this paper is to investigate the use of the term for green, zielony, in the press released in Poland in 1945–1963 and in 2010. The data have been extracted from ChronoPress: Portal Tekstów Prasowych (Pawłowski 2021), a corpus of Polish newspapers and magazines, and the National Corpus… read more
This chapter provides a comparative analysis of bibliographic corpora including titles extracted from large national bibliographies (Czech, Finnish, German, Norwegian, and Polish). From the examined corpora, subsets were obtained, corresponding to the basic categories of the DDC/UDC formal… read more
The subject of this chapter is the application of automatic taxonomy methods to the corpus of microtexts, consisting of book titles. We test two hypotheses. The first one claims that simply on the basis of a book title one can automatically recognize its genre (writing species). The second… read more
The subject of this chapter is bibliographic corpus analysis, with data from the Polish national bibliography from the period 1801–2019. The research allowed us to discover and compare quantitative characteristics of the bibliographic corpus and of the reference corpus of general language. It… read more
The use of the Polish term for red, czerwony, was investigated in the press released in the period 1945–1954. The data have been extracted from ChronoPress: Chronologiczny Korpus Polskich Tekstów Prasowych (1945–1954), a corpus of Polish newspapers and magazines. At the time of writing this… read more
The aim of this paper is to investigate axiological attributes of Polish colour terms. We pose the following questions: Which colours are evaluated mostly positively, negatively and neutrally? Are there objects which are provided as both positive and negative associations of a single colour? Are… read more
The aim of this paper is to present the colour lexicon, including both basic and non-basic terms, found in Kashubian (or Cassubian), a West Slavic language spoken by a relatively small community inhabiting the coast of the Baltic Sea (the Pomorskie Province in Poland). The results of the… read more
La stylométrie est une branche de la linguistique qui se donne pour objet la description quantitative des particularités stylistiques des textes. Dans certains cas, cette description permet, entre autres, d’identifier les auteurs de textes anonymes et de déterminer la chronologie des textes d’un… read more