Article published In: Historiographia Linguistica
Vol. 40:3 (2013) ► pp.303–330
Letter counting
A stem cell for cryptology, quantitative linguistics, and statistics
Published online: 3 September 2013
https://doi.org/10.1075/hl.40.3.01yca
https://doi.org/10.1075/hl.40.3.01yca
Summary
Counting letters in written texts is a very ancient practice. It has accompanied the development of cryptology, quantitative linguistics, and statistics. In cryptology, counting frequencies of the different characters in an encrypted message is the basis of the so called frequency analysis method. In quantitative linguistics, the proportion of vowels to consonants in different languages was studied long before authorship attribution. In statistics, the alternation vowel-consonants was the only example that Markov ever gave of his theory of chained events. A short history of letter counting is presented. The three domains, cryptology, quantitative linguistics, and statistics, are then examined, focusing on the interactions with the other two fields through letter counting. As a conclusion, the eclecticism of the scholars of past centuries, their background in humanities, and their familiarity with cryptograms, are identified as contributing factors to the mutual enrichment process which is described here.
Résumé
Compter les lettres d’un texte écrit est une pratique très ancienne. Elle a accompagné le développement de la cryptologie, de la linguistique quantitative et de la statistique. En cryptologie, compter les fréquences des lettres dans un message codé est la base de la méthode appelée analyse de fréquence. En linguistique quantitative, la proportion relative des voyelles et des consonnes dans les différentes langues a été étudiée bien avant les problèmes de paternité des écrits. En statistique, l’alternance des voyelles et des consonnes a été le seul exemple que Markov ait donné de sa théorie des événements en chaîne. Une brève histoire des décomptes de lettres est présentée. Les trois domaines, cryptologie, linguistique quantitative et statistique, sont examinés ensuite, l’accent étant mis sur les inte ractions avec les deux autres domaines au travers des décomptes de lettres. En conclusion, l’éclectisme des savants des siècles passés, leur formation classique et leur familiarité avec les cryptogrammes, sont identifiés comme facteurs ayant contribué au processus d’enrichissement mutuel qui est décrit ici.
Zusammenfassung
Seit langem schon hatte man die Buchstaben gezählt, aus die ein Text besteht. Das galt besonders für die Entwicklung der Kryptologie, für die quantitative Linguistik und für die Statistik. In der Kryptologie bildete die Frequenz der Buchstaben einer schriftlichen Botschaft die Basis jener Methode, die Frequenzanalyse genannt wird. In der quantitativen Linguistik spielten die Proportion der Zahl der Vokale und Konsonanten in den verschiedenen Sprachen schon lange eine Rolle, bevor es um die Zuweisung der Texte zu verschiedenen Autoren ging. In der Statistik war der Wechsel zwischen Vokalen und Konsonanten das einzige Beispiel, welches Markov für seine Ketten-Theorie angeführt hatte. In den Beitrag wird eine kurze Geschichte der Zählweisen für Buchstaben geboten. Anschließend werden die drei Bereiche Kryptologie, quantitative Linguistik und Statistik untersucht, in wieweit ihre jeweiligen Zählweisen mit einander in Beziehung stehen. Zusammenfassend werden der Eklektizismus der Gelehrten früherer Jahrhunderte, ihre klassische Bildung und ihre Vertrautheit mit Kryptogrammen identifiziert als Faktoren, welche zur wechselseitigen Bereicherung der hier untersuchten Bereiche beigetragen haben.
References (81)
Alberti, Leon Battista. 2010 [1467]. “De Componendis Cifris”. Translated by Kim Williams. The Mathematical Work of Leon Battista Alberti ed. by Kim Williams, Lionel March & Stephen R. Wassel, 171–200. Basel: Birkäuser-Springer.
Andresen, Julie T[etel]. 1996. Linguistics in America, 1769–1924: A critical history. London: Routledge.
Babbage, Charles. 1832. “On the Advantage of a Collection of Numbers, to be Entitled the Constants of Nature and Art”. Edinburgh Journal of Science, New Series 61.334–340.
Bailey, Richard W[eld]. 1969. “Statistics and Style: A historical survey”. Statistics and Style ed. by Lubomir Doložel & Richard W. Bailey, 217–236. New York: American Elsevier.
Bauer, Friedrich L. 2007 [1
1997]. Decrypted Secrets: Methods and maxims of cryptology. 4th ed. Berlin: Springer.
Birgegård, Ulla. 1985. Johan Gabriel Sparwenfeld and the Lexicon Slavonicum: His contribution to 17th century Slavonic lexicography. Uppsala: Almqvist & Wiksell for the Bibliotheca R[egalis] Universitatis Upsaliensis.
Breithaupt, Christian. 1737. Ars Decifratoria sive Sciencia occultas scripturas solvendi et legendi. Helmstedt: Weygand.
Burke, Peter. 2012. A Social History of Knowledge. Volume II: From the Encyclopédie to Wikipedia. Cambridge: Polity Press.
Carnicer García, Carlos J. & Javier Marcos Rivas. 2005. Espías de Felipe II: Los servicios secretos del imperio español. Madrid: la Esfera de los Libros.
[Conradi, David Arnold]. 1742. “The Art of Deciphering”. The Gentleman’s Magazine 121.133–135, 185–186, 241–242, 473–475.
Davies, Damian W. 2002. Presences that Disturb: Models of Romantic Influence in the Literature and Culture of the 1790s. Cardiff: University of Wales Press.
De Morgan, Augustus. 1882 [1851]. “Letter to Rev. W. Heald Aug. 18, 1851”. Memoir of Augustus De Morgan ed. by Sophia Elizabeth De Morgan, 214–216. London: Green.
Falconer, John. 1685. Cryptomenysis Patefacta or the Art of Secret Information Disclosed without a Key. London: Brown.
Good, Irving J. 1979. “Studies in the History of Probability and Statistics. XXXVII: A. M. Turing’s statistical work in World War II”. Biometrika 661.393–396.
Grafton, Anthony. 2000. Leon Battista Alberti, Master Builder of the Italian Renaissance. Cambridge, Mass.: Harvard University Press.
Grieve, Jack W. 2005. Quantitative Authorship Attribution: A history and an evaluation of techniques. M.A. thesis, Simon Fraser University, Burnaby, B.C.
Hastings, James. 1902. Dictionary of the Bible Dealing with its Language, Literature, and Contents. Vol. IV (pleroma – zuzim). New York: Charles Scribner’s Sons.
Hitt, Parker. 1916. Manual for the Solution of Military Ciphers. Fort Leavenworth, Kansas: Press of the Army Service Schools.
Hufbauer, Karl. 1982. The Formation of the German Chemical Community (1720–1795). Berkeley: University of California Press.
Kahn, David. 1996 [1
1967]. The Codebreakers: The story of secret writing. Rev. ed. New York: Scribner.
Kelley, Page H., Daniel S. Mynatt & Timothy G. Crawford. 1998. The Masorah of Biblia Hebraica Stuttgartensia. Grand Rapids, Mich.: Eerdmans.
Klüber, Johannes Ludwig. 1809. Kryptographik: Lehrbuch der Geheimschreibekunst (Chiffrirund Dechiffrirkunst). Tübingen: J. G. Cotta.
Kraus, Kari. 2009. “Conjectural Criticism: Computing past and future texts”. Digital Humanities Quarterly 3:4. No page given.
Kullback, Solomon. 1976. Statistical Methods in Cryptanalysis. Laguna Hills, Calif.: Aegean Park Press.
Lieber, Francis. 1835. “Consonants”. Encyclopedia Americana ed. by Francis Lieber, Edward Wigglesworth, Thomas G. Bradford & Henry Vethake, vol. III1, 449–453 Philadelphia: Desilvern.
Link, David. 2006a. “Traces of the Mouth: Andrei Adreyevich Markov’s mathematization of writing”. History of Science 441.321–348.
. 2006b. “Chains to the West. Markov’s theory of connected events and its transmission to Western Europe”. Science in Context 191.561–589.
Markov, Andrey A. 2006 [1913]. “An Example of Statistical Investigation of the Text Eugene Onegin Concerning the Connection of Samples in Chains”. Translated by Gloria Custance and David Link. Science in Context 191.591–600.
Meister, Aloys. 1906. Die Geheimschrift im Dienste der Päpstlichen Kurie von ihren Anfängen bis zum Ende des XVI. Jahrhunderts. Paderborn: Ferdinand Schöningh.
Mercanti, Fabio & Paola Landra. 2007. “I ‘Ludi Matematici’ di Leon Batista Alberti”. EIRIS 21.15–47.
Meirion [i.e., William Owen Pughe]. 1799. “Proportion of Vowels and Consonants in Different Languages”. The Monthly Magazine 71.5–6.
Mollin, Richard A. 2005. Codes: The guide to secrecy from ancient to modern times. Boca Raton, Fla.: Chapman & Hall.
Morse, Edward L., ed. 1914. Samuel F. B. Morse: His letters and journals. Vol. II1. Boston: Houghton Mifflin.
Mrayati, Mohammed, Yahia Meer Alam & Hassan at-Tayyan. 2002. Al Kindī’s Treatise on Cryptanalysis. (=
Series on Arabic Origins of Cryptology, 1.) Damascus: KFCRIS & KACST.
. 2003a. Ibn ‘Adlān’s Treatise al-mu’allaf lil-malik al-’Ašraf. (=
Series on Arabic Origins of Cryptology, 2.) Ibid.
. 2003b. Ibn ad-Durayhim’s Treatise on Cryptanalysis. (=
Series on Arabic Origins of Cryptology, 3.) Ibid.
Mrayati, Mohammed, Yahia Meer Alam, & Hassan at-Tayyan. 2005. Ibn Dunaynīr’s book: Expositive Chapters on Cryptanalysis. (=
Series on Arabic Origins of Cryptology, 4.) Ibid.
Mrayati, Mohammed, Yahia Meer Alam & Hassan at-Tayyan. 2007. Two Treatises on Cryptanalysis. (=
Series on Arabic Origins of Cryptology, 6.) Ibid.
Peng, Roger D. & Nicolas W. Hengartner. 2002. “Quantitative Analysis of Literary Styles”. The American Statistician 561.175–185.
Perec, Georges. 1973. “Histoire du lipogramme”. Oulipo: La littérature potentielle, 73–89. Paris: Gallimard.
Perret, Paul-Michel. 1890. “Les règles de Cicco Simonetta pour le déchiffrement des écritures secretes (4 juillet 1474)”. Bibliothèque de l’École des Chartes 511.516–525. Paris.
Petruszewycz, Micheline. 1979a. “A. A. Markov, ses probabilités en chaîne et les statistiques linguistiques”. Mathématiques et Sciences Humaines 661.5–42.
. 1979b. “Contribution pour servir à l’étude du choix que fit A. A. Markov d’un domaine d’application de sa théorie des chaînes”. Mathématiques et Sciences Humaines 661.43–49.
Pierce, John R. 1973. “The Early Days of Information Theory”. IEEE Transactions on Information Theory 191.3–8.
Poe, Edgar Allan. 1986. The Annotated Tales of Edgar Allan Poe. Ed. by Stephen Peithman. New York: Avenel.
. 1873. “Notice sur Charles Babbage”. Annuaire de l’Académie Royale des sciences des lettres et des Beaux-arts de Belgique 391.149–165.
Richardson, Mary, John Grabosek, Diann Reischman & Phyllis Curtiss. 2004. “Morse Code, Scrabble, and the Alphabet”. Journal of Statistics Education 12:3. No page given.
Selenus, Gustavus [i.e., August the Younger of Brunswick-Lüneburg]. 1624. Cryptomenytices et Cryptographiae Libri IX. Lüneburg: Johann & Heinrich Stern.
Shannon, Claude E[lwood]. 1948. “A Mathematical Theory of Communication”. Bell System Technical Journal 271.379–423, 623–656.
Shannon, Claude E. 1949. “Communication Theory of Secrecy Systems”. Bell System Technical Journal 281.656–715.
Stigler, Stephen M. 1999. Statistics on the Table: The history of statistical concepts and methods. Cambridge, Mass.: Harvard University Press.
Strasser, Gerhard F. 1983. “The Noblest Cryptologist: August the Younger of Brunswick-Lüneburg (Gustavus Selenus) and his cryptological activities”. Cryptologia 7:3.193–217.
Thicknesse, Philip. 1772. A Treatise on the Art of Decyphering, and of Writing in Cypher. London: Brown.
Valerio, Paul-Louis Eugène. 1893. De la Cryptographie: Essai sur les méthodes de déchiffrement. Paris: Baudoin.
Vitevitch, Michael S. & Paul A. Luce. 1999. “Probabilistic Phonotactics and Neighborhood Activation in Spoken Word Recognition”. Journal of Memory and Language 401.374–408.
Wepman, Dennis. “Butts, Alfred Mosher”. American National Biography Online. New York: Oxford University Press.
Williams, Charles B. 1956. “Studies in the History of Probability and Statistics IV: A note on early statistical study of literary style”. Biometrika 431.248–256.
Ycart, Bernard. 2013. “Alberti’s Letter Counts”. Literary and Linguistic Computing. (print version in press).
Cited by (1)
Cited by one other publication
This list is based on CrossRef data as of 10 december 2025. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.
