Corpus Linguistics and Digital Humanities. Intersecting Paths. A Case Study from Twitter

  • Angela Zottola, Università di Torino, Italy
Keywords: Digital Humanities, corpus linguistics, American studies, Twitter, WMatrix


This essay offers a critical analysis of the role of Corpus Linguistics within the Digital Humanities. The study suggests that the accessible, user-friendly tools of Corpus Linguistics are well suited to work within the Humanities and to advance the Digital Humanities' goal of bringing a range of computational techniques into humanistic research. The essay presents a case study based on data collected from the social media platform Twitter, which illustrates how the two approaches (Corpus Linguistics and Digital Humanities) can be used together within American Studies. In the essay, American Studies is understood broadly as a discipline that encompasses diverse perspectives on the study of the American continent. In conclusion, the encounter between Corpus Linguistics and the Digital Humanities is framed as a necessity for the future of the Digital Humanities themselves.


How to cite
Zottola, A. (2020). Corpus Linguistics and Digital Humanities. Intersecting Paths. A Case Study from Twitter. América Crítica, 4(2), 131-141.
Dossier: An Epistemology of Doing. Digital Humanities and American Studies