The expression of hate speech against Afro-descendant, Roma, and LGBTQ+ communities in YouTube comments

Carvalho, Paula; Caled, Danielle; Silva, Cláudia; Batista, Fernando; Ribeiro, Ricardo

doi:10.1075/jlac.00085.car

Article published in:

Journal of Language Aggression and Conflict
Vol. 12:2 (2024), pp. 171–206

This paper addresses the specificities of online hate speech against the Afro-descendant, Roma, and LGBTQ+ communities in Portugal. The research is based on the analysis of CO-HATE, a corpus composed of 20,590 YouTube comments, which were manually annotated following detailed guidelines created for that purpose. We applied methods from corpus linguistics to assess the prevalence of overt and covert hate speech, counter-speech, and offensive speech, considering different grounds of discrimination, and to investigate the main linguistic and rhetorical strategies underlying hateful messages. The results highlight the importance of tackling covert hate speech, a recurring phenomenon often anchored in irony and fallacious argumentation, including the emotional appeal to fear and the implicit call to action. We believe this study will aid in advancing the analysis of online hate speech, while promoting the development of efficient automated detection models, specifically regarding the Portuguese language.

Keywords: overt hate speech, covert hate speech, counter-speech, Afrophobia, Romaphobia, LGBTQphobia

Article outline

1. Introduction
2. Hate speech corpora
3. Methods
- 3.1 Data collection
- 3.2 Annotators profile
- 3.3 Annotation guidelines
  - 3.3.1 Speech acts
  - 3.3.2 Grounds of discrimination
  - 3.3.3 Rhetorical devices
  - 3.3.4 Sentiment
4. Inter-annotator agreement
5. Results and discussion
- 5.1 Categories distribution
- 5.2 Linguistic realization of hate speech
- 5.3 Rhetorical and discursive strategies underlying covert hate speech
  - 5.3.1 Appeal to action
  - 5.3.2 Irony, sarcasm and negative stereotyping
6. Concluding remarks
Declaration of conflicting interests
Notes
References

Published online: 19 June 2023

https://doi.org/10.1075/jlac.00085.car

