Evaluating Computational Approaches for Harmful Content Analysis: Promise, Pitfalls, and Tools for Responsible Research

Itai Himelboim & Mudit Baid (UGA graduate student), “Evaluating Computational Approaches for Harmful Content Analysis: Promise, Pitfalls, and Tools for Responsible Research,” accepted for publication in Big Data and Cognitive Computing. Abstract: This manuscript develops and demonstrates a practical framework for evaluating automated classifiers used in communication research, using harmful language detection as an illustrative case. We combine (a) a structured review of documentation practices for 27 publicly available classifiers and their associated annotation processes with (b) a cross-dataset evaluation that retests each model beyond its original training context. Across 27 datasets, we extract and compare reporting on construct definitions, annotator instructions, and inter-annotator agreement, and we quantify generalization by applying each model to multiple out-of-domain test sets. We also benchmark a contemporary large language model (GPT-5) under a consistent prompting protocol to illustrate how LLM-based classification compares to fine-tuned classifiers. Results show that documentation is uneven and often insufficient for theory-driven measurement, inter-annotator agreement varies widely across datasets, and cross-dataset performance frequently drops substantially relative to within-dataset evaluations. Building on these findings and existing validation guidance, we provide a reusable checklist and decision flow to help researchers select, justify, and report classifier-based measures in ways that support transparency and cumulative science. Recommendations for researchers, reviewers, and journal editors stress aligning model selection with standards of validity, reliability, and transparency.

Itai Himelboim

Related Research

The Power of Investing in NH/PI and AA Communities through Commercial Tobacco and Cancer Prevention: A Community Perspective
Stephen Jiwanmall was a panelist for a session titled “The Power of Investing in NH/PI and AA Communities through Commercial Tobacco and Cancer Prevention: A Community Perspective” at the 2026 […]

Learn More Learn More about The Power of Investing in NH/PI and AA Communities through Commercial Tobacco and Cancer Prevention: A Community Perspective
When Flattery Backfires: Flattery, Perceived Fit, and Brand Evaluation
Sun, Y., Mengqi (Maggie) Liao, & Yang, J. (2026, August). “When Flattery Backfires: Flattery, Perceived Fit, and Brand Evaluation,” paper to be presented at the 109th annual conference of the […]

Learn More Learn More about When Flattery Backfires: Flattery, Perceived Fit, and Brand Evaluation