Datasets


EmoThreat: Dataset for Multi-label Emotion Classification in Urdu Link

EmoThreat: Dataset for Threatening Language Detection Task in Urdu Link

CoLI-Kanglish: Dataset for Word Level Language Identification in Code-mixed Kannada-English Texts Link

ReDDIT: Dataset for Regret Detection and Domain Identification from English Texts Link (email the corresponding author for access)

UrduFake: Bend-The-Truth, a Dataset for Fake News Detection in Urdu Link

UrduThreat: Dataset for Abusive Language Detection in Urdu Tweets Link

Dataset for YouTube-Based Religious Hate Speech and Extremism Detection from English Texts Link