Beyond Clean Text: Evaluating Encoder and Decoder Robustness for Bangla Event Detection in Noisy Text
arXiv:2606.30914v1 Announce Type: new Abstract: Event detection (ED) systems are typically evaluated on clean, curated text, leaving their robustness to real-world noise largely unexplored, particularly for low-resource languages such as Bangla. We introduce a generalized Bangla news event ontology and a benchmark comprising 9,979 annotated sentences across 40 event subtypes, spanning clean news text, real-world Automatic Speech Recognition (ASR) transcripts, and orthographically corrupted text....
arXiv cs.CL
·Tanvir Ahmed Sijan, S. M Golam Rifat, Nayeemul Islam, Md. Musfique Anwar
·
// relacionados
Leia também
Blog
SpaceX has an AI device prototype, and it sure sounds phone-ish
Blog
Ashton Kutcher leaving Sound Ventures to launch new VC firm with Morgan Beller
Blog
Building a Multimodal Dataset of Academic Paper for Keyword Extraction
Blog