logo Idiap Research Institute        
 [BibTeX] [Marc21]
Noisy Text Categorization
Type of publication: Idiap-RR
Citation: vincia03d
Number: Idiap-RR-61-2003
Year: 2003
Institution: IDIAP
Abstract: This work presents a system for the categorization of noisy texts. By noisy it is meant any text obtained through an extraction process (affected by errors) from media different than digital texts. We show that, even with an average Word Error Rate of around 50%, the categorization performance loss with respect to the clean version of the same documents is negligible.
Userfields: ipdmembership={vision},
Keywords:
Projects Idiap
Authors Vinciarelli, Alessandro
Crossref by vin03d-art
Added by: [UNK]
Total mark: 0
Attachments
  • rr03-61.pdf
  • rr03-61.ps.gz
Notes