Breaking text-based CAPTCHAs with variable word and character orientation. Starostenko, O., Cruz-Perez, C., Uceda-Ponga, F., & Alarcon-Aquino, V. Pattern Recognition, 48(4):1101-1112, 4, 2015.
Breaking text-based CAPTCHAs with variable word and character orientation [link]Website  doi  abstract   bibtex   
A novel approach for automatic segmentation and recognition of CAPTCHAs with variable orientation and random collapse of overlapped characters is presented in this paper. Additionally, the extension of the proposed approach to break reCAPTCHA of version of 2012 is also discussed. The original proposal consists in straightening characters and word in CAPTCHA exploiting then a three-color bar code for their segmentation. The recognition of straightened characters and whole word is provided by the proposed original SVM-based learning classifier. The main goal of this research is to reduce vulnerability of CAPTCHA from spam and frauds as well as to provide an approach for recognizing either handwritten or degraded and damaged texts in ancient manuscripts by OCR systems. The designed framework for breaking CAPTCHAs by the proposed approach has been tested achieving average segmentation success rate up to 82% for reCAPTCHA of version 2011 and achieving 95.5% by extended approach for reCAPTCHA of version 2012 with response time less than 0.5 s per two-word reCAPTCHA. The implemented SVM classifier shows a competitive precision about 94%. The obtained very satisfactory results confirm that the proposed approach may be used for development of new security mechanisms to protect users against cyber-criminal activities and Internet threats.
@article{
 title = {Breaking text-based CAPTCHAs with variable word and character orientation},
 type = {article},
 year = {2015},
 keywords = {Breaking CAPTCHA,Heuristic classifier,Three-color bar character encoding,Word and character straightening,reCAPTCHA Version 2012},
 pages = {1101-1112},
 volume = {48},
 websites = {https://linkinghub.elsevier.com/retrieve/pii/S0031320314003483},
 month = {4},
 id = {7d205188-d964-359d-b51d-97b780489a73},
 created = {2022-08-29T17:42:39.480Z},
 file_attached = {false},
 profile_id = {940dd160-7d67-3a5f-b9f8-935da0571367},
 group_id = {92fccab2-8d44-33bc-b301-7b94bb18523c},
 last_modified = {2022-08-29T17:42:39.480Z},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 private_publication = {false},
 abstract = {A novel approach for automatic segmentation and recognition of CAPTCHAs with variable orientation and random collapse of overlapped characters is presented in this paper. Additionally, the extension of the proposed approach to break reCAPTCHA of version of 2012 is also discussed. The original proposal consists in straightening characters and word in CAPTCHA exploiting then a three-color bar code for their segmentation. The recognition of straightened characters and whole word is provided by the proposed original SVM-based learning classifier. The main goal of this research is to reduce vulnerability of CAPTCHA from spam and frauds as well as to provide an approach for recognizing either handwritten or degraded and damaged texts in ancient manuscripts by OCR systems. The designed framework for breaking CAPTCHAs by the proposed approach has been tested achieving average segmentation success rate up to 82% for reCAPTCHA of version 2011 and achieving 95.5% by extended approach for reCAPTCHA of version 2012 with response time less than 0.5 s per two-word reCAPTCHA. The implemented SVM classifier shows a competitive precision about 94%. The obtained very satisfactory results confirm that the proposed approach may be used for development of new security mechanisms to protect users against cyber-criminal activities and Internet threats.},
 bibtype = {article},
 author = {Starostenko, Oleg and Cruz-Perez, Claudia and Uceda-Ponga, Fernando and Alarcon-Aquino, Vicente},
 doi = {10.1016/j.patcog.2014.09.006},
 journal = {Pattern Recognition},
 number = {4}
}

Downloads: 0