IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP. Koto, F., Rahimi, A., Lau, J. H., & Baldwin, T. CoRR, 2020.
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP [link]Paper  bibtex   
@article{DBLP:journals/corr/abs-2011-00677,
  author       = {Fajri Koto and
                  Afshin Rahimi and
                  Jey Han Lau and
                  Timothy Baldwin},
  title        = {IndoLEM and IndoBERT: {A} Benchmark Dataset and Pre-trained Language
                  Model for Indonesian {NLP}},
  journal      = {CoRR},
  volume       = {abs/2011.00677},
  year         = {2020},
  url          = {https://arxiv.org/abs/2011.00677},
  eprinttype    = {arXiv},
  eprint       = {2011.00677},
  timestamp    = {Fri, 06 Nov 2020 00:00:00 +0100},
  biburl       = {https://dblp.org/rec/journals/corr/abs-2011-00677.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

Downloads: 0