Efficient NLP Inference at the Edge via Elastic Pipelining. Guo, L., Choe, W., & Lin, F. X. arXiv (Cornell University), 2022.
Efficient NLP Inference at the Edge via Elastic Pipelining [link]Paper  bibtex   
@article{2014,
  author = {Ling Guo and Wonkyo Choe and Felix Xiaozhu Lin},
  title = {Efficient NLP Inference at the Edge via Elastic Pipelining},
  year = {2022},
  journal = {arXiv (Cornell University)},
  url = {http://arxiv.org/abs/2207.05022}
}

Downloads: 0