Fast multidimensional reduction and broadcast operations on GPU for machine learning. Dikbayir, D., Çoban, E. B., Kesen, I., Yuret, D., & Unat, D. Concurrency and Computation: Practice and Experience, 2018.
@article{Dikbayir2018FastMR,
    author = "Dikbayir, Doga and {\c{C}}oban, Enis Berk and Kesen, Ilker and Yuret, Deniz and Unat, D.",
    title = "Fast multidimensional reduction and broadcast operations on GPU for machine learning",
    journal = "Concurrency and Computation: Practice and Experience",
    year = "2018",
    volume = "30",
    keywords = "ML,NLP"
}
