Nonnegative spatial factorization applied to spatial genomics

Townes, F. W. & Engelhardt, B. E. Nonnegative spatial factorization applied to spatial genomics. Nature Methods, December 2022. Publisher: Nature Publishing Group.


Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incorporate known structure between observations. Here, we present nonnegative spatial factorization (NSF), a spatially-aware probabilistic dimension reduction model based on transformed Gaussian processes that naturally encourages sparsity and scales to tens of thousands of observations. NSF recovers ground truth factors more accurately than real-valued alternatives such as MEFISTO in simulations, and has lower out-of-sample prediction error than probabilistic NMF on three spatial transcriptomics datasets from mouse brain and liver. Since not all patterns of gene expression have spatial correlations, we also propose a hybrid extension of NSF that combines spatial and nonspatial components, enabling quantification of spatial importance for both observations and features. A TensorFlow implementation of NSF is available from https://github.com/willtownes/nsf-paper.

@article{townes_nonnegative_2022,
	title = {Nonnegative spatial factorization applied to spatial genomics},
	copyright = {2022 The Author(s)},
	issn = {1548-7105},
	url = {https://www.nature.com/articles/s41592-022-01687-w},
	doi = {10.1038/s41592-022-01687-w},
	abstract = {Nonnegative matrix factorization (NMF) is widely used to analyze high-dimensional count data because, in contrast to real-valued alternatives such as factor analysis, it produces an interpretable parts-based representation. However, in applications such as spatial transcriptomics, NMF fails to incorporate known structure between observations. Here, we present nonnegative spatial factorization (NSF), a spatially-aware probabilistic dimension reduction model based on transformed Gaussian processes that naturally encourages sparsity and scales to tens of thousands of observations. NSF recovers ground truth factors more accurately than real-valued alternatives such as MEFISTO in simulations, and has lower out-of-sample prediction error than probabilistic NMF on three spatial transcriptomics datasets from mouse brain and liver. Since not all patterns of gene expression have spatial correlations, we also propose a hybrid extension of NSF that combines spatial and nonspatial components, enabling quantification of spatial importance for both observations and features. A TensorFlow implementation of NSF is available from https://github.com/willtownes/nsf-paper.},
	language = {en},
	urldate = {2022-12-31},
	journal = {Nature Methods},
	author = {Townes, F. William and Engelhardt, Barbara E.},
	month = dec,
	year = {2022},
	note = {Publisher: Nature Publishing Group},
	keywords = {Gene expression analysis, Machine learning, Software, Statistical methods, Transcriptomics},
	pages = {1--10},
}

