PhytoCell: An ensemble learning framework for identifying cell states in plant scRNA-seq data. Wang, H., Yan, S., Ma, X., Si, H., Lu, Q., Chen, Y., Liu, L., Hong, J., Xu, X., Fang, W., He, Q., Zan, Y., & Yang, A. The Crop Journal, March, 2026.
PhytoCell: An ensemble learning framework for identifying cell states in plant scRNA-seq data [link]Paper  doi  abstract   bibtex   
Single-cell transcriptome sequencing (scRNA-seq) can reveal the roles of diverse cells in an organism, but accurately classifying cell subpopulations and their marker genes remains a challenge. Here, we present PhytoCell, an ensemble learning framework that combines feature selection engineering with machine learning to uncover cell markers and annotate cell subpopulations. We evaluated our approach on 120,000 cells from corollas of the dicotyledonous plant species coyote tobacco (Nicotiana attenuata) and eight tissues from the monocotyledonous plant species rice (Oryza sativa). Comprehensive evaluation across species and tissues demonstrated that PhytoCell effectively eliminates redundant information, identifies key cell markers, improves clustering performance, and accurately classifies cell subpopulations. Importantly, PhytoCell did not rely on prior biological knowledge for selecting cell markers, preserving the biological landscape of the original data. For broader accessibility, we developed a user-friendly web interface that provides convenient tools for users to access cell marker resources and perform predictions for cell type. PhytoCell is freely accessible at https://cgris.net/phyto. PhytoCell is scalable to different sizes of single-cell datasets, representing a valuable resource for precise identification in cell research.
@article{wang_phytocell_2026,
	title = {{PhytoCell}: {An} ensemble learning framework for identifying cell states in plant {scRNA}-seq data},
	issn = {2214-5141},
	shorttitle = {{PhytoCell}},
	url = {https://www.sciencedirect.com/science/article/pii/S2214514126000760},
	doi = {10.1016/j.cj.2026.02.021},
	abstract = {Single-cell transcriptome sequencing (scRNA-seq) can reveal the roles of diverse cells in an organism, but accurately classifying cell subpopulations and their marker genes remains a challenge. Here, we present PhytoCell, an ensemble learning framework that combines feature selection engineering with machine learning to uncover cell markers and annotate cell subpopulations. We evaluated our approach on 120,000 cells from corollas of the dicotyledonous plant species coyote tobacco (Nicotiana attenuata) and eight tissues from the monocotyledonous plant species rice (Oryza sativa). Comprehensive evaluation across species and tissues demonstrated that PhytoCell effectively eliminates redundant information, identifies key cell markers, improves clustering performance, and accurately classifies cell subpopulations. Importantly, PhytoCell did not rely on prior biological knowledge for selecting cell markers, preserving the biological landscape of the original data. For broader accessibility, we developed a user-friendly web interface that provides convenient tools for users to access cell marker resources and perform predictions for cell type. PhytoCell is freely accessible at https://cgris.net/phyto. PhytoCell is scalable to different sizes of single-cell datasets, representing a valuable resource for precise identification in cell research.},
	urldate = {2026-05-19},
	journal = {The Crop Journal},
	author = {Wang, Hao and Yan, Shen and Ma, Xiaoding and Si, Huan and Lu, Qiong and Chen, Yanqing and Liu, Lijia and Hong, Jingpeng and Xu, Xingjian and Fang, Wei and He, Qiang and Zan, Yanjun and Yang, Aiguo},
	month = mar,
	year = {2026},
	keywords = {Cell subpopulations, Ensemble learning, Machine learning, PhytoCell, scRNA-seq},
}

Downloads: 0