A Single Vector Is Not Enough: Taxonomy Expansion via Box Embeddings. Jiang, S., Yao, Q., Wang, Q., & Sun, Y. In Proceedings of the ACM Web Conference 2023, pages 2467–2476, Austin TX USA, April, 2023. ACM. 0 citations (Semantic Scholar/DOI) [2023-05-03]
A Single Vector Is Not Enough: Taxonomy Expansion via Box Embeddings [link]Paper  doi  abstract   bibtex   
Taxonomies, which organize knowledge hierarchically, support various practical web applications such as product navigation in online shopping and user profle tagging on social platforms. Given the continued and rapid emergence of new entities, maintaining a comprehensive taxonomy in a timely manner through human annotation is prohibitively expensive. Therefore, expanding a taxonomy automatically with new entities is essential. Most existing methods for expanding taxonomies encode entities into vector embeddings (i.e., single points). However, we argue that vectors are insufcient to model the “is-a” hierarchy in taxonomy (asymmetrical relation), because two points can only represent pairwise similarity (symmetrical relation). To this end, we propose to project taxonomy entities into boxes (i.e., hyperrectangles). Two boxes can be "contained", "disjoint" and "intersecting", thus naturally representing an asymmetrical taxonomic hierarchy. Upon box embeddings, we propose a novel model BoxTaxo for taxonomy expansion. The core of BoxTaxo is to learn boxes for entities to capture their child-parent hierarchies. To achieve this, BoxTaxo optimizes the box embeddings from a joint view of geometry and probability. BoxTaxo also ofers an easy and natural way for inference: examine whether the box of a given new entity is fully enclosed inside the box of a candidate parent from the existing taxonomy. Extensive experiments on two benchmarks demonstrate the efectiveness of BoxTaxo compared to vector based models.
@inproceedings{jiang_single_2023,
	address = {Austin TX USA},
	title = {A {Single} {Vector} {Is} {Not} {Enough}: {Taxonomy} {Expansion} via {Box} {Embeddings}},
	isbn = {978-1-4503-9416-1},
	shorttitle = {A {Single} {Vector} {Is} {Not} {Enough}},
	url = {https://dl.acm.org/doi/10.1145/3543507.3583310},
	doi = {10.1145/3543507.3583310},
	abstract = {Taxonomies, which organize knowledge hierarchically, support various practical web applications such as product navigation in online shopping and user profle tagging on social platforms. Given the continued and rapid emergence of new entities, maintaining a comprehensive taxonomy in a timely manner through human annotation is prohibitively expensive. Therefore, expanding a taxonomy automatically with new entities is essential. Most existing methods for expanding taxonomies encode entities into vector embeddings (i.e., single points). However, we argue that vectors are insufcient to model the “is-a” hierarchy in taxonomy (asymmetrical relation), because two points can only represent pairwise similarity (symmetrical relation). To this end, we propose to project taxonomy entities into boxes (i.e., hyperrectangles). Two boxes can be "contained", "disjoint" and "intersecting", thus naturally representing an asymmetrical taxonomic hierarchy. Upon box embeddings, we propose a novel model BoxTaxo for taxonomy expansion. The core of BoxTaxo is to learn boxes for entities to capture their child-parent hierarchies. To achieve this, BoxTaxo optimizes the box embeddings from a joint view of geometry and probability. BoxTaxo also ofers an easy and natural way for inference: examine whether the box of a given new entity is fully enclosed inside the box of a candidate parent from the existing taxonomy. Extensive experiments on two benchmarks demonstrate the efectiveness of BoxTaxo compared to vector based models.},
	language = {en},
	urldate = {2023-05-03},
	booktitle = {Proceedings of the {ACM} {Web} {Conference} 2023},
	publisher = {ACM},
	author = {Jiang, Song and Yao, Qiyue and Wang, Qifan and Sun, Yizhou},
	month = apr,
	year = {2023},
	note = {0 citations (Semantic Scholar/DOI) [2023-05-03]},
	pages = {2467--2476},
}

Downloads: 0