LTLBench: Towards Benchmarks for Evaluating Temporal Logic Reasoning in Large Language Models. Tang, W. & Belle, V. CoRR, 2024.
LTLBench: Towards Benchmarks for Evaluating Temporal Logic Reasoning in Large Language Models [link]Paper  doi  bibtex   
@article{DBLP:journals/corr/abs-2407-05434,
  author       = {Weizhi Tang and
                  Vaishak Belle},
  title        = {LTLBench: Towards Benchmarks for Evaluating Temporal Logic Reasoning
                  in Large Language Models},
  journal      = {CoRR},
  volume       = {abs/2407.05434},
  year         = {2024},
  url          = {https://doi.org/10.48550/arXiv.2407.05434},
  doi          = {10.48550/ARXIV.2407.05434},
  eprinttype    = {arXiv},
  eprint       = {2407.05434},
  timestamp    = {Mon, 12 Aug 2024 01:00:00 +0200},
  biburl       = {https://dblp.org/rec/journals/corr/abs-2407-05434.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

Downloads: 0