MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation

MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation. Shbita, B., Ahmed, F., & DeLuca, C. In NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling, 2025.

@inproceedings{shbita2025mermaidseqbench,
  title={MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation},
  author={Shbita, Basel and Ahmed, Farhan and DeLuca, Chad},
  booktitle={NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling},
  year={2025},
  doi={10.48550/arXiv.2511.14967},
  urlLink={https://arxiv.org/abs/2511.14967},
  urlPaper={https://arxiv.org/pdf/2511.14967.pdf}
}
