Arzt, V., & Hanbury, A. (2024). Beyond the Numbers: Transparency in Relation Extraction Benchmark Creation and Leaderboards. In D. Hupkes, V. Dankers, K. Batsuren, A. Kazemnejad, C. Christodoulopoulos, M. Giulianelli, & R. Cotterel (Eds.), Proceedings of the 2nd GenBench Workshop on Generalisation (Benchmarking) in NLP (pp. 120–130). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.genbench-1.8