Benchmarking Language Model Creativity: A Case Study on Code Generation was accepted by NAACL 2025 main!