Generative AI/benchmarks
A survey on evaluation of large language models
김아다만티움
2024. 7. 30. 09:45
Chang, Y., Wang, X., Wang, J., Wu, Y., Yang, L., Zhu, K., ... & Xie, X. (2024). A survey on evaluation of large language models. ACM Transactions on Intelligent Systems and Technology, 15(3), 1-45.