Hi! I am a third-year Ph.D. student in the School of Computing at KAIST, advised by Alice Oh. My research focuses on developing evaluation methods for language models that reflect multilingual, multicultural, and interactive real-world use. I am particularly interested in the gap between benchmark performance and how people experience models in everyday interactions, especially for users whose linguistic and cultural backgrounds differ from those of AI developers. To better understand and close this gap, I study both the behavioral patterns of language models and human-centered evaluation practices.
My research focuses on:
My long-term goal is to build evaluation practices that make language models more reliable and meaningful for real-world users.
Email: 411juhyun [at] kaist.ac.kr
Links: [Google Scholar] [Twitter] [CV]
For a complete list, check my Google Scholar.