Publications

2026

ICLR

RPM: Reasoning-Level Personalization for Black-Box Large Language Models

{Jieyong Kim, Tongyoung Kim}, Soojin Yoon, Jaehyung Kim, Dongha Lee

AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

{Sunghwan Kim, Ryang Heo}, Yongsik Seo, Jinyoung Yeo, Dongha Lee

Preprint (arXiv)

P-Check: Advancing Personalized Reward Model via Learning to Generate Dynamic Checklist

Kwangwook Seo, Dongha Lee

CREAM: Continual Retrieval on Dynamic Streaming Corpora with Adaptive Soft Memory

Huijeong Son, Hyeongu Kang, Sunho Kim, Subeen Ho, Seongku Kang, Dongha Lee, Susik Yoon

2025

Preprint (arXiv)

Personalized Reward Modeling for Text-to-Image Generation

Jeongeun Lee, Ryang Heo, Dongha Lee

Preprint (arXiv)

IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering

Jieyong Kim, Maryam Amirizaniani, Soojin Yoon, Dongha Lee

Preprint (arXiv)

Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents

Wonjoong Kim, Sangwu Park, Yeonjun In, Sein Kim, Dongha Lee, Chanyoung Park

Preprint (arXiv)

In Their Own Words: Reasoning Traces Tailored for Small Models Make Them Better Reasoners

Jaehoon Kim, Kwangwook Seo, Dongha Lee

Preprint (arXiv)

BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback

{Hyunseo Kim, Sangam Lee}, Kwangwook Seo, Dongha Lee

BPL: Bias-adaptive Preference Distillation Learning for Recommender System

Seongku Kang, Jianxun Lian, Dongha Lee, Wonbin Kweon, Sanghwan Jang, Jaehyun Lee, Jindong Wang, Xing Xie, Hwanjo Yu

NeurIPS
Spotlight

Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning

Yeongbin Seo, Dongha Lee, Jaehyung Kim, Jinyoung Yeo

Can Large Language Models be Effective Online Opinion Miners?

Ryang Heo, Yongsik Seo, Junseong Lee, Dongha Lee

EMNLP Findings

Towards Personalized Conversational Sales Agents: Contextual User Profiling for Strategic Action

{Tongyoung Kim, Jeongeun Lee}, Soojin Yoon, Seonghwan Kim, Dongha Lee

EMNLP Findings

Stop Playing the Guessing Game! Target-Free User Simulation for Evaluating Conversational Recommender Systems

{Sunghwan Kim, Kwangwook Seo}, Tongyoung Kim, Jinyoung Yeo, Dongha Lee

EMNLP Findings

Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching

Seoyeon Kim, Huiseo Kim, Chanjun Park, Jinyoung Yeo, Dongha Lee

EMNLP Findings

How Diversely Can Language Models Solve Problems? Exploring the Algorithmic Diversity of Model-Generated Code

Seonghyeon Lee, Heejae Chon, Joonwon Jang, Dongha Lee, Hwanjo Yu

HIPPO-Video: Simulating Watch Histories with Large Language Models for Personalized Video Highlighting

Jeongeun Lee, Youngjae Yu, Dongha Lee

Imagine All The Relevance: Scenario-Profiled Indexing with Knowledge Expansion for Dense Retrieval

Sangam Lee, Ryang Heo, SeongKu Kang, Dongha Lee

MT-RAIG: Novel Benchmark and Evaluation Framework for Retrieval-Augmented Insight Generation over Multiple Tables

{Kwangwook Seo, Donguk Kwon}, Dongha Lee

Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization

{Sunghwan Kim, Dongjin Kang}, Taeyoon Kwon, Hyungjoo Chae, Dongha Lee, Jinyoung Yeo

Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation

{Jieyong Kim, Hyunseo Kim}, Hyunjin Cho, SeongKu Kang, Buru Chang, Jinyoung Yeo, Dongha Lee

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Hyungjoo Chae, Namyoung Kim, Kai Tzu-iunn Ong, Minju Gwak, Gwanwoo Song, Jihoon Kim, Sunghwan Kim, Dongha Lee, Jinyoung Yeo

Towards Lifelong Dialogue Agents via Relation-aware Memory Construction and Timeline-augmented Response Generation

{Kai Tzu-iunn Ong, Namyoung Kim}, Minju Gwak, Hyungjoo Chae, Taeyoon Kwon, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo

NAACL Findings

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Seungbeen Lee, Seungwon Lim, Seungju Han, Giyeong Oh, Hyungjoo Chae, Jiwan Chung, Minju Kim, Beong-woo Kwak, Yeonsoo Lee, Dongha Lee, Jinyoung Yeo, Youngjae Yu

Unsupervised Robust Cross-Lingual Entity Alignment via Neighbor Triple Matching with Entity and Relation Texts

Soojin Yoon, Sungho Ko, Tongyoung Kim, SeongKu Kang, Jinyoung Yeo, Dongha Lee

WSDM

Improving Scientific Document Retrieval with Concept Coverage-based Query Set Generation

SeongKu Kang, Bowen Jin, Wonbin Kweon, Yu Zhang, Dongha Lee, Jiawei Han, Hwanjo Yu

2024

Preprint (arXiv)

MVIGER: Multi-View Variational Integration of Complementary Knowledge for Generative Recommender

Tongyoung Kim, Soojin Yoon, SeongKu Kang, Jinyoung Yeo, Dongha Lee

Preprint (arXiv)

Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths

Sangam Lee, Ryang Heo, SeongKu Kang, Susik Yoon, Jinyoung Yeo, Dongha Lee

Preprint (arXiv)

Graph Signal Processing for Cross-Domain Recommendation

Jeongeun Lee, SeongKu Kang, Won-Yong Shin, Jeongwhan Choi, Noseong Park, Dongha Lee

MICCAI Challenge
First place award at UWF4DR

Bag of Tricks for Diabetic Retinopathy and Diabetic Macular Edema Classification in Ultra-Widefield Imaging

Hyeonmin Kim, Chanyang Seo, Wonyoung Seo, Yunnie Cho, Ohhyun Kwon, Dongha Lee

NeurIPS

Train-Attention: Meta-Learning Where to Focus in Continual Knowledge Learning

Yeongbin Seo, Dongha Lee, Jinyoung Yeo

EMNLP
Oral

Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering

{Sungho Ko, Hyunjin Cho}, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee

EMNLP
Oral

Taxonomy-guided Semantic Indexing for Academic Paper Search

SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu

EMNLP Findings

Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization

Kwangwook Seo, Jinyoung Yeo, Dongha Lee

EMNLP Findings

Make Compound Sentences Simple to Analyze: Learning to Split Sentences for Aspect-based Sentiment Analysis

{Yongsik Seo, Sungwon Song, Ryang Heo}, Jieyong Kim, Dongha Lee

EMNLP Findings

CACTUS: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory

{Suyeon Lee, Sunghwan Kim, Minju Kim}, Dongjin Kang, Dongil Yang, Harim Kim, Minseok Kang, Dayi Jung, Min Hee Kim, Seungbeen Lee, Kyoung-Mee Chung, Youngjae Yu, Dongha Lee, Jinyoung Yeo

EMNLP Findings

Eliciting Instruction-tuned Code Language Models' Capabilities to Utilize Auxiliary Function for Code Generation

Seonghyeon Lee, Suyeon Kim, Joonwon Jang, Heejae Chon, Dongha Lee, Hwanjo Yu

Preprint (arXiv)

YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion

Dongil Yang, Suyeon Lee, Minjin Kim, Jungsoo Won, Namyoung Kim, Dongha Lee, Jinyoung Yeo

VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models

{Seoyeon Kim, Kwangwook Seo}, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee

ACL
Outstanding Paper Award

Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation

{Dongjin Kang, Sunghwan Kim}, Taeyoon Kwon, Seungjun Moon, Hyunsouk Cho, Youngjae Yu, Dongha Lee, Jinyoung Yeo

ACL Findings

PEARL: A Review-driven Persona-Knowledge grounded Conversational Recommendation Dataset

{Minjin Kim, Minju Kim}, Hana Kim, Beong-woo Kwak, Soyeon Jeon, Hyunseo Kim, SeongKu Kang, Youngjae Yu, Jinyoung Yeo, Dongha Lee

ACL Findings

Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy

{Jieyong Kim, Ryang Heo}, Yongsik Seo, SeongKu Kang, Jinyoung Yeo, Dongha Lee

NAACL Demo

RTSUM: Relation Triple-based Interpretable Summarization with Multi-level Salience Visualization

Seonglae Cho, Myungha Jang, Jinyoung Yeo, Dongha Lee

NAACL Findings

Exploring Language Model's Code Generation Ability with Auxiliary Functions

Seonghyeon Lee, Sanghwan Jang, Seongbo Jang, Dongha Lee, Hwanjo Yu

Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection

Suyeon Kim, Dongha Lee, SeongKu Kang, Sukang Chae, Sanghwan Jang, Hwanjo Yu

TORS

Unbiased, Effective, and Efficient Distillation from Heterogeneous Models for Recommender Systems

Seongku Kang, Wonbin Kweon, Dongha Lee, Jianxun Lian, Xing Xie, Hwanjo Yu

Improving Retrieval in Theme-specific Applications using a Corpus Topical Taxonomy

SeongKu Kang, Shivam Agarwal, Bowen Jin, Dongha Lee, Hwanjo Yu, Jiawei Han

EACL Findings

Evidentiality-Aware Retrieval for Overcoming Abstractiveness in Open-Domain Question Answering

Yongho Song, Dahyun Lee, Myungha Jang, Seung-won Hwang, Kyungjae Lee, Dongha Lee, Jinyoung Yeo

EACL

Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement

Hana Kim, Kai Tzu-iunn Ong, Seoyeon Kim, Dongha Lee, Jinyoung Yeo

Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales

Taeyoon Kwon, Kai Tzu-iunn Ong, Dongjin Kang, Seungjun Moon, Jeong Ryong Lee, Dosik Hwang, Beomseok Sohn, Yongsik Sim, Dongha Lee, Jinyoung Yeo

AAAI

Multi-Domain Recommendation to Attract Users via Domain Preference Modeling

Hyunjun Ju, Seongku Kang, Dongha Lee, Junyoung Hwang, Sanghwan Jang, Hwanjo Yu

2023

Preprint (arXiv)

RDGCL: Reaction-Diffusion Graph Contrastive Learning for Recommendation

Jeongwhan Choi, Hyowon Wi, Chaejeong Lee, Sung-Bae Cho, Dongha Lee, Noseong Park

EMNLP

Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents

Hyungjoo Chae, Yongho Song, Kai Tzu-iunn Ong, Taeyoon Kwon, Minjin Kim, Youngjae Yu, Dongha Lee, Dongyeop Kang, Jinyoung Yeo

Unsupervised Story Discovery from Continuous News Streams via Scalable Thematic Embedding

Susik Yoon, Dongha Lee, Yunyi Zhang, Jiawei Han

SCStory: Self-supervised and Continual Online Story Discovery

Susik Yoon, Yu Meng, Dongha Lee, Jiawei Han

Distillation from Heterogeneous Models for Top-K Recommendation

Seongku Kang, Wonbin Kweon, Dongha Lee, Jianxun Lian, Xing Xie, Hwanjo Yu

AAAI

Learning Topology-Specific Experts for Molecular Property Prediction

Suyeon Kim, Dongha Lee, Seongku Kang, Seonghyeon Lee, Hwanjo Yu

2022

EMNLP Findings

Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation

Dongha Lee, Jiaming Shen, Seonghyeon Lee, Susik Yoon, Hwanjo Yu, Jiawei Han

Information Sciences

Mitigating Viewpoint Sensitivity of Self-supervised One-class Classifiers

Hyunjun Ju, Dongha Lee, Seongku Kang, Hwanjo Yu

Toward Interpretable Semantic Textual Similarity via Optimal Transport-based Contrastive Sentence Learning

Seonghyeon Lee, Dongha Lee, Seongbo Jang, Hwanjo Yu

TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters

Dongha Lee, Jiaming Shen, Seongku Kang, Susik Yoon, Jiawei Han, Hwanjo Yu

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

Seongku Kang, Dongha Lee, Wonbin Kweon, Junyoung Hwang, Hwanjo Yu

Knowledge-Based Systems

Personalized Knowledge Distillation for Recommender System

Seongku Kang, Dongha Lee, Wonbin Kweon, Hwanjo Yu

2021

Out-of-Category Document Identification Using Target-Category Names as Weak Supervision

Dongha Lee, Dongmin Hyun, Jiawei Han, Hwanjo Yu

Learnable Structural Semantic Readout for Graph Classification

Dongha Lee, Suyeon Kim, Seonghyeon Lee, Chanyoung Park, Hwanjo Yu

Weakly Supervised Temporal Anomaly Segmentation with Dynamic Time Warping

Dongha Lee, Sehun Yu, Hyunjun Ju, Hwanjo Yu

Out-of-manifold Regularization in Contextual Embedding Space for Text Classification

Seonghyeon Lee, Dongha Lee, Hwanjo Yu

Bootstrapping User and Item Representations for One-Class Collaborative Filtering

Dongha Lee, Seongku Kang, Hyunjun Ju, Chanyoung Park, Hwanjo Yu

Learnable Dynamic Temporal Pooling for Time Series Classification

Dongha Lee, Seonghyeon Lee, Hwanjo Yu

2020

Multi-class Data Description for Out-of-distribution Detection

Dongha Lee, Sehun Yu, Hwanjo Yu

Generating Sequential Electronic Health Records using Dual Adversarial Autoencoder

Dongha Lee, Hwanjo Yu, Xiaoqian Jiang, Deevakar Rogith, Meghana Gudala, Mubeen Tejani, Qiuchen Zhang, Li Xiong

Harmonized Representation Learning on Dynamic EHR Graphs

Dongha Lee, Xiaoqian Jiang, Hwanjo Yu

Convolutional Neural Networks with Compression Complexity Pooling for Out-of-Distribution Image Detection

Sehun Yu, Dongha Lee, Hwanjo Yu

Information Sciences

PUMAD: PU Metric Learning for Anomaly Detection

Hyunjun Ju, Dongha Lee, Junyoung Hwang, Junghyun Namkung, Hwanjo Yu

Information Sciences

Scalable Disk-based Topic Modeling for Memory Limited Devices

Byungju Kim, Dongha Lee, Jinoh Oh, Hwanjo Yu

Information Sciences

OCAM: Out-of-core Coordinate Descent Algorithm for Matrix Completion

Dongha Lee, Jinoh Oh, Hwanjo Yu

Ph.D. Dissertation

Large-Scale Matrix and Tensor Completion based on Out-of-Core Approaches

Dongha Lee

2019

CIKM

Semi-Supervised Learning for Cross-Domain Recommendation to Cold-Start Users

Seongku Kang, Junyoung Hwang, Dongha Lee, Hwanjo Yu

Action Space Learning for Heterogeneous User Behavior Prediction

Dongha Lee, Chanyoung Park, Hyunjun Ju, Junyoung Hwang, Hwanjo Yu

Fast Tucker Factorization for Large-scale Tensor Completion

Dongha Lee, Jaehyung Lee, Hwanjo Yu

CIKM

Disk-based Matrix Completion for Memory Limited Devices

Dongha Lee, Jinoh Oh, Christos Faloutsos, Byungju Kim, Hwanjo Yu

IMCOM

DualSentiNet: Dual Prediction of Word and Document Sentiments Using Shared Word Embedding

Dongha Lee, Hyunjun Ju, Jung-Mi Park, Kye-Yoon Kim, Hwanjo Yu

KDBC

Compressing Model for Matrix Factorization with Quantization Using k-means Clustering

Junsu Cho, Dongha Lee, Hwanjo Yu

Information Sciences

GeoVideoIndex: Indexing for Georeferenced Video

Dongha Lee, Jinoh Oh, Woong-Kee Loh, Hwanjo Yu