arxiv:2504.07128
Dongchan Shin
ShinDC
AI & ML interests
NLP
Recent Activity
updated a dataset about 1 month ago
ShinDC/mdaqa_corpus published a dataset about 1 month ago
ShinDC/mdaqa_corpus upvoted a paper about 1 year ago
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent
Trajectories