Using RL to elicit LLMs' ability to leverage context to learn unseen languages!
Hanxu Hu PRO
HanxuHU
AI & ML interests
LLM, NLP
Recent Activity
authored a paper 1 day ago
DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning
authored a paper 1 day ago
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation
upvoted a paper 2 days ago
Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation