Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
Violet Xiang PRO
violetxi
AI & ML interests
None yet
Recent Activity
updated a model about 13 hours ago
violetxi/poker-action_only-qwen35-4b updated a model about 13 hours ago
violetxi/poker-env_only-qwen35-4b updated a model about 13 hours ago
violetxi/poker-vanilla-qwen35-4b