Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
Violet Xiang PRO
violetxi
Recent Activity
updated a model about 3 hours ago: violetxi/poker-env_only-qwen35-4b
published a model about 3 hours ago: violetxi/poker-env_only-qwen35-4b
updated a model about 3 hours ago: violetxi/poker-action_only-qwen35-4b