Alex Rigler

aldaleri

35 246

https://choochoo.cc

AI & ML interests

systems, security & governance

Recent Activity

liked a model 2 days ago

deepreinforce-ai/Ornith-1.0-35B-GGUF

liked a model 2 days ago

nationaldesignstudio/rampart

upvoted a paper 2 days ago

Agents' Last Exam

View all activity

Organizations

liked 2 models 2 days ago

deepreinforce-ai/Ornith-1.0-35B-GGUF

Text Generation • 35B • Updated 6 days ago • 234k • 602

nationaldesignstudio/rampart

Token Classification • Updated 1 day ago • 414 • 93

upvoted 2 papers 2 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 29 days ago • 371

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

Paper • 2606.26790 • Published 7 days ago • 52

upvoted a paper 3 days ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published about 1 month ago • 236

liked 3 datasets 3 days ago

liked a model 3 days ago

zai-org/GLM-5.2

Text Generation • 753B • Updated 9 days ago • 160k • • 3.16k

liked 2 models 8 days ago

Qwen/Qwen-AgentWorld-35B-A3B

Text Generation • 35B • Updated 7 days ago • 34.4k • 493

allenai/tmax-9b

9B • Updated 9 days ago • 4.72k • 8

upvoted a paper 8 days ago

Tmax: A simple recipe for terminal agents

Paper • 2606.23321 • Published 10 days ago • 14

upvoted an article 10 days ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

BenjaminB, sayakpaul, hubnemo, kashif

•

14 days ago

• 70

upvoted a paper 16 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 249

liked a dataset 16 days ago

OxRML/MADQA

Viewer • Updated Mar 13 • 3.05k • 539 • 20

liked a model 16 days ago

poolside/Laguna-XS.2

Text Generation • 33B • Updated about 12 hours ago • 87.8k • 317

liked 2 datasets 16 days ago

nvidia/CantTalkAboutThis-Topic-Control-Dataset

Viewer • Updated Jan 16, 2025 • 1.09k • 354 • 12

nvidia/Nemotron-Safety-Guard-Dataset-v3

Viewer • Updated Feb 3 • 515k • 1.63k • 32

upvoted an article 16 days ago

Article

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

nvidia

•

27 days ago

• 12