Diwank Tomer PRO
diwank
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago: pat-jj/harness-1
upvoted a paper 3 days ago: Polar: Agentic RL on Any Harness at Scale
liked a model 5 days ago: JetBrains/Mellum2-12B-A2.5B-Thinking
Organizations
Text-diffusion
world
code
reasoning
- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • Updated • 5.69M • 13.4k
- deepseek-ai/DeepSeek-R1-Zero
  Text Generation • 685B • Updated • 8.39k • 958
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  15B • Updated • 543k • 653
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
  Text Generation • 2B • Updated • 762k • 1.52k
search
Art
S1.1
Audio
M
- PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts
  Paper • 2508.09848 • Published • 71
- ttchungc/PRELUDE
  Viewer • Updated • 1.16k • 147 • 19
- ai-hyz/MemoryAgentBench
  Viewer • Updated • 146 • 14.7k • 38
- TommyChien/MemoRAG-Training
  Viewer • Updated • 21.1k • 143 • 1
steadytext
Med
Robotics
F
Vision
K
Sam
thought
SAE
M