arxiv:2606.23543
Kai Zheng
tangmen
AI & ML interests
None yet
Recent Activity
authored a paper about 5 hours ago
RubricBench: Aligning Model-Generated Rubrics with Human Standards
authored a paper about 5 hours ago
OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents
authored a paper about 5 hours ago
Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models