Kai Zheng

tangmen

AI & ML interests

None yet

Recent Activity

authored a paper about 5 hours ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

authored a paper about 5 hours ago

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

authored a paper about 5 hours ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Organizations

Papers 9

arxiv:2606.23543

arxiv:2603.01571

arxiv:2603.01562

arxiv:2601.18467

Models 5

tangmen/zephyr-7b-dpo-qlora

Updated Jan 21, 2024

tangmen/zephyr-7b-dpo-full

Updated Jan 21, 2024

tangmen/WizardVerseV1

Updated Dec 7, 2023

tangmen/WizardVerse

Updated Dec 7, 2023

tangmen/chatV

Updated Oct 3, 2023

Datasets 0

None public yet