Collections
Collections including paper arxiv:2511.23404
-
OptiMind: Teaching LLMs to Think Like Optimization Experts
Paper • 2509.22979 • Published • 4 -
LFM2 Technical Report
Paper • 2511.23404 • Published • 61 -
Zero-Overhead Introspection for Adaptive Test-Time Compute
Paper • 2512.01457 • Published • 3 -
Confidence Estimation for LLMs in Multi-turn Interactions
Paper • 2601.02179 • Published • 17
-
LFM2 Technical Report
Paper • 2511.23404 • Published • 61 -
Artificial Hippocampus Networks for Efficient Long-Context Modeling
Paper • 2510.07318 • Published • 32 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 231 -
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Paper • 2410.20672 • Published • 7
-
Xtra-Computing/XtraGPT-14B
Text Generation • 15B • Updated • 276 • 5 -
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Paper • 2601.11077 • Published • 67 -
Molecular Contrastive Learning with Chemical Element Knowledge Graph
Paper • 2112.00544 • Published • 1 -
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
Paper • 2404.00884 • Published • 1
-
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Paper • 2312.08578 • Published • 20 -
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks
Paper • 2312.08583 • Published • 11 -
Vision-Language Models as a Source of Rewards
Paper • 2312.09187 • Published • 12 -
StemGen: A music generation model that listens
Paper • 2312.08723 • Published • 48