Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Building on HF
60.8
TFLOPS
549
112
331
Nathan Habib
PRO
SaylorTwift
Follow
seekinmonky's profile picture
jdelavande's profile picture
Samuel00000's profile picture
433 followers
·
385 following
nathanhabib1011
NathanHB
AI & ML interests
Evals
Recent Activity
new
activity
1 day ago
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4:
Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)
new
activity
1 day ago
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16:
Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)
liked
a model
1 day ago
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4
View all activity
Organizations
SaylorTwift
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4
1 day ago
Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)
#6 opened 1 day ago by
SaylorTwift
New activity in
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16
1 day ago
Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)
#3 opened 1 day ago by
SaylorTwift
Add GPQA evaluation result
#2 opened 1 day ago by
SaylorTwift
New activity in
meituan-longcat/WBench
1 day ago
Register WBench as benchmark (add eval.yaml)
17
#9 opened 8 days ago by
Kaining
New activity in
MMMU/MMMU_Pro
1 day ago
Update eval.yaml
👍
1
#7 opened 1 day ago by
SaylorTwift
New activity in
google/gemma-4-12B-it
3 days ago
Add HLE evaluation result
#7 opened 3 days ago by
SaylorTwift
Add AIME 2026 evaluation result
#6 opened 3 days ago by
SaylorTwift
Add MMMU Pro evaluation result
#5 opened 3 days ago by
SaylorTwift
Add MMLU-Pro evaluation result
#4 opened 3 days ago by
SaylorTwift
Add GPQA Diamond evaluation result
#3 opened 3 days ago by
SaylorTwift
New activity in
actava/chi-bench
3 days ago
Make chi-bench an community benchmark
🤗
3
4
#2 opened 5 days ago by
SaylorTwift
New activity in
Kwai-Keye/Keye-VL-2.0-30B-A3B
5 days ago
Add Video-MME-v2 evaluation result
#5 opened 5 days ago by
SaylorTwift
Add AIME 2026 evaluation result
#4 opened 5 days ago by
SaylorTwift
Add SWE-bench Verified evaluation result
#3 opened 5 days ago by
SaylorTwift
New activity in
stepfun-ai/Step-3.7-Flash
8 days ago
Add SWE-bench Pro evaluation result
#4 opened 8 days ago by
SaylorTwift
Add HLE with tools evaluation result
#3 opened 8 days ago by
SaylorTwift
New activity in
LiquidAI/LFM2.5-8B-A1B
8 days ago
Add AIME 2026 evaluation result
#4 opened 8 days ago by
SaylorTwift
New activity in
facebook/flores
8 days ago
Convert dataset to Parquet
8
#8 opened 10 months ago by
SaylorTwift
New activity in
gaia-benchmark/leaderboard
9 days ago
Fix OAuth login broken by missing gradio[oauth] extra; reload eval_results on submit
1
#102 opened 9 days ago by
SaylorTwift
New activity in
InternScience/ResearchClawBench
9 days ago
Benchmark allow-list request for ResearchClawBench
🔥
1
8
#9 opened 17 days ago by
black-yt
Load more