This is the first big project output from the
@eval-eval.bsky.social coalition! Thread below:
✅ Collaborative challenges targeting upstream problems
✅ Cross-disciplinary education
✅ Recognition for data & infrastructure work
✅ Community-owned infrastructure
All links follow 🤗
- Most downloaded datasets are evaluation benchmarks (MMLU, SQuAD, GLUE)
- Universities and research institutions dominate foundational data
- Domain-specific datasets thrive in finance, healthcare, robotics, and science
- Open datasets power most AI development!
This suggests practical deployment considerations often matter more than maximum capability. The community is building for real-world use, not just benchmarks.
Sign up here! tinyurl.com/ai-mirrors