Datasets and Models for the REEval project
Datasets (51)
stair-lab/nonmyopia_results • 5.58k
stair-lab/code_insights_results • 23
stair-lab/fantastic-bugs • 404 • 106
stair-lab/reeval_fa • 21.2k • 8
stair-lab/cultural_value_understanding_wvs • 1k • 17
stair-lab/chatbot_arena_embedding • 323k • 3
stair-lab/chatbot_arena • 23.3k • 9
stair-lab/zeroshot_evaluator • 1M • 12
stair-lab/zero_shot_evaluator_openllm_val • 7
stair-lab/zero_evaluator_agentic • 34.7k • 7