Decision Workspace
evalframe vs serdes-ai-evals vs crabllm-bench
Side-by-side comparison of Rust crates
42
evalframe
experimentalv0.3.0
Standalone eval framework for LLM outputs — Lua DSL with Rust host
48
serdes-ai-evals
experimentalv0.2.6
Evaluation framework for testing and benchmarking serdes-ai agents
44
crabllm-bench
experimentalv0.0.7
Mock OpenAI backend for benchmarking crabllm
Core Metrics
| evalframe | serdes-ai-evals | crabllm-bench | |
|---|---|---|---|
| Health Score | 42 | 48 | 44 |
| Total Downloads | 14 | 399 | 24 |
| 30d Downloads | 14 | 109 | 24 |
| Dependents | 0 | 8 | 0 |
| Releases | 1 | 10 | 2 |
| Last Updated | 10d ago | 35d ago | 7d ago |
| Age | 10d | 2m | 8d |
Health Breakdown
evalframe
Maintenance
12
Quality
13
Community
6
Popularity
1
Documentation
10
serdes-ai-evals
Maintenance
11
Quality
14
Community
8
Popularity
3
Documentation
12
crabllm-bench
Maintenance
14
Quality
13
Community
5
Popularity
2
Documentation
10
Technical Details
| evalframe | serdes-ai-evals | crabllm-bench | |
|---|---|---|---|
| Version | 0.3.0 | 0.2.6 | 0.0.7 |
| Stable (≥1.0) | ✗ No | ✗ No | ✗ No |
| License | MIT | MIT | MIT OR Apache-2.0 |
| Dependencies | 5 | 15 | 6 |
| Crate Size | 45KB | 29KB | 11KB |
| Features | 0 | 2 | 0 |
| Yanked % | 0.0% | 0.0% | 0.0% |
| Edition | 2021 | 2021 | 2024 |
| MSRV | — | 1.75.0 | — |
| Owners | 1 | 1 | 1 |
Links
Quick Verdict
- •serdes-ai-evals leads with a health score of 48/100, but none of the options score above 80.