Decision Workspace
serdes-ai-evals vs adk-eval vs llm-test-bench-core
Side-by-side comparison of Rust crates
serdes-ai-evals
experimental, v0.2.6, health score 48/100
Evaluation framework for testing and benchmarking serdes-ai agents
adk-eval
experimental, v0.4.0, health score 57/100
Agent evaluation framework for ADK-Rust
llm-test-bench-core
experimental, v0.1.0, health score 42/100
Core library for LLM Test Bench - comprehensive testing framework for Large Language Models with 65+ supported models across 14+ providers
Core Metrics
| Metric | serdes-ai-evals | adk-eval | llm-test-bench-core |
|---|---|---|---|
| Health Score | 48 | 57 | 42 |
| Total Downloads | 399 | 972 | 127 |
| 30d Downloads | 109 | 659 | 26 |
| Dependents | 8 | 3 | 1 |
| Releases | 10 | 12 | 1 |
| Last Updated | 35d ago | 12d ago | 143d ago |
| Age | 2m | 3m | 4m |
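One way to read the download rows is as a rough momentum indicator: the share of all-time downloads that fell within the last 30 days. The sketch below is only an illustration using the figures from the table; the helper name `recent_share` is hypothetical and the page itself does not report this ratio.

```rust
/// Percentage of all-time downloads that occurred in the last 30 days.
/// Hypothetical helper for illustration; returns 0.0 when there are no downloads.
fn recent_share(total: u64, last_30d: u64) -> f64 {
    if total == 0 {
        return 0.0;
    }
    last_30d as f64 / total as f64 * 100.0
}

fn main() {
    // (crate name, total downloads, 30-day downloads) copied from the table above.
    let crates = [
        ("serdes-ai-evals", 399, 109),
        ("adk-eval", 972, 659),
        ("llm-test-bench-core", 127, 26),
    ];

    for (name, total, recent) in crates {
        println!("{name}: {:.1}% of downloads in the last 30 days", recent_share(total, recent));
    }
}
```

On these numbers, roughly two thirds of adk-eval's downloads are recent, against about a quarter for serdes-ai-evals and a fifth for llm-test-bench-core. With totals this small, the ratio is a hint at best.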
Health Breakdown
| Category | serdes-ai-evals | adk-eval | llm-test-bench-core |
|---|---|---|---|
| Maintenance | 11 | 19 | 6 |
| Quality | 14 | 12 | 11 |
| Community | 8 | 7 | 7 |
| Popularity | 3 | 4 | 3 |
| Documentation | 12 | 15 | 15 |
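The five category values for each crate add up to the health score shown under Core Metrics. The sketch below checks that arithmetic. It assumes the overall score is a plain sum of the categories, which matches the figures on this page but is not stated by it; the `Breakdown` type and `breakdowns` function are hypothetical names introduced for illustration.

```rust
/// Per-category health points as listed in the breakdown.
struct Breakdown {
    name: &'static str,
    maintenance: u32,
    quality: u32,
    community: u32,
    popularity: u32,
    documentation: u32,
}

impl Breakdown {
    /// Assumption: the overall score is the unweighted sum of the five categories.
    fn total(&self) -> u32 {
        self.maintenance + self.quality + self.community + self.popularity + self.documentation
    }
}

/// Values copied from the Health Breakdown section.
fn breakdowns() -> [Breakdown; 3] {
    [
        Breakdown { name: "serdes-ai-evals", maintenance: 11, quality: 14, community: 8, popularity: 3, documentation: 12 },
        Breakdown { name: "adk-eval", maintenance: 19, quality: 12, community: 7, popularity: 4, documentation: 15 },
        Breakdown { name: "llm-test-bench-core", maintenance: 6, quality: 11, community: 7, popularity: 3, documentation: 15 },
    ]
}

fn main() {
    for b in breakdowns() {
        println!("{}: {}", b.name, b.total());
    }
}
```

The sums come out to 48, 57 and 42. Most of adk-eval's lead comes from Maintenance (19 versus 11 and 6), while Popularity is nearly identical across all three crates.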
Technical Details
| Detail | serdes-ai-evals | adk-eval | llm-test-bench-core |
|---|---|---|---|
| Version | 0.2.6 | 0.4.0 | 0.1.0 |
| Stable (≥1.0) | ✗ No | ✗ No | ✗ No |
| License | MIT | Apache-2.0 | MIT OR Apache-2.0 |
| Dependencies | 15 | 15 | 55 |
| Crate Size | 29KB | 43KB | 355KB |
| Features | 2 | 0 | 2 |
| Yanked % | 0.0% | 0.0% | 0.0% |
| Edition | 2021 | 2024 | 2021 |
| MSRV | 1.75.0 | 1.85.0 | 1.75.0 |
| Owners | 1 | 1 | 1 |
Quick Verdict
- adk-eval leads with a health score of 57/100, ahead of serdes-ai-evals (48) and llm-test-bench-core (42), but none of the three crates scores above 80.