evalframe vs serdes-ai-evals vs crabllm-bench

Side-by-side comparison of Rust crates

evalframe

experimental v0.3.0

Standalone eval framework for LLM outputs — Lua DSL with Rust host

serdes-ai-evals

experimental v0.2.6

Evaluation framework for testing and benchmarking serdes-ai agents

crabllm-bench

experimental v0.0.7

Mock OpenAI backend for benchmarking crabllm

Core Metrics

	evalframe	serdes-ai-evals	crabllm-bench
Health Score	42	48	44
Total Downloads	14	399	24
30d Downloads	14	109	24
Dependents	0	8	0
Releases	1	10	2
Last Updated	10d ago	35d ago	7d ago
Age	10d	2m	8d

Health Breakdown

	evalframe	serdes-ai-evals	crabllm-bench
Maintenance	12	11	14
Quality	13	14	13
Community	6	8	5
Popularity	1	3	2
Documentation	10	12	10
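
For each crate, the five category scores listed here add up to the Health Score reported under Core Metrics. The sketch below checks that arithmetic. It assumes the overall score is a plain unweighted sum, which matches these three crates but is not stated anywhere on the page; the function name `health_score` is hypothetical.

```rust
/// Sum the five category scores (Maintenance, Quality, Community,
/// Popularity, Documentation) into one health score.
/// Assumption: the overall score is an unweighted sum of the categories.
fn health_score(categories: &[u32; 5]) -> u32 {
    categories.iter().sum()
}

fn main() {
    // (crate name, category scores, Health Score reported under Core Metrics)
    let crates = [
        ("evalframe", [12, 13, 6, 1, 10], 42),
        ("serdes-ai-evals", [11, 14, 8, 3, 12], 48),
        ("crabllm-bench", [14, 13, 5, 2, 10], 44),
    ];
    for (name, categories, reported) in crates {
        let total = health_score(&categories);
        println!("{name}: computed {total}, reported {reported}");
        // Panics if a computed total disagrees with the reported score.
        assert_eq!(total, reported);
    }
}
```

If the page weighted the categories differently, the totals would not match and the assertion would fail, so the check doubles as a quick consistency test on the scraped numbers.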

Technical Details

	evalframe	serdes-ai-evals	crabllm-bench
Version	0.3.0	0.2.6	0.0.7
Stable (≥1.0)	✗ No	✗ No	✗ No
License	MIT	MIT	MIT OR Apache-2.0
Dependencies	5	15	6
Crate Size	45KB	29KB	11KB
Features	0	2	0
Yanked %	0.0%	0.0%	0.0%
Edition	2021	2021	2024
MSRV	—	1.75.0	—
Owners	1	1	1

Quick Verdict

• serdes-ai-evals leads with a health score of 48/100, ahead of crabllm-bench (44) and evalframe (42), but none of the three scores above 80.
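
The verdict rule can be sketched as picking the highest health score and then checking whether any crate clears the 80-point bar. This is an illustration only: the helper names `leader` and `any_strong` and the constant `STRONG_SCORE` are hypothetical, and the page does not say how it breaks ties.

```rust
/// Threshold taken from the verdict text ("none ... above 80").
const STRONG_SCORE: u32 = 80;

/// Return the highest-scoring crate, or None for an empty list.
/// On a tie, `max_by_key` keeps the last maximum it sees.
fn leader<'a>(scores: &[(&'a str, u32)]) -> Option<(&'a str, u32)> {
    scores.iter().copied().max_by_key(|&(_, score)| score)
}

/// True when at least one crate scores strictly above the threshold.
fn any_strong(scores: &[(&str, u32)]) -> bool {
    scores.iter().any(|&(_, score)| score > STRONG_SCORE)
}

fn main() {
    // Health scores from the Core Metrics table.
    let scores = [
        ("evalframe", 42),
        ("serdes-ai-evals", 48),
        ("crabllm-bench", 44),
    ];
    if let Some((name, score)) = leader(&scores) {
        println!("{} leads with {}/100", name, score);
    }
    println!("any above {}: {}", STRONG_SCORE, any_strong(&scores));
}
```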