AgentBench — AI benchmark

Stars3.5k

Forks261

HF Downloads—30d

Last commit4mo ago

Refreshed1d ago

Project healthStaleNo commits in 121d.

Production readinessMVP-readySuitable for non-critical production use.

Risk notesApache-2.0Verify license before production use.

AgentHub Score

73 / 100

Composite score from 6 signals. How we score →

Active project

73Score

Growth

98A+

Activity

50C

Documentation

50C

Maturity

87A

Community

95A+

Production

58C

GitHub stars · 90 days3.5k +14.2%

30d90d1y

Commit activity · 52 weeksActive contributor activity

LowHigh

JunSepDecMarNow

Practical assessment

Should you use it?

✓ Best for

Research and experimentation
Prototype development
Learning agentic patterns

◎ Strengths

Active community
Open source
Well-documented API

✕ Not ideal for

Untested at scale without validation
Teams without AI/ML expertise

⚠ Watch-outs

Review changelog before updating
Verify license for commercial use

Technical details

What's inside

LanguagePython

LicenseApache-2.0

Sourcegithub

Open source✗ No

Commercial use—

Docs—

Demo—

Paper—

AgentHub Score

Score 73/100

Above average

Alternatives

crewai

26.1k · Multi-Agent

autogen

42.7k · Multi-Agent

smolagents

11.2k · Coding

openai-agents-python

9.4k · Multi-Agent

Compare all →

Recent activity

Latest commit 4mo ago4mo ago

Indexed by AgentHub crawler1d ago

Monitor for new releasesongoing

THUDM/AgentBenchStale

✓ Best for

◎ Strengths

✕ Not ideal for

⚠ Watch-outs

AgentHub Score

Alternatives

Recent activity

Tags