Explore/benchmark/WebArena
W

web-arena-x/WebArenaAbandoned

Self-hosted realistic web environment for evaluating autonomous agents.

benchmarkPythonApache-2.0Web AgentsBenchmarkEvaluation
GitHubCompare
Refreshed 4d ago
OverviewActivity52wAlternativesDocs
Stars1.5k
Forks241
HF Downloads30d
Last commit6mo ago
Refreshed4d ago
Project healthAbandonedNo commits in 6 months.
Production readinessExperimentalGrowing but not yet battle-tested at scale.
Risk notesApache-2.0Verify license before production use.
AgentHub Score
69 / 100
Composite score from 6 signals. How we score →
Active project
69Score
Growth
97A+
Activity
30C
Documentation
50C
Maturity
85B+
Community
95A+
Production
58C
GitHub stars · 90 days1.5k +12.7%
30d90d1y
latest release
Commit activity · 52 weeksActive contributor activity
LowHigh
JunSepDecMarNow
Practical assessment
Should you use it?

✓ Best for

  • Research and experimentation
  • Prototype development
  • Learning agentic patterns

◎ Strengths

  • Active community
  • Open source
  • Well-documented API

✕ Not ideal for

  • Untested at scale without validation
  • Teams without AI/ML expertise

⚠ Watch-outs

  • Review changelog before updating
  • Verify license for commercial use
Technical details
What's inside
LanguagePython
LicenseApache-2.0
Sourcegithub
Open source✗ No
Commercial use
Docs
Demo
Paper

AgentHub Score

69
Score 69/100
Above average

Alternatives

C
crewai
26.1k · Multi-Agent
87
A
autogen
42.7k · Multi-Agent
71
S
smolagents
11.2k · Coding
84
O
openai-agents-python
9.4k · Multi-Agent
81
Compare all →

Recent activity

Latest commit 6mo ago6mo ago
Indexed by AgentHub crawler4d ago
Monitor for new releasesongoing

Tags