Explore/framework/chinese-llm-benchmark
C

jeinlee1991/chinese-llm-benchmarkActive

非线智能 NoneLinear - ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、MiniMax-M2.7、deepseek-v4、Qwen3.6、llama4、智谱GLM-5.1、MiMo-V2、LongCat、gemma4、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。

framework
GitHubCompare
Refreshed 4d ago
OverviewActivity52wAlternativesDocs
Stars6.1k
Forks244
HF Downloads30d
Last commit9d ago
Refreshed4d ago
Project healthActiveLast commit 9d ago.
Production readinessMVP-readySuitable for non-critical production use.
Risk notesUnknown licenseVerify license before production use.
AgentHub Score
83 / 100
Composite score from 6 signals. How we score →
Active project
83Score
Growth
98A+
Activity
90A
Documentation
70C+
Maturity
89A
Community
95A+
Production
58C
GitHub stars · 90 days6.1k +15.1%
30d90d1y
latest release
Commit activity · 52 weeksActive contributor activity
LowHigh
JunSepDecMarNow
Practical assessment
Should you use it?

✓ Best for

  • Multi-agent orchestration
  • Production agentic workflows
  • Stateful long-running tasks

◎ Strengths

  • Stable API
  • Active release cadence
  • Strong GitHub community

✕ Not ideal for

  • Simple single-step automation
  • Teams without Python/ML expertise

⚠ Watch-outs

  • Breaking changes between minor versions
  • Ecosystem lock-in if tightly coupled
Technical details
What's inside
Language
License
Sourcegithub
Open source✗ No
Commercial use
Docs
Demo
Paper

AgentHub Score

83
Score 83/100
Top performer

Alternatives

C
crewai
26.1k · Multi-Agent
87
A
autogen
42.7k · Multi-Agent
71
S
smolagents
11.2k · Coding
84
O
openai-agents-python
9.4k · Multi-Agent
81
Compare all →

Recent activity

Latest commit 9d ago9d ago
Indexed by AgentHub crawler4d ago
Monitor for new releasesongoing