From morning-ai
Tracks AI model rankings and scores on leaderboards like LMSYS Chatbot Arena and HuggingFace Open LLM using SQLite snapshots. Detects rank and score changes between snapshots.
npx claudepluginhub octo-patch/morningai --plugin morning-aiThis skill uses the workspace's default tool permissions.
Track AI model leaderboard rankings over time using SQLite snapshots. Detect new models, removed models, rank changes, and score changes between dates.
Adds and manages structured evaluation results in Hugging Face model cards: extracts tables from READMEs, imports from Artificial Analysis API, runs custom evals with vLLM/lighteval/inspect-ai. Supports model-index format.
Queries Hugging Face benchmark leaderboards to find top models for tasks like coding, math, chat, or image classification, enriches with sizes, filters by device memory, and outputs comparison tables.
Defines unified specification for tracking, scoring, and validating AI news items in Product, Model, Benchmark, and Funding categories with include/exclude rules and source priorities.
Share bugs, ideas, or general feedback.
Track AI model leaderboard rankings over time using SQLite snapshots. Detect new models, removed models, rank changes, and score changes between dates.
| Leaderboard | URL | Modality |
|---|---|---|
| LMSYS Chatbot Arena | https://lmsys.org | Text, Vision |
| LMArena | https://lmarena.ai | Text, Vision |
| HuggingFace Open LLM | https://huggingface.co/spaces/open-llm-leaderboard | Text |
| Artificial Analysis | https://artificialanalysis.ai | Text, Image, Video |
| Scale AI SEAL | https://scale.com/leaderboard | Text |
cd {SKILL_DIR} && python3 skills/leaderboard/scripts/leaderboard_snapshot.py save \
--leaderboard "chatbot-arena" \
--date 2026-04-14 \
--data '[{"model": "claude-4-opus", "rank": 1, "score": 1350}]'
Prints the diff against the previous snapshot (new models, rank changes, score changes).
cd {SKILL_DIR} && python3 skills/leaderboard/scripts/leaderboard_snapshot.py latest \
--leaderboard "chatbot-arena"
Snapshots are stored in ~/.cache/morning-ai/leaderboard.db (SQLite). Each entry has:
leaderboard — leaderboard identifiermodel — model namerank — position on the leaderboardscore — numeric score (ELO, accuracy, etc.)snapshot_date — date of the snapshotThis skill is currently a standalone utility. It can be integrated into the main morning-ai workflow as a Benchmark data source:
TrackerItem entries for rank changesTo integrate, a collector module (lib/leaderboard_collector.py) would:
save_snapshot() to persistdiff_snapshot() to detect changesTrackerItem objects with Benchmark type