From mims-harvard-tooluniverse
Identifies and annotates metabolites using HMDB and PubChem, retrieves study details from MetaboLights and Metabolomics Workbench, searches studies by keywords or disease for research reports.
npx claudepluginhub joshuarweaver/cascade-data-analytics --plugin mims-harvard-tooluniverseThis skill uses the workspace's default tool permissions.
Comprehensive metabolomics research skill that identifies metabolites, analyzes studies, and searches metabolomics databases. Generates structured research reports with annotated metabolite information, study details, and database statistics.
COMPARISON_ORIGINAL_VS_FIXED.mdQUICK_START.mdSKILL_TEST_FEEDBACK.mddiabetes_analysis.pydiabetes_metabolomics_report_FIXED.mdexample1_metabolites.mdexample2_study.mdexample3_search.mdpython_implementation.pypython_implementation_fixed.pyscripts/metabolism_ref.pytest1_metabolites.mdtest2_study.mdtest3_search.mdtest4_comprehensive.mdtest_detailed.pytest_skill.pytest_tools_debug.pyConducts multi-round deep research on GitHub repos via API and web searches, generating markdown reports with executive summaries, timelines, metrics, and Mermaid diagrams.
Dynamically discovers and combines enabled skills into cohesive, unexpected delightful experiences like interactive HTML or themed artifacts. Activates on 'surprise me', inspiration, or boredom cues.
Generates images from structured JSON prompts via Python script execution. Supports reference images and aspect ratios for characters, scenes, products, visuals.
Comprehensive metabolomics research skill that identifies metabolites, analyzes studies, and searches metabolomics databases. Generates structured research reports with annotated metabolite information, study details, and database statistics.
Use this skill when asked to:
Example queries:
Primary metabolite databases:
The skill executes a 4-phase analysis pipeline:
For each metabolite in the input list:
For provided study IDs:
For keyword searches:
Always included in reports:
Input:
Output report includes:
Input:
Output report includes:
Input:
Output report includes:
Input:
Output report includes:
List of metabolite names to identify and annotate.
["glucose"], ["lactate", "pyruvate", "acetate"]MetaboLights or Metabolomics Workbench study identifier.
"MTBLS1", "ST000001"Keyword to search metabolomics studies.
"diabetes", "glucose metabolism", "LC-MS"Target organism for study filtering.
"Homo sapiens""Mus musculus", "Saccharomyces cerevisiae"Path for the generated markdown report.
"my_analysis.md", "metabolomics_report.md"All analyses generate a structured markdown report with:
Header section:
Phase sections:
Database overview:
Error handling:
HMDB tools are SOAP-based and require special parameter handling:
HMDB_search: Requires operation="search" parameterHMDB_get_metabolite: Requires operation="get_metabolite" parameterendpoint or method parameters (not applicable to SOAP)Tools return different response formats - handle all three:
{status: "success", data: [...], metadata: {...}}[...] (e.g., metabolights_list_studies){field1: ..., field2: ...} (e.g., some detail endpoints)Always check response type with isinstance() before accessing fields.
Follow this hierarchy for robustness:
Write report incrementally to avoid memory issues:
The skill automatically discovers and uses these tools from ToolUniverse:
HMDB Tools:
HMDB_search: Search metabolites by nameHMDB_get_metabolite: Get detailed metabolite informationMetaboLights Tools:
metabolights_list_studies: List available studiesmetabolights_search_studies: Search studies by keywordmetabolights_get_study: Get study details by IDMetabolomics Workbench Tools:
MetabolomicsWorkbench_get_study: Get study informationMetabolomicsWorkbench_search_compound_by_name: Search compoundsPubChem Tools:
PubChem_get_CID_by_compound_name: Get PubChem CIDPubChem_get_compound_properties_by_CID: Get chemical propertiesNo manual tool configuration required - all tools loaded automatically.
Cause: HMDB search returned empty results or index error accessing first result Solution: This is expected for uncommon metabolites; PubChem fallback will be attempted
Cause: Study ID not found or API unavailable Solution: Verify study ID format (MTBLS* or ST*), check if study is public
Cause: Missing API keys for some databases
Solution: Check .env.template, add required API keys to .env file (most metabolomics tools work without keys)
Cause: Pipeline queries each metabolite individually Solution: Reports limit to first 10 metabolites; consider batching for >20 metabolites
The Metabolomics Research skill provides comprehensive metabolomics analysis through a 4-phase pipeline that:
Key Features:
operation parameter)Best for:
Metabolite identification starts with the mass spectrum. LOOK UP DON'T GUESS — always search HMDB/PubChem with the calculated neutral mass rather than guessing identity from m/z alone.
Metabolite identification: HMDB IDs provide the strongest annotation when paired with experimental validation. A PubChem-only match (fallback) indicates the metabolite is chemically characterized but may lack biological context (pathways, disease associations). Always report the identification confidence level.
Pathway enrichment strategy: When multiple metabolites map to the same KEGG or HMDB pathway, enrichment is meaningful only if the input list is unbiased (not pre-selected for that pathway). Report hits vs. pathway size (3/5 detected is more informative than 3/500). LOOK UP DON'T GUESS — use HMDB_get_metabolite to get pathway annotations for each metabolite rather than assuming pathway membership from names alone.
Biomarker discovery reasoning: A candidate biomarker should show: (1) consistent direction of change across samples (fold-change > 1.5), (2) statistical significance (FDR-adjusted p < 0.05), (3) biological plausibility — LOOK UP the metabolite's known disease associations via HMDB, and (4) reproducibility in an independent cohort. Single-study HMDB associations are hypothesis-generating, not confirmatory. Check MetaboLights/Metabolomics Workbench for independent validation datasets.
A complete metabolomics report should answer:
Limitations:
See QUICK_START.md for Python SDK examples, MCP integration, and step-by-step tutorials.