**Goal**: Track the execution of analysis scripts and record data dependencies to enable reproducibility verification and analysis traceability.
Track script execution and record data dependencies for reproducibility verification. Use after running analysis scripts to document data lineage and enable full traceability of your data pipeline.
/plugin marketplace add syfyufei/adrian-marketplace/plugin install truth-verification@LLM-Research-MarketplaceGoal: Track the execution of analysis scripts and record data dependencies to enable reproducibility verification and analysis traceability.
When to use: After running analysis scripts, track their execution to document data lineage and enable reproducibility validation.
Before running this command:
.truth/manifest.json exists (run /truth-verification:init if needed)/truth-verification:register for all source data)/truth-verification:track --script codes/analysis.py --inputs data/raw/dataset.csv --outputs data/cleaned/results.csv
/truth-verification:track --script codes/preprocessing.py --inputs data/raw/raw1.csv,data/raw/raw2.csv --outputs data/cleaned/merged.csv
/truth-verification:track --script codes/analysis.py \
--inputs data/raw/dataset.csv \
--outputs data/cleaned/results.csv \
--parameters "threshold=0.75,method=zscore"
/truth-verification:track --script codes/analysis.py \
--inputs data/raw/dataset.csv \
--outputs data/cleaned/results.csv \
--execution-duration 45
/truth-verification:track --script codes/analysis.py \
--inputs data/raw/dataset.csv \
--outputs data/cleaned/results.csv \
--auto-register-outputs
Script Validation:
Input Verification:
Output Handling:
--auto-register-outputs)Dependency Graph Creation:
Manifest Update:
analysis_scripts arraydependencies arraylast_updated timestampResearch Memory Integration (if enabled):
memory/devlog.md (if using research-memory skill)#data-trackingLogging:
.truth/logs/track.log✓ Tracked script execution
Script: codes/analysis.py
Inputs: data/raw/dataset.csv (verified ✓)
Outputs: data/cleaned/results.csv (registered & hashed)
Dependencies: 2 edges added
Duration: 45 seconds
Status: Complete
⚠ Tracked script execution (with warnings)
Script: codes/analysis.py
Inputs: data/raw/dataset.csv (MODIFIED since registration!)
Expected hash: a7b3f8d9...
Current hash: c9d5b1a4...
Outputs: data/cleaned/results.csv (registered & hashed)
Dependencies: 2 edges added
Warning: Input data modified! Reproducibility may be affected.
| Error | Cause | Solution |
|---|---|---|
| "Script file not found" | Script path doesn't exist | Verify script file path |
| "Input file not registered" | Input file not in manifest | Run /truth-verification:register --file <input> first |
| "Output file not found" | Output file doesn't exist | Run script first to generate output, or use --dry-run to preview |
| "Hash mismatch on input" | Input file was modified | Investigate modification or re-register with --update |
| "Invalid manifest" | .truth/manifest.json corrupted | Restore from backup or reinitialize |
After successful execution:
analysis_scripts: New script entry with execution metadatadependencies: New edges linking inputs → script → outputs.truth/logs/track.log contains execution recordmemory/devlog.mdThe manifest's dependencies array tracks relationships:
{
"id": "dep_001",
"type": "data_to_script",
"from": "data/raw/dataset.csv",
"to": "codes/analysis.py",
"relationship": "input",
"created_at": "2025-12-15T14:30:00Z"
}
Graph types:
/truth-verification:track --script codes/analysis.py \
--inputs data/raw/dataset.csv \
--outputs data/cleaned/results.csv \
--dry-run
Shows what dependencies would be created without modifying manifest.
/truth-verification:track --script codes/download.py \
--external-inputs "https://api.example.com/data" \
--outputs data/raw/downloaded.csv
Records external data sources that scripts depend on.
/truth-verification:track --script codes/pipeline.py \
--inputs data/raw/raw.csv \
--outputs data/final/results.csv \
--include-intermediate results/step1.csv,results/step2.csv
Documents all intermediate processing steps for complete traceability.
/truth-verification:track --script codes/analysis.py \
--validate-inputs \
--validate-outputs
Validates all inputs are registered and all outputs exist before tracking.
When research-memory is installed, tracking automatically logs:
In memory/devlog.md:
## 2025-12-15 14:30 - Data Analysis Pipeline #data-tracking
Ran analysis script with:
- Input: data/raw/dataset.csv (verified ✓)
- Output: data/cleaned/results.csv (456 KB)
- Duration: 45 seconds
- Status: Complete
Dependency graph updated with 2 edges.
This creates an audit trail in your research memory.
After tracking script execution:
/truth-verification:verify --file data/cleaned/results.csv/truth-verification:reproduce (Phase 3) to validate complete chain/truth-verification:audit (Phase 4) to create full report/truth-verification:register - Register data files and outputs/truth-verification:verify - Check data integrity/truth-verification:reproduce - Validate reproducibility of complete chains (Phase 3)/truth-verification:audit - Generate comprehensive audit reports (Phase 4)/research-memory:remember - Log analysis work to memory (when research-memory installed)