Skill

content-access

Guides Claude through paywalled source retrieval: free version search, Unpaywall, CORE, Semantic Scholar. Prevents premature low-confidence judgments.

automation

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/spotlight:content-access

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

When a source is behind a paywall or access barrier, work through this hierarchy before marking it inaccessible. A paywalled source should not default to `low` confidence — it should default to an access attempt.

SKILL.md

202 lines · ~2.1k tokens

Stats

LanguageHTML

Parent stars23

Parent forks3

MaintenanceGood

Last CommitJun 9, 2026

Actions

View Source View Plugin View on GitHub View README

Content Access

When a source is behind a paywall or access barrier, work through this hierarchy before marking it inaccessible. A paywalled source should not default to low confidence — it should default to an access attempt.

Decision Tree

Try each step in order. Stop as soon as you have the full text.

Before any execute-shell command that uses a DOI, URL, query, timestamp, filename, or path, invoke shell-safety and validate the value with scripts/spotlight_safe.py. Prefer --data-urlencode or serialized JSON over interpolating values into shell strings.

External-access boundary: Unpaywall, CORE, Semantic Scholar, web archives, and similar services may receive only the minimum DOI, title/query, URL, or timestamp needed to retrieve the source. Do not send case notes, raw source dumps, unpublished allegations, private source material, credentials, vault contents, or unnecessary personal data. In sensitive mode, remote access services are disabled unless the user explicitly approves an override.

1. Free Version Search

Many paywalled articles have freely distributed copies. Search for them:

search: query="{exact title}" filetype:pdf
search: query="{exact title}" site:{author-institution}.edu
search: query="{exact title}" site:researchgate.net
search: query="{exact title}" site:academia.edu

2. Unpaywall (Academic Papers with DOI, Optional Integration)

Unpaywall is optional. Before using it, confirm the integration is green:

execute-shell: python3 integrations/preflight.py --json

Use Unpaywall only when unpaywall is green. It requires $UNPAYWALL_EMAIL in .env; the email is a fair-use identifier, not a notification target. If the integration is red or missing, skip to step 3.

execute-shell: python3 scripts/spotlight_safe.py validate-doi "{DOI}"
execute-shell: curl --get "https://api.unpaywall.org/v2/{DOI}" --data-urlencode "email=$UNPAYWALL_EMAIL"

Parse the response for best_oa_location.url — if present, it's the legal open-access copy. Also check oa_locations[] for mirrors.

If a URL is found, fetch the full text:

fetch: url={oa_location_url}, output_path={CASE_DIR}/research/{filename}.md

3. CORE (295M Open Access Papers)

execute-shell: curl --get "https://api.core.ac.uk/v3/search/works" \
  --data-urlencode "q={query}" \
  --data-urlencode "limit=5" \
  -H "Authorization: Bearer ${CORE_API_KEY}"

Check results[].downloadUrl for PDF links.

4. Semantic Scholar

execute-shell: curl --get "https://api.semanticscholar.org/graph/v1/paper/search" \
  --data-urlencode "query={query}" \
  --data-urlencode "fields=title,openAccessPdf,externalIds" \
  --data-urlencode "limit=5"

Check openAccessPdf.url.

5. Archive Copy

Check if an archived version is accessible before the paywall went up:

execute-shell: python3 scripts/spotlight_safe.py validate-url "{URL}"
execute-shell: curl --get "http://web.archive.org/cdx/search/cdx" \
  --data-urlencode "url={URL}" \
  --data-urlencode "output=json" \
  --data-urlencode "limit=3" \
  --data-urlencode "fl=timestamp,statuscode"

execute-shell: python3 scripts/spotlight_safe.py validate-url "{URL}"
execute-shell: curl -s "https://archive.ph/newest/" --get --data-urlencode "url={URL}"

Early snapshots often predate paywall implementation. If a pre-paywall snapshot is found, retrieve it:

execute-shell: python3 scripts/spotlight_safe.py validate-timestamp "{TIMESTAMP}"
execute-shell: python3 scripts/spotlight_safe.py validate-url "{URL}"
fetch: url=https://web.archive.org/web/{TIMESTAMP}/{URL}, output_path={CASE_DIR}/research/archived/{filename}.md

6. Google Scholar

Search the title in Google Scholar. Look for [PDF] links next to results — these are often author-posted copies on institutional servers.

search: query="{exact title}" site:scholar.google.com

7. Reader Mode (Metered Paywalls)

For publications with free article limits (e.g., Financial Times, Bloomberg):

Safari: Cmd-Shift-R or View > Show Reader
Firefox: F9 or the reader icon in the address bar
Edge: reader icon in address bar

Works only on metered paywalls, not hard paywalls. This is a manual step — note in access_notes if reader mode was used.

8. Author Contact (Last Resort)

Email the author directly. Most authors are willing to share their work:

Subject: Research access request — [{Title}]

I'm a journalist investigating [{topic}] and would like to access your paper "{Title}" (published {year}). Would you be able to share a copy? I'm happy to credit your work in my reporting.

Note the outreach in access_notes and set access_method to author_request_pending until a response is received.

Do Not

Use browser extension bypasses (Bypass Paywalls Clean, Unpaywall browser extension in bypass mode)
Use Sci-Hub or similar sites for current journalism work — legal and reputational risk
Scrape paywalled pages by rotating user agents or sessions — ToS violation
Pretend to be an institutional user to access licensed databases

Source Metadata

Add access_method to every source entry in findings.json and fact-check.json:

{
  "url": "https://example.com/paywalled-article",
  "type": "news",
  "accessed": "2026-03-15T14:20:00Z",
  "access_method": "archive_copy",
  "access_notes": "Retrieved via Wayback Machine snapshot from 2024-11-02"
}

Valid values for access_method:

full_text — Complete article accessed directly (no barrier)
open_access — Legal open-access copy found via Unpaywall/CORE/author
free_version — Freely distributed copy found via search (PDF, institutional mirror)
archive_copy — Accessed via Wayback Machine or Archive.today
google_scholar_pdf — PDF found via Google Scholar institutional link
reader_mode — Retrieved via browser reader mode on metered paywall
author_provided — Author shared copy in response to request
author_request_pending — Author contact sent, awaiting response
abstract_only — Only abstract/summary available
partial_text — Some content accessible (e.g., first paragraphs before hard paywall)
cached_copy — Found in search engine cache (Google, Bing)
institutional_access — Accessed through legitimate institutional subscription
preprint — Preprint version found (arXiv, SSRN, bioRxiv, etc.)
inaccessible — Could not access after full hierarchy attempt

Confidence Implications

access_method	Confidence cap
`full_text` / `open_access` / `author_provided` / `institutional_access`	No cap
`free_version` / `google_scholar_pdf` / `preprint`	No cap (but note version date — may differ from published version)
`archive_copy` / `cached_copy`	No cap (but note snapshot date)
`reader_mode` / `partial_text`	`medium` at best — may be missing key content
`abstract_only`	`medium` at best — abstract may omit key findings
`author_request_pending`	`low` — pending, do not cite as verified
`inaccessible`	`low` — cite the source but flag that content was not verified

A finding that rests on an inaccessible source must say so explicitly in confidence_rationale. Do not inflate confidence because the source sounds authoritative if you have not read it.

Rules

Attempt access before downgrading. Work through at least the first five steps before marking a source inaccessible.
Record what you tried. Note attempted access methods in access_notes so the next cycle doesn't repeat the effort.
Abstract != full text. Never cite a finding as supported by a paper you only read the abstract of without flagging it.
Date-check archived copies. An archived copy from 5 years ago may not reflect the current version of a document. Note the snapshot date.

Credits

Adapted from claude-skills-journalism by Jay Amditis, released under MIT License.

content-access

Popularity

Invocation

Context Preview

SKILL.md

content-access

Popularity

Invocation

Context Preview

SKILL.md

Content Access

Decision Tree

1. Free Version Search

2. Unpaywall (Academic Papers with DOI, Optional Integration)

3. CORE (295M Open Access Papers)

4. Semantic Scholar

5. Archive Copy

6. Google Scholar

7. Reader Mode (Metered Paywalls)

8. Author Contact (Last Resort)

Do Not

Source Metadata

Confidence Implications

Rules

Credits

Similar Skills

Content Access

Decision Tree

1. Free Version Search

2. Unpaywall (Academic Papers with DOI, Optional Integration)

3. CORE (295M Open Access Papers)

4. Semantic Scholar

5. Archive Copy

6. Google Scholar

7. Reader Mode (Metered Paywalls)

8. Author Contact (Last Resort)

Do Not

Source Metadata

Confidence Implications

Rules

Credits

Similar Skills