Skill

aidp-ingest-file-to-table

From oracle-ai-data-platform-workbench-engineer-agent

Ingests CSV/JSON/Parquet files into managed AIDP Delta tables via the `aidp` CLI (one-step or three-step upload→infer→create). Use when the user says "load this file into a table" or "create a table from a file."

data-engineering

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/oracle-ai-data-platform-workbench-engineer-agent:aidp-ingest-file-to-table

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Land a file into a managed AIDP table, either in one call or via the staged 3-step flow when you need to

SKILL.md

76 lines · ~1.4k tokens

Stats

LanguagePython

Parent stars36

Parent forks21

MaintenanceGood

Last CommitJun 12, 2026

Actions

View Source View Plugin View on GitHub View README

`aidp-ingest-file-to-table` — file → managed Delta table

Land a file into a managed AIDP table, either in one call or via the staged 3-step flow when you need to review/adjust the inferred schema. This is a control-plane flow on the DataLake schema/tables resource. Primary engine: the official Oracle aidp CLI (same REST API + auth); oci raw-request is the fallback when the CLI isn't installed.

When to use

"Load this CSV/JSON into a table", "create a table from ", "ingest into the lakehouse".

CLI (preferred)

Per references/aidp-cli-map.md: schema generate-temp-file-upload-target → schema infer / infer-with-preview → schema create-data-table / create-table (also schema retrieve-par). All commands take --instance-id <DATALAKE_OCID> --auth api_key --profile DEFAULT --region <r>.

# 3-step (control): stage → infer → create
aidp schema generate-temp-file-upload-target --instance-id <DATALAKE_OCID> --auth api_key --profile DEFAULT --region us-ashburn-1   # returns upload target / PAR (also: retrieve-par)
aidp schema infer-with-preview              --instance-id <DATALAKE_OCID> --auth api_key --profile DEFAULT --region us-ashburn-1   # review columns/types/preview (or: infer)
aidp schema create-data-table --body-file .aidp/payloads/create-data-table-<name>.json \
  --instance-id <DATALAKE_OCID> --auth api_key --profile DEFAULT --region us-ashburn-1                                            # or: create-table

Mutating ops (create-data-table/create-table, upload): persist the body to .aidp/payloads/ and confirm with the user before running (see references/payloads.md).

Fallback (no CLI) — same REST + auth via oci raw-request against …/20240831/dataLakes/<OCID>/… (auth ladder in references/oci-raw-request.md): POST /tables/actions/uploadDataFile (multipart/binary may need PAR upload — see aidp-volumes), POST /tables/actions/inferSchema, POST /tables/actions/createTable (with catalogKey, schemaKey, table name, finalized columns, source format, load options), verify GET /tables?catalogKey=<cat>&schemaKey=<cat.schema>.

Verify-first (no-fabrication): the upload/infer/create action shapes are UNVERIFIED in this env (not yet in references/rest-endpoint-map.md). Confirm with a live probe (start with a GET /tables?catalogKey=…&schemaKey=… 200 against the target schema) before any write; record results.

Live-verified 2026-06-10 on de-agent (CSV → de_ingest_test, 3 rows) — correction: the uploadDataFile / inferSchema / createTable action names above are WRONG. The working flow is the schema-resource 3-step: (1) generate-temp-file-upload-target returns a PAR + ociFilePath; (2) PUT the file bytes to the PAR (HTTP 200); (3) infer-with-preview — its location MUST be the ociFilePath OCI URI, not the uploadKey (passing uploadKey → 400); (4) create-data-table returns 202 + a datalake-async-operation-key (poll to SUCCEEDED). create-data-table is HEADERLESS/POSITIONAL: header=true is ignored at create, so tableFields must use the reader column names _c0/_c1/_c2… — naming them id/name/amt fails the async op with UNRESOLVED_COLUMN. Rename afterward via ALTER TABLE … RENAME COLUMN.

Workflow

Confirm the source file location (workspace path or volume) and the target catalog.schema.table (create the schema first if needed).
1-step (simple): aidp schema create-table referencing the source file, format, and options — fastest when the schema infers cleanly.
3-step (control): generate-temp-file-upload-target → infer-with-preview (review columns/types with the user; fix types/headers/delimiters) → create-data-table with the finalized columns.
Async: table creation may return 202 with an async-operation key — poll until terminal (async convention in references/oci-raw-request.md; track via aidp-observability).
Verify with aidp schema list-tables / GET /tables?…; report the fully-qualified table name and row/column summary.

Gotchas (documented limits, no workaround)

Delimited files: comma only — auto-populate "Doesn't support delimiters other than comma" (platform reference §42 Known Issues #15). Pre-convert tab/pipe/semicolon-delimited files to CSV before ingest.
No multi-line JSON for external tables — "Can't create external tables with multi-line JSON" (platform reference §42 Known Issues #12). Use newline-delimited JSON (one record per line) for external tables.

Notes

Big files: prefer landing into a volume / object storage and loading from there; mind cluster memory.
For continuous/streaming or external-source ingestion, use the spark-connectors plugin + aidp-federate, not this skill (this is file→table).
Clean up temporary tables created during validation.

References

references/aidp-cli-map.md · references/payloads.md · references/oci-raw-request.md · references/rest-endpoint-map.md
pairs with aidp-workspace-files, aidp-volumes, aidp-profiling-tables

aidp-ingest-file-to-table

Popularity

Invocation

Context Preview

SKILL.md

aidp-ingest-file-to-table

Popularity

Invocation

Context Preview

SKILL.md

`aidp-ingest-file-to-table` — file → managed Delta table

When to use

CLI (preferred)

Workflow

Gotchas (documented limits, no workaround)

Notes

References

Reused across plugins

Similar Skills

`aidp-ingest-file-to-table` — file → managed Delta table

When to use

CLI (preferred)

Workflow

Gotchas (documented limits, no workaround)

Notes

References

Reused across plugins

Similar Skills

aidp-ingest-file-to-table

Popularity

Invocation

Context Preview

SKILL.md

aidp-ingest-file-to-table

Popularity

Invocation

Context Preview

SKILL.md

aidp-ingest-file-to-table — file → managed Delta table

When to use

CLI (preferred)

Workflow

Gotchas (documented limits, no workaround)

Notes

References

Reused across plugins

Similar Skills

aidp-ingest-file-to-table — file → managed Delta table

When to use

CLI (preferred)

Workflow

Gotchas (documented limits, no workaround)

Notes

References

Reused across plugins

Similar Skills

`aidp-ingest-file-to-table` — file → managed Delta table

`aidp-ingest-file-to-table` — file → managed Delta table