Stats
Actions
Tags
From liter-llm
Handles streaming token output from LLMs via liter-llm over SSE or async iterators, including delta handling and null-content chunks.
How this skill is triggered — by the user, by Claude, or both
Slash command
/liter-llm:streaming-responsesThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use `chat_stream(...)` to receive tokens as they are produced instead of waiting
Use chat_stream(...) to receive tokens as they are produced instead of waiting
for the full completion. The proxy streams over SSE; bindings expose async
iterators.
import asyncio, os
from liter_llm import LlmClient
async def main() -> None:
client = LlmClient(api_key=os.environ["OPENAI_API_KEY"])
async for chunk in await client.chat_stream(
model="openai/gpt-4o",
messages=[{"role": "user", "content": "Tell me a story"}],
):
if chunk.choices and chunk.choices[0].delta.content:
print(chunk.choices[0].delta.content, end="", flush=True)
print()
asyncio.run(main())
const chunks = await client.chatStream({
model: "openai/gpt-4o",
messages: [{ role: "user", content: "Tell me a story" }],
});
for (const chunk of chunks) {
process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
}
chunk.choices[0].delta.content (Python) or
chunk.choices[0]?.delta?.content (TypeScript) before using it.delta.tool_calls; accumulate
function.arguments fragments across chunks before parsing."stream": true on
/v1/chat/completions.Creates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.
npx claudepluginhub xberg-io/plugins --plugin liter-llm