Performance

Comprehensive performance optimization patterns for frontend, backend, and LLM inference.

Quick Reference

Category	Rules	Impact	When to Use
Core Web Vitals	3	CRITICAL	LCP, INP, CLS optimization with 2026 thresholds
Render Optimization	3	HIGH	React Compiler, memoization, virtualization
Lazy Loading	3	HIGH	Code splitting, route splitting, preloading
Image Optimization	3	HIGH	Next.js Image, AVIF/WebP, responsive images
Profiling & Backend	3	MEDIUM	React DevTools, py-spy, bundle analysis
LLM Inference	3	MEDIUM	vLLM, quantization, speculative decoding
Caching	2	HIGH	Redis cache-aside, prompt caching, HTTP cache headers
Query & Data Fetching	2	HIGH	TanStack Query prefetching, optimistic updates, rollback

Total: 22 rules across 8 categories

Core Web Vitals

Google's Core Web Vitals with 2026 stricter thresholds.

Rule	File	Key Pattern
LCP Optimization	`rules/cwv-lcp.md`	Preload hero, SSR, fetchpriority="high"
INP Optimization	`rules/cwv-inp.md`	scheduler.yield, useTransition, requestIdleCallback
CLS Prevention	`rules/cwv-cls.md`	Explicit dimensions, aspect-ratio, font-display

2026 Thresholds

Metric	Current Good	2026 Good
LCP	<= 2.5s	<= 2.0s
INP	<= 200ms	<= 150ms
CLS	<= 0.1	<= 0.08

Render Optimization

React render performance patterns for React 19+.

Rule	File	Key Pattern
React Compiler	`rules/render-compiler.md`	Auto-memoization, "Memo" badge verification
Manual Memoization	`rules/render-memo.md`	useMemo/useCallback escape hatches, state colocation
Virtualization	`rules/render-virtual.md`	TanStack Virtual for 100+ item lists

Lazy Loading

Code splitting and lazy loading with React.lazy and Suspense.

Rule	File	Key Pattern
React.lazy + Suspense	`rules/loading-lazy.md`	Component lazy loading, error boundaries
Route Splitting	`rules/loading-splitting.md`	React Router 7.x, Vite manual chunks
Preloading	`rules/loading-preload.md`	Prefetch on hover, modulepreload hints

Image Optimization

Production image optimization for modern web applications.

Rule	File	Key Pattern
Next.js Image	`rules/images-nextjs.md`	Image component, priority, blur placeholder
Format Selection	`rules/images-formats.md`	AVIF/WebP, quality 75-85, picture element
Responsive Images	`rules/images-responsive.md`	sizes prop, art direction, CDN loaders

Profiling & Backend

Profiling tools and backend optimization patterns.

Rule	File	Key Pattern
React Profiling	`rules/profiling-react.md`	DevTools Profiler, flamegraph, render counts
Backend Profiling	`rules/profiling-backend.md`	py-spy, cProfile, memory_profiler, flame graphs
Bundle Analysis	`rules/profiling-bundle.md`	vite-bundle-visualizer, tree shaking, performance budgets

LLM Inference

High-performance LLM inference with vLLM, quantization, and speculative decoding.

Rule	File	Key Pattern
vLLM Deployment	`rules/inference-vllm.md`	PagedAttention, continuous batching, tensor parallelism
Quantization	`rules/inference-quantization.md`	AWQ, GPTQ, FP8, INT8 method selection
Speculative Decoding	`rules/inference-speculative.md`	N-gram, draft model, 1.5-2.5x throughput

Caching

Backend Redis caching and LLM prompt caching for cost savings and performance.

Rule	File	Key Pattern
Redis & Backend	`rules/caching-redis.md`	Cache-aside, write-through, invalidation, stampede prevention
HTTP & Prompt	`rules/caching-http.md`	HTTP cache headers, LLM prompt caching, semantic caching

Query & Data Fetching

TanStack Query v5 patterns for prefetching and optimistic updates.

Rule	File	Key Pattern
Prefetching	`rules/query-prefetching.md`	Hover prefetch, route loaders, queryOptions, Suspense
Optimistic Updates	`rules/query-optimistic.md`	Optimistic mutations, rollback, cache invalidation

Quick Start Example

// LCP: Priority hero image with SSR
import Image from 'next/image';

export default async function Page() {
  const data = await fetchHeroData();
  return (
    <Image
      src={data.heroImage}
      alt="Hero"
      priority
      placeholder="blur"
      sizes="100vw"
      fill
    />
  );
}

Key Decisions

Decision	Recommendation
Memoization	Let React Compiler handle it (2026 default)
Lists 100+ items	Use TanStack Virtual
Image format	AVIF with WebP fallback (30-50% smaller)
LCP content	SSR/SSG, never client-side fetch
Code splitting	Per-route for most apps, per-component for heavy widgets
Prefetch strategy	On hover for nav links, viewport for content
Quantization	AWQ for 4-bit, FP8 for H100/H200
Bundle budget	Hard fail in CI to prevent regression

Common Mistakes

Client-side fetching LCP content (delays render)
Images without explicit dimensions (causes CLS)
Lazy loading LCP images (delays largest paint)
Heavy computation in event handlers (blocks INP)
Layout-shifting animations (use transform instead)
Lazy loading tiny components < 5KB (overhead > savings)
Missing error boundaries on lazy components
Using GPTQ without calibration data
Not benchmarking actual workload patterns
Only measuring in lab environment (need RUM)

Related Skills

ork:react-server-components-framework - Server-first rendering
ork:vite-advanced - Build optimization
caching - Cache strategies for responses
ork:monitoring-observability - Production monitoring and alerting
ork:database-patterns - Query and index optimization
ork:llm-integration - Local inference with Ollama

Capability Details

lcp-optimization

Keywords: LCP, largest-contentful-paint, hero, preload, priority, SSR Solves:

Optimize hero image loading
Server-render critical content
Preload and prioritize LCP resources

inp-optimization

Keywords: INP, interaction, responsiveness, long-task, transition, yield Solves:

Break up long tasks with scheduler.yield
Defer non-urgent updates with useTransition
Optimize event handler performance

cls-prevention

Keywords: CLS, layout-shift, dimensions, aspect-ratio, font-display Solves:

Reserve space for dynamic content
Prevent font flash and image pop-in
Use transform for animations

react-compiler

Keywords: react-compiler, auto-memo, memoization, React 19 Solves:

Enable automatic memoization
Identify when manual memoization needed
Verify compiler is working

virtualization

Keywords: virtual, TanStack, large-list, scroll, overscan Solves:

Render 100+ item lists efficiently
Dynamic height virtualization
Window scrolling patterns

lazy-loading

Keywords: React.lazy, Suspense, code-splitting, dynamic-import Solves:

Route-based code splitting
Component lazy loading with error boundaries
Prefetch on hover and viewport

image-optimization

Keywords: next/image, AVIF, WebP, responsive, blur-placeholder Solves:

Next.js Image component patterns
Format selection and quality settings
Responsive sizing and CDN configuration

profiling

Keywords: profiler, flame-graph, py-spy, DevTools, bundle-analyzer Solves:

Profile React renders and backend code
Generate and interpret flame graphs
Analyze and optimize bundle size

llm-inference

Keywords: vllm, quantization, speculative-decoding, inference, throughput Solves:

Deploy LLMs with vLLM for production
Choose quantization method for hardware
Accelerate generation with speculative decoding

References

RUM Setup - Real User Monitoring
React Compiler Migration - Compiler adoption
TanStack Virtual - Virtualization patterns
vLLM Deployment - Production vLLM config
Quantization Guide - Method comparison
CDN Setup - Image CDN configuration

performance