Profiles .NET applications for CPU performance, memory allocations, lock contention, exceptions, heap analysis, and JIT inlining using dotnet-trace and dotnet-gcdump. Useful for bottlenecks, leaks, high CPU, and GC pressure.
npx claudepluginhub asynkron/asynkron-skills --plugin asynkron-devtools

This skill uses the workspace's default tool permissions.
This tool requires .NET 10+ SDK (for `dnx` support).
Check if dnx is available:
dnx --help
The profiler also depends on dotnet-trace and dotnet-gcdump. Install them if missing:
dotnet tool install -g dotnet-trace
dotnet tool install -g dotnet-gcdump
A CLI profiler for .NET that outputs structured text — no GUI needed. Designed for both human inspection and AI-assisted analysis. It wraps dotnet-trace and dotnet-gcdump and presents call trees, hot functions, allocation tables, and contention rankings as plain text.
Results are written to profile-output/ in the current directory.
dnx asynkron-profiler [flags] -- [target]
On first run, dnx will prompt to download the package.
`--cpu`: Sampled CPU profiling. Shows call trees and hot function tables.
dnx asynkron-profiler --cpu -- ./MyApp.csproj
dnx asynkron-profiler --cpu -- ./bin/Release/net10.0/MyApp
`--memory`: Tracks GC allocation tick events. Shows per-type allocation call trees and allocation sources.
dnx asynkron-profiler --memory -- ./MyApp.csproj
`--contention`: Shows wait-time call trees and contended method rankings. Use for diagnosing lock congestion and thread starvation.
dnx asynkron-profiler --contention -- ./MyApp.csproj
`--exception`: Counts thrown exceptions and shows throw-site call trees. Filter by type with `--exception-type`.
dnx asynkron-profiler --exception -- ./MyApp.csproj
dnx asynkron-profiler --exception --exception-type InvalidOperationException -- ./MyApp.csproj
`--heap`: Takes a GC heap snapshot using dotnet-gcdump. Shows retained objects by type and size.
dnx asynkron-profiler --heap -- ./MyApp.csproj
The profiler can capture JIT-to-native compilation events, showing which methods get JIT-compiled and which calls get inlined. This is useful for understanding runtime code generation and verifying that hot paths are being optimized by the JIT.
You can analyze previously captured trace files without re-running the app:
dnx asynkron-profiler --input /path/to/trace.nettrace
dnx asynkron-profiler --input /path/to/trace.speedscope.json --cpu
dnx asynkron-profiler --input /path/to/heap.gcdump --heap
| Flag | Purpose |
|---|---|
| `--cpu` | CPU profiling |
| `--memory` | Memory allocation profiling |
| `--contention` | Lock contention analysis |
| `--exception` | Exception profiling |
| `--heap` | Heap snapshot |
| `--root <text>` | Root call tree at first matching method |
| `--filter <text>` | Filter function tables by substring |
| `--exception-type <text>` | Filter exceptions by type name |
| `--calltree-depth <n>` | Max call tree depth (default: 30) |
| `--calltree-width <n>` | Max children per node (default: 4) |
| `--calltree-self` | Include self-time tree |
| `--calltree-sibling-cutoff <n>` | Hide siblings below `<n>`% (default: 5) |
| `--include-runtime` | Include runtime/framework frames |
| `--input <path>` | Analyze existing trace file |
| `--tfm <tfm>` | Target framework for .csproj/.sln |
| Mode | Formats |
|---|---|
| CPU | .speedscope.json, .nettrace |
| Memory | .nettrace, .etlx |
| Exceptions | .nettrace, .etlx |
| Contention | .nettrace, .etlx |
| Heap | .gcdump |
Profiling is iterative. Follow this workflow to systematically identify and eliminate bottlenecks.
Always build Release before profiling. Debug builds have disabled optimizations, extra checks, and no inlining — profiling them gives misleading results.
dotnet build -c Release
| Symptom | Start with |
|---|---|
| High CPU / slow execution | --cpu |
| High memory / GC pressure | --memory |
| High latency but low CPU | --contention |
| Too many exceptions in logs | --exception |
| Memory keeps growing (leak) | --heap |
| Want to verify JIT optimization | JIT/inlining analysis |
The profiler outputs a hot function table showing where time or allocations are concentrated:
=== HOT FUNCTIONS ===
Time (ms) Calls Function
-------------------------------------------------
38805.39 19533 MyApp.Core.ProcessItem...
19769.23 9897 MyApp.Core.TransformData...
Focus on the top 3-5 entries. These are your optimization targets.
For memory profiling, the allocation call graph shows where allocations originate:
CreateEnvironment
Calls: 1048
Allocated by:
<- ProcessLoop (1048x, 100%)
<- RunMain (4x)
This traces allocations back to their source — the method that triggered them, not just where `new` was called.
Track progress across rounds:
Round 1: 322 MB, 172 ms
Round 2: 173 MB, 150 ms (pooling)
Round 3: 107 MB, 116 ms (fast paths)
When the profiler summary isn't enough, capture a detailed trace for manual analysis:
# Capture detailed GC trace
dotnet-trace collect \
--profile gc-verbose \
--format NetTrace \
-o trace.nettrace \
-- dotnet run -c Release --project ./MyApp
# Analyze with the profiler
dnx asynkron-profiler --input trace.nettrace --memory
# Or convert for external tools
dotnet-trace convert trace.nettrace --format Speedscope
Reduce allocations in hot loops:
- `Span<T>` / `stackalloc` for short-lived buffers

Reduce CPU in hot paths:
- `[MethodImpl(MethodImplOptions.AggressiveInlining)]` on small hot methods
- Split rare, complex cases into a separate method (`NoInlining`, rare case)

Reduce contention:
- Lock-free primitives (`Interlocked`, `ConcurrentDictionary`)
- `ReaderWriterLockSlim` for read-heavy workloads

Reduce exceptions:
- `TryParse` / `TryGet` patterns instead of catching exceptions

These patterns work with the .NET JIT compiler to produce faster native code. Use the profiler's JIT/inlining analysis to verify these optimizations take effect.
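The `Span<T>`/`stackalloc` bullet above can be sketched concretely. A minimal example, assuming a hypothetical `TryFormatHex` helper (the name, buffer size, and format are illustrative, not part of the profiler):

```csharp
using System;

// Caller: stackalloc keeps the short-lived scratch buffer off the GC heap entirely.
Span<char> buffer = stackalloc char[8];
if (HexFormat.TryFormatHex(0xBEEF, buffer, out var written))
{
    // Allocate only the final result, never the scratch space.
    string hex = new string(buffer[..written]); // "0000beef"
    Console.WriteLine(hex);
}

static class HexFormat
{
    // Format an int into a caller-supplied buffer instead of allocating temporaries.
    public static bool TryFormatHex(int value, Span<char> destination, out int written)
        => value.TryFormat(destination, out written, "x8");
}
```

The same shape applies to any short-lived buffer the profiler flags in a hot loop: rent or stack-allocate the scratch space, and allocate only the value that escapes.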
The most impactful pattern for hot methods. The JIT inlines small methods into their callers, eliminating call overhead and enabling further optimizations. But it won't inline large methods. The trick: keep the common case tiny and inlineable, push the rare case into a separate non-inlined method.
[MethodImpl(MethodImplOptions.AggressiveInlining)]
private static Result HandleHotPath(Data data)
{
// Fast path: ~20-30 lines max, handles the common case
if (data.IsSimpleCase)
{
// Direct, minimal work
return Result.From(data.Value);
}
// Rare/complex case — delegate to non-inlined method
return HandleHotPathSlow(data);
}
[MethodImpl(MethodImplOptions.NoInlining)]
private static Result HandleHotPathSlow(Data data)
{
// Complex logic: type coercion, error handling, edge cases
// This can be as large as needed — it won't bloat the call site
return Result.From(data.Value); // placeholder: real slow-path work goes here
}
Why this works:
- The fast path stays under the JIT's inlining size budget, so the call overhead disappears at hot call sites and the JIT can optimize across the boundary.
- The slow path is marked `NoInlining`, so its size never counts against the caller's inlining budget.

When to apply:
- The profiler shows a hot method where one simple branch handles the overwhelming majority of calls.
How to verify: Use the profiler's JIT/inlining analysis to confirm the fast path is being inlined at the call site.
For hot dispatch (e.g., instruction interpreters, message handlers, event processors), a delegate array indexed by enum is faster than a switch:
// Define handler signature
delegate Result Handler(Context ctx, Instruction instr);
// Build dispatch table once (static constructor)
private static readonly Handler[] _dispatch = new Handler[64];
static MyRunner()
{
_dispatch[(int)Kind.Add] = HandleAdd;
_dispatch[(int)Kind.Call] = HandleCall;
_dispatch[(int)Kind.Branch] = HandleBranch;
// ...
}
// Hot loop — direct delegate invocation, no switch overhead
while (running)
{
var instr = instructions[pc];
var result = _dispatch[(int)instr.Kind](ctx, instr);
// ...
}
Why this works:
- A single array index plus a delegate invocation replaces a potentially long chain of branch comparisons.
- The table is built once in the static constructor, so the hot loop pays no per-iteration setup cost.

When to apply:
- The profiler shows a large `switch` or if/else chain dominating a hot dispatch loop over a dense enum.
When the profiler shows a type being allocated millions of times in a loop, pool it instead.
// 1. Define what poolable objects look like
interface IRentable
{
void Activate(); // Called when rented — initialize state
void Reset(); // Called when returned — clear state for reuse
}
// 2. Lock-free pool using Interlocked.CompareExchange
class ObjectPool<T> where T : class, IRentable
{
private readonly T?[] _items;
    private readonly Func<T> _factory;

    public ObjectPool(Func<T> factory, int size = 32)
    {
        _factory = factory;
        _items = new T?[size];
    }
public T Rent()
{
for (int i = 0; i < _items.Length; i++)
{
var item = Interlocked.Exchange(ref _items[i], null);
if (item is not null) { item.Activate(); return item; }
}
var created = _factory();
created.Activate();
return created;
}
public void Return(T item)
{
item.Reset();
for (int i = 0; i < _items.Length; i++)
{
if (Interlocked.CompareExchange(ref _items[i], item, null) == null)
return;
}
// Pool full — abandon to GC (graceful degradation)
}
}
// 3. RAII wrapper ensures objects are returned (a hypothetical handle type)
readonly struct PoolHandle<T> : IDisposable where T : class, IRentable
{
    private readonly ObjectPool<T> _pool;
    public T Value { get; }
    public PoolHandle(ObjectPool<T> pool, T value) { _pool = pool; Value = value; }
    public void Dispose() => _pool.Return(Value);
}

// Usage (MyPooledObject is illustrative):
using var handle = new PoolHandle<MyPooledObject>(pool, pool.Rent());
var obj = handle.Value;
// ... use obj ...
// Automatically returned on dispose
Impact: A tight loop creating 1M scoped objects goes from 1M allocations to ~32 (pool size). Dramatically reduces GC pressure.
When to apply:
- The profiler's `--memory` output shows one type allocated millions of times from a tight loop, and the objects have clear rent/return lifetimes.
For cached computed values that are expensive to create but read frequently:
static TCache GetOrCreate<TCache>(ref TCache? field, Func<TCache> factory)
where TCache : class
{
var existing = Volatile.Read(ref field);
if (existing is not null) return existing;
var created = factory();
var prior = Interlocked.CompareExchange(ref field, created, null);
return prior ?? created;
}
Why not Lazy<T>: This pattern avoids the Lazy<T> allocation itself, and the Volatile.Read fast path is a single instruction on x86/ARM. The worst case (two threads create simultaneously) wastes one creation but is still correct — no locks needed.
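A usage sketch of the helper above; the `Renderer` class and its glyph-table factory are illustrative, not part of the profiler:

```csharp
using System;
using System.Threading;

// Every read after the first returns the same cached instance.
Console.WriteLine(ReferenceEquals(Renderer.GlyphTable, Renderer.GlyphTable)); // prints True

class Renderer
{
    private static string? _glyphTable; // cache field, null until first use

    // Computed at most once per process; concurrent racers may build a spare
    // copy that is simply discarded (no locks, still correct).
    public static string GlyphTable
        => GetOrCreate(ref _glyphTable, static () => new string('x', 1024));

    static TCache GetOrCreate<TCache>(ref TCache? field, Func<TCache> factory)
        where TCache : class
    {
        var existing = Volatile.Read(ref field);
        if (existing is not null) return existing;
        var created = factory();
        var prior = Interlocked.CompareExchange(ref field, created, null);
        return prior ?? created;
    }
}
```

The `static` lambda keeps the factory allocation-free at the call site; the cache field itself is the only long-lived state.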
- Always build Release (`dotnet build -c Release`) before profiling — profiling Debug builds gives misleading results
- Profile the compiled binary rather than `dotnet run` for accurate measurements
- Use `--root` to focus on a specific call path when the tree is too broad
- Use `--filter` to narrow function tables to your own code, excluding framework noise
- Tune `--calltree-sibling-cutoff` to hide insignificant branches
- Use `--memory` to find allocation sources, then `--heap` for retained object analysis
- Start with `--cpu`, then drill into contention if CPU usage is low but latency is high
- Combine `--exception` with `--exception-type` to focus on specific exception categories
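These tips can be strung into a typical iterative session. A sketch using only the flags documented above (the project path, the `MyApp` filter, and the root method name are illustrative):

```shell
# Round 1: build Release and find CPU hot spots in our own code
dotnet build -c Release
dnx asynkron-profiler --cpu --filter MyApp -- ./bin/Release/net10.0/MyApp

# Round 2: chase allocations under the hottest method found in round 1
dnx asynkron-profiler --memory --root MyApp.Core.ProcessItem -- ./bin/Release/net10.0/MyApp

# Round 3: after a fix, confirm retained memory with a heap snapshot
dnx asynkron-profiler --heap -- ./bin/Release/net10.0/MyApp
```

Each round's output lands in `profile-output/`, so numbers can be compared across rounds as in the tracking example above.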