Garbage collection, SOH, LOH, POH, P/Invoke

Garbage collection

Garbage collection is an automatic memory management feature in .NET that allocates and releases memory with the help of the garbage collector.

The term garbage collection is also often used to refer only to releasing memory, as opposed to both allocating and releasing it.

A garbage collector, or GC, is an automatic memory manager in the ↑ CLR that allocates and releases memory for your application.

Memory allocation

When you initialize a new process, the runtime reserves a contiguous region of address space for the process — the managed heap.

All reference types are allocated on the managed heap.

When an application creates the first reference type, memory is allocated for the type at the base address of the managed heap. As long as address space is available, the runtime continues to allocate space for new objects.

Memory release

Before a garbage collection starts, all managed threads are suspended except for the thread that triggered the garbage collection.

Managed heap

A managed heap is a contiguous region of address space reserved by the runtime for the process.

All threads in the process allocate memory for objects on the same heap.

A general overview of the .NET managed heap:

flowchart
    heap(.NET managed heap) --> gcheap
    heap --> nongcheap
    nongcheap("Non-GC heap\n(Not managed by GC)")
    subgraph gcheap[GC heaps]
        soh("Small object heap (SOH)")
        poh("Pinned object heap (POH)")
        loh("Large object heap (LOH)")
    end
    style nongcheap stroke:green

The managed heap maintains a pointer to the address where the next object in the heap will be allocated. Initially, this pointer is set to the managed heap's base address. All reference types are allocated on the managed heap. When an application creates the first reference type, memory is allocated for the type at the base address of the managed heap. When the application creates the next object, the garbage collector allocates memory for it in the address space immediately following the first object.

Allocating memory from the managed heap is faster than unmanaged memory allocation. Because the runtime allocates memory for an object by adding a value to a pointer, it's almost as fast as allocating memory from the stack. In addition, because new objects that are allocated consecutively are stored contiguously in the managed heap, an application can access the objects quickly.
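The pointer-bump idea described above can be illustrated with a toy model. This is only a sketch: `heapSize`, `nextFree`, and `Allocate` are hypothetical names made up for the illustration, and the real CLR allocator is considerably more involved (allocation contexts, alignment, object headers).

```csharp
using System;

// Toy model only: the "heap" is a range of offsets plus a next-allocation pointer.
// None of these names mirror CLR internals.
const int heapSize = 1024;
int nextFree = 0; // starts at the base of the toy heap

// Allocating means handing out the current offset and bumping the pointer by the object size.
int Allocate(int size)
{
    if (nextFree + size > heapSize)
        throw new OutOfMemoryException("toy heap exhausted");

    int address = nextFree;
    nextFree += size;
    return address;
}

int first = Allocate(24);  // placed at the base of the heap
int second = Allocate(40); // placed immediately after the first object

Console.WriteLine($"first at offset {first}, second at offset {second}, next free offset {nextFree}");
```

Each allocation costs one bounds check and one addition, which is why the text compares it to stack allocation.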

Non-GC heap

A non-GC heap is a specialized heap that is not managed by the garbage collector and is designed to store immortal objects with certain benefits for GC and code generation.

The non-GC heap was introduced in .NET 8.0. The basic idea is that certain kinds of objects are essentially immortal and will never be collected, so they can be placed in separate storage where they are never scanned or compacted. String literals, for example, are interned and therefore immortal.

↑ NonGC Heap.

↑ Exploring .NET frozen segments.

SOH

The small object heap, or SOH, is the memory zone for objects smaller than 85,000 bytes.

The 85,000-byte threshold was determined empirically, through performance tuning, as the point beyond which compacting (moving) objects no longer provides significant performance benefits.

LOH

The large object heap, or LOH for short, is a special memory zone for objects that are 85,000 bytes or larger.

LOH objects are collected only during a generation 2 garbage collection.
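One way to observe which heap an array landed on is to ask the GC for its generation, since LOH objects are reported as generation 2. The following is a minimal sketch that assumes the default 85,000-byte threshold has not been changed through runtime configuration; the variable names are illustrative.

```csharp
using System;

byte[] small = new byte[1_000];  // well under the threshold, allocated on the SOH
byte[] large = new byte[85_000]; // 85,000 bytes of data plus the array header, allocated on the LOH

int smallGeneration = GC.GetGeneration(small);
int largeGeneration = GC.GetGeneration(large);

Console.WriteLine($"small array generation: {smallGeneration}");
Console.WriteLine($"large array generation: {largeGeneration} (GC.MaxGeneration is {GC.MaxGeneration})");

// Keep both arrays reachable until the generations have been read.
GC.KeepAlive(small);
GC.KeepAlive(large);
```

A freshly allocated small array normally reports generation 0, while the large array reports the highest generation straight away.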

By default LOH objects are not compacted.

.NET Core and .NET Framework, starting with version 4.5.1, include the ↑ GCSettings.LargeObjectHeapCompactionMode property that allows users to specify that the LOH should be compacted during the next full blocking garbage collection. And in the future, .NET may decide to compact the LOH automatically. This means that, if you allocate large objects and want to make sure that they don't move, you should still pin them.
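As a sketch of how the property is typically used, the snippet below requests a one-time LOH compaction and then forces a blocking full collection. According to the documentation the setting reverts to `Default` once the compaction has happened; the snippet prints the value rather than relying on that.

```csharp
using System;
using System.Runtime;

// Ask the GC to compact the LOH during the next blocking generation 2 collection.
GCSettings.LargeObjectHeapCompactionMode = GCLargeObjectHeapCompactionMode.CompactOnce;
GCLargeObjectHeapCompactionMode requested = GCSettings.LargeObjectHeapCompactionMode;

// GC.Collect() with no arguments performs a blocking collection of all generations.
GC.Collect();

GCLargeObjectHeapCompactionMode afterCollect = GCSettings.LargeObjectHeapCompactionMode;
Console.WriteLine($"requested: {requested}, after collection: {afterCollect}");
```

Because compacting the LOH means copying large objects, this is usually reserved for moments when a pause is acceptable, for example after releasing a large batch of buffers.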

In addition, the LOH is ↑ automatically compacted when a hard limit is set by specifying either:

A memory limit on a container
The ↑ GCHeapHardLimit or ↑ GCHeapHardLimitPercent runtime configuration options

↑ The large object heap on Windows systems.

POH

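The pinned object heap, or POH, is a specialized heap introduced in .NET 5.0 for objects that stay pinned for their entire lifetime. Because the garbage collector never moves objects on the POH, allocating long-lived pinned buffers there avoids fragmenting the SOH.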
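Arrays can be placed on the POH directly with `GC.AllocateArray<T>` and its `pinned` argument (available since .NET 5). The following is a minimal sketch with illustrative variable names; it reads the buffer back through its raw address to show that the address is usable without `fixed` or `GCHandle`.

```csharp
using System;
using System.Runtime.InteropServices;

// Allocate the array on the pinned object heap; the GC will never move it.
byte[] buffer = GC.AllocateArray<byte>(100, pinned: true);

// The address stays valid for as long as the array is alive.
IntPtr address = Marshal.UnsafeAddrOfPinnedArrayElement(buffer, 0);

buffer[0] = 42;
byte firstByte = Marshal.ReadByte(address); // read the same byte through the raw address

Console.WriteLine($"buffer length: {buffer.Length}, first byte via pointer: {firstByte}");

// The array must remain reachable while native code may still use the address.
GC.KeepAlive(buffer);
```

This suits buffers that are handed to native code repeatedly over a long period. For short-lived pins, the `fixed` statement is usually the lighter option.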

Pinning objects in C# is primarily used to ensure that an object remains at a fixed memory location and does not get moved by the garbage collector. This is particularly important in scenarios where you need to pass a reference to managed memory to unmanaged code, such as when working with P/Invoke or interfacing with low-level system components. Pinning an object prevents the GC from relocating it, ensuring that the unmanaged code receives a stable pointer.

`fixed` keyword

Here's a simple example demonstrating how to pin an array and pass it to unmanaged code using the fixed statement:

using System;
using System.Runtime.InteropServices;

[DllImport("SomeNativeLibrary.dll")]
static extern void NativeFunction(IntPtr ptr);

byte[] data = new byte[100];

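// Pin the array for the duration of the fixed block and get a pointer to its data.
// Compiling unsafe code requires <AllowUnsafeBlocks>true</AllowUnsafeBlocks> in the project file.
unsafe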
{
    fixed (byte* pData = data)
    {
        IntPtr ptr = (IntPtr)pData;
        NativeFunction(ptr);
    }
}

In this example:

The fixed statement is used to pin the data array
The byte* pData is a pointer to the first element of the array
IntPtr ptr = (IntPtr)pData converts the pointer to an IntPtr that can be passed to the unmanaged function NativeFunction

`GCHandle`

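Another approach to pinning objects is the ↑ GCHandle structure, which gives more control over the lifetime of the pin: the object stays pinned until the handle is freed, not just for the duration of a lexical block. Only objects that contain exclusively blittable data, such as arrays of primitive types, can be pinned this way: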

using System;
using System.Runtime.InteropServices;

[DllImport("SomeNativeLibrary.dll")]
static extern void NativeFunction(IntPtr ptr);

byte[] data = new byte[100];
GCHandle handle = GCHandle.Alloc(data, GCHandleType.Pinned);

try
{
    IntPtr ptr = handle.AddrOfPinnedObject();
    NativeFunction(ptr);
}
finally
{
    handle.Free();
}

In this example:

GCHandle.Alloc(data, GCHandleType.Pinned) pins the data array
handle.AddrOfPinnedObject() retrieves the pinned memory address
handle.Free() releases the pinning once it is no longer needed

↑ Pinned Heap.

P/Invoke

↑ P/Invoke, or platform invoke, is a technology that allows you to access structs, callbacks, and functions in unmanaged libraries from your managed code.
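A minimal P/Invoke sketch follows. It assumes a Linux system with glibc, where the C library is available as `libc.so.6`; on Windows or macOS the library name and function would differ. It calls the native `getpid` function and compares the result with the value the runtime reports.

```csharp
using System;
using System.Runtime.InteropServices;

// Declare the native function; the runtime loads the library and binds the call on first use.
// "libc.so.6" is an assumption of this sketch (glibc on Linux).
[DllImport("libc.so.6")]
static extern int getpid();

int nativePid = getpid();               // process ID obtained through P/Invoke
int managedPid = Environment.ProcessId; // process ID reported by the runtime (.NET 5+)

Console.WriteLine($"native pid: {nativePid}, managed pid: {managedPid}");
```

No pinning is needed here because only an `int` is returned by value. Pinning becomes relevant once references to managed memory cross the boundary.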

Ephemeral generations and segments

Because objects in generations 0 and 1 are short-lived, these generations are known as the ephemeral generations.
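Generations can be observed from code with `GC.GetGeneration`. The sketch below allocates an object, forces ephemeral collections, and prints the generation after each one. Surviving objects are normally promoted one generation per collection, but the exact numbers depend on the GC's decisions, so the snippet prints them instead of assuming them.

```csharp
using System;

object survivor = new object();
int generationAtAllocation = GC.GetGeneration(survivor);

GC.Collect(0); // collect generation 0 only
int generationAfterGen0 = GC.GetGeneration(survivor);

GC.Collect(1); // collect generations 0 and 1
int generationAfterGen1 = GC.GetGeneration(survivor);

Console.WriteLine($"at allocation: {generationAtAllocation}");
Console.WriteLine($"after a generation 0 collection: {generationAfterGen0}");
Console.WriteLine($"after a generation 1 collection: {generationAfterGen1}");
Console.WriteLine($"highest generation: {GC.MaxGeneration}");

// Keep the object reachable so that it survives the collections above.
GC.KeepAlive(survivor);
```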

Ephemeral generations are allocated in the memory segment that's known as the ephemeral segment. Each new segment acquired by the garbage collector becomes the new ephemeral segment and contains the objects that survived a generation 0 garbage collection. The old ephemeral segment becomes the new generation 2 segment.

The size of the ephemeral segment varies depending on whether a system is 32-bit or 64-bit and on the type of garbage collector it is running (workstation or server GC). The following table shows the default sizes of the ephemeral segment.

| GC type | 32-bit | 64-bit |
|---|---|---|
| Workstation GC | 16 MB | 256 MB |
| Server GC | 64 MB | 4 GB |
| Server GC with > 4 logical CPUs | 32 MB | 2 GB |
| Server GC with > 8 logical CPUs | 16 MB | 1 GB |

The ephemeral segment can include generation 2 objects. Generation 2 objects can use multiple segments (as many as your process requires and memory allows for).

The amount of freed memory from an ephemeral garbage collection is limited to the size of the ephemeral segment. The amount of memory that is freed is proportional to the space that was occupied by the dead objects.

↑ Understanding different GC modes with Concurrency Visualizer.
