Custom Heat Sinks for AI Servers, GPUs & HPC Compute

Richconn manufactures custom heat sinks for AI servers — CNC-machined copper and aluminum thermal solutions engineered for 700 W to 1,400 W GPU thermal envelopes, from NVIDIA H100 air-cooled designs to B200 and GB200 liquid cold-plate bases. We handle the precision-machined components inside every thermal stack: copper bases, aluminum fin stacks, cold-plate housings, and the interface features that decide whether a 1 kW chip stays within its junction temperature.

Custom Heat Sinks for AI Servers
01 The Thermal Problem

GPU TDP Has Tripled in Seven Years

AI accelerator thermal design power has roughly tripled across the H100 → B200 → B300 generation. The chart below shows the trajectory — and why air-cooled designs that worked for H100 stop working at B200, forcing every server OEM and integrator to redesign their thermal stack.

NVIDIA AI GPU Thermal Design Power · 2022–2025
SOURCE: NVIDIA spec sheets · DCD · 2025
350W
A100
PCIe · 2020
700W
H100
SXM · 2022
700W
H200
SXM · 2024
1,000W
B200
air · 2025
1,200W
B200
liquid · 2025
1,400W
B300
liquid · 2025
Air-cooled feasible
Air-cooled, high airflow + Cu base
Liquid cooling required
Direct-to-chip liquid mandatory
Rack-level heat
A GB200 NVL72 rack generates 120–140 kW of heat — versus 10–15 kW for a traditional enterprise rack.
Heat flux density
Above 30 W/cm² at the die, single-phase air convection physically cannot extract enough heat.
Liquid-cooling share
Liquid cooling rose from ~10% of AI chips in late 2024 to default for high-end GB200 / B300 systems.

What this means for a hardware engineer: the heat sink is no longer a finishing component. It is the determining piece of a server's reliability budget. Base flatness, material choice, fin geometry, and TIM compatibility now dictate whether a 1 kW chip stays within its junction temperature limit at sustained workload — or throttles and degrades job completion time by 15–25%.

02 Capability Snapshot

Heat-Sink Capabilities at a Glance

A quick capability check for thermal engineers and procurement teams. These are the materials, processes, tolerances, and inspection methods we apply to AI server heat sinks every day.

Target Applications
AI GPU heat sinks (H100, H200, B200, B300, GB200, MI300), HPC server CPUs, AI accelerator cards, edge AI modules (Jetson), liquid cold-plate bases for direct-to-chip cooling
Constructions
Solid copper · solid aluminum · Cu base + Al fin (hybrid) · CNC-machined finned blocks · cold-plate housings · zipper-fin bases · skived-fin post-machining
Materials
Copper (C1100 ETP · C1020 OFC), aluminum (6061-T6 · 6063-T5 · 1060), brass (rare, low-cost applications)
Tolerances
Base flatness ±0.01 mm, base surface roughness Ra 0.4–0.8 µm (mating face), fin geometry per drawing, hole position per GD&T
Part Envelope
Up to 1,000 × 500 mm footprint · fin heights up to 180 mm (CNC milled) · aspect ratios up to 15:1 on CNC-cut fins (higher via skived process)
Surface Treatments
Black anodize, clear anodize, nickel plating (Cu), passivation, bead blasting, laser marking, masking for selective treatment of mating faces
Volume
Prototype (1–10 pcs, 5–10 days) · pilot run (10–500 pcs) · series production (500+ pcs/month)
Inspection
CMM, surface profilometer (base flatness), Ra tester, optical projector, mill-certified raw stock, full FAI report on first article
03 Definition

What Is a Custom AI Server Heat Sink?

An AI server heat sink is the passive thermal component bolted to a GPU, CPU, or accelerator that conducts heat from the die into a finned structure where it transfers to an air or liquid coolant. Unlike a desktop heat sink, "AI server" implies sustained power dissipation at or near TDP for hours — not occasional bursts — across processors that today range from 350 W (A100) to 1,400 W (B300).

A "custom" heat sink, in this context, means the part is designed to a specific server's mechanical envelope, chip footprint, airflow direction, mounting pattern, and target junction temperature. Off-the-shelf heat sinks rarely fit current AI server designs because: (a) chip packages and BGA pitches keep changing, (b) 1U / 2U / 4U chassis impose different fin-height envelopes, and (c) hyperscale OEMs specify proprietary fan curves and rail spacing.

Three typical orders we machine for AI server programs:

  • Air-cooled tower or finned-block heat sink for 350–800 W GPU/CPU — Cu base with bonded or CNC-cut Al fins, designed around a specific 2U/4U airflow envelope
  • Liquid cold-plate base for 700–1,400 W direct-to-chip cooling — CNC-machined copper or copper-nickel-plated body with milled or skived micro-fins inside the coolant cavity
  • Hybrid Cu-base + Al-fin heat sink with embedded heat-pipe slots — we machine the base, fin block, and pipe channels; heat pipe insertion handled by the customer or a partner workshop
04 Manufacturing Processes

How We Make Each Type of Heat Sink

There are four mainstream heat-sink construction methods. Each has a place — but for AI server power densities, the right choice depends on heat flux, footprint, fin density, and whether you need a liquid cold plate. Below is what each method does well, and where Richconn fits.

04.1 / PRIMARY PROCESS

CNC Machined Heat Sinks

Richconn signature · 3 / 4 / 5-axis

An entire heat sink — base, fin block, mounting features, cable channels — milled from a single block of copper or aluminum. The base and fins share one datum, eliminating the thermal interface between base and fin that exists in bonded or zipper-fin construction. Best for prototype, low-volume, complex geometry, and cold-plate housings.

Material: Cu C1100/C1020 · Al 6061/6063
Fin aspect: up to ~15:1 (CNC limit)
Base flatness: ±0.01 mm
Best for: Cold-plate bases, prototype runs, custom mounting, 50–5,000 pcs
04.2 / FIN PROCESS

Skived Fin Heat Sinks

Ultra-high fin density · monolithic

A precision blade peels ultra-thin fins from a solid copper or aluminum block in one operation. The result is a one-piece structure with zero base-to-fin interface and aspect ratios up to 30:1 — impossible on a CNC mill. Excellent for high-density forced-air cooling where space is tight.

Fin aspect: up to 30:1
Fin pitch: down to 0.5 mm
Volume sweet spot: 500+ pcs
Richconn scope: Skived parts produced via vetted partner; we machine the base interface, mounting features, and assembly tolerances
04.3 / FIN PROCESS

Bonded / Zipper-Fin Heat Sinks

Tall fins · low cost at volume

Thin stamped or extruded fins are mechanically locked or thermally bonded to a separately machined base — allowing taller, thinner fins than skiving or CNC. Trade-off: a thermal interface exists between fin and base, so thermal resistance is higher than a monolithic part for the same envelope.

Fin aspect: very high (40:1+)
Trade-off: Base-to-fin interface resistance
Volume sweet spot: 1,000+ pcs
Richconn scope: Bases CNC-machined in-house to flatness spec; fin stack bonded via partner
04.4 / FIN PROCESS

Aluminum Extrusion + CNC Finish

High volume · linear fin geometry

Aluminum 6063 is extruded through a profile die to form a base+fin cross-section, then CNC-machined to add mounting holes, custom outline, and a precision base surface. Tooling cost is amortized over runs typically 5,000+ pcs. Limited to constant linear fin geometry along the extrusion axis.

Material: Al 6063 (limited to extrudable alloys)
Fin geometry: Linear, constant along axis
Volume sweet spot: 5,000+ pcs
Richconn scope: Profile sourced from extrusion partner; we CNC-machine the base, holes, outline, and surface finish to spec
Honest scope note

Richconn's core competency is precision CNC machining: bases, cold-plate housings, custom milled fin geometry, hybrid Cu-Al constructions, and finishing to flatness/Ra specs. Skiving, deep extrusion, vacuum brazing, and heat-pipe insertion are produced via vetted partner workshops or executed by your in-house process. We're upfront about this scope so you know exactly where Richconn adds value and where another supplier — or a partner we coordinate — does the operation. If your project requires a complete thermal subassembly, we can manage the partner steps end-to-end and ship a finished unit.

05 Material Selection

Copper vs. Aluminum vs. Hybrid Construction

Material choice is the single largest lever on heat-sink thermal performance — and on cost and weight. For an AI server engineer, the trade space comes down to three constructions. The numbers below are what each can do, not marketing rounding.

Aluminum

All-Aluminum

Al 6061 / 6063 / 1060
200 W/m·K
Thermal Conductivity (6063)
Density
2.7 g/cm³
Cost (vs Cu)
~1× (baseline)
Machinability
Excellent
Extrudable
Yes (6063)
Best forTDP < 500 W · weight-sensitive 1U/2U servers · cost-driven OEM volume · edge AI modules
Copper

All-Copper

Cu C1100 / C1020
391 W/m·K
Thermal Conductivity (C1100)
Density
8.96 g/cm³
Cost (vs Al)
3–4×
Machinability
Moderate (gummy)
Extrudable
No (skive or machine)
Best forTDP > 700 W · heat flux > 30 W/cm² · constrained footprint · liquid cold plates · GPU/CPU direct contact
Hybrid

Cu Base + Al Fin

C1100 base · 6063 fins
~85%
of Cu Performance, ~45% of Cu Weight
Spreading
Copper (391 W/m·K)
Convection
Aluminum (lighter fins)
Cost (vs Cu)
~60%
Construction
Bonded / soldered / mech locked
Best forMainstream AI servers · 500–1,000 W where weight matters but Cu spreading is needed at the die interface
06 Heat Sink Types

Heat-Sink Types We Machine for AI Servers

Representative part categories we produce for AI server, HPC, and accelerator card OEMs. Real photos and inspection data shared per project under NDA.

GPU · Hybrid

Cu Base + Al Fin Block

500–1000W 5-axis
Liquid Cooling

Cold-Plate Bases

Cu C1100 1000W+
CPU · HPC

All-Al Tower / Block Sink

6063-T5 <500W
Hybrid · Heat-Pipe

HP-Ready Base + Fin Block

3 × ⌀6mm Cu/Al
Optical / Network

OSFP / QSFP-DD Sinks

Cu compact
Low-Profile

1U / 2U Server Heat Sinks

Al 6063 extrusion
Liquid · Micro-channel

Micro-Channel Cold Plates

Cu partner-brazed
Edge AI

Jetson / Edge Module Sinks

Al 6061 < 100W

Renderings are schematic. Real photos and CFD / inspection data shared per project under NDA.

07 Engineering Specifications

Tolerances & Specs We Hold in Production

CNC machining can hold extremely tight specs in single pieces. What matters for a server program is what we hold stably in batch, with documented CMM and profilometer records. The numbers below are batch production values, not best-case.

SpecificationRichconn HoldsWhy It Matters for AI Servers
Base flatness (mating face)±0.01 mm across mating areaDirectly drives TIM thickness uniformity → thermal-interface resistance Rth_int. Out-of-flat bases create air gaps that act as insulators at the die interface.
Base surface roughnessRa 0.4–0.8 µmLower Ra reduces TIM bond-line thickness and improves thermal contact. Too-rough surfaces require thicker TIM, raising resistance.
Fin pitch (CNC machined)1.5–5 mmTighter pitch increases surface area but raises pressure drop and dust-loading risk. We work with the customer's airflow envelope to set it.
Fin aspect ratioup to 15:1 (CNC) · up to 30:1 (skived, partner)Higher aspect ratio packs more surface area in a fixed footprint — the only path forward when the chassis is fixed and chip power went up.
Hole / boss position±0.02 mm GD&TCritical for mating to the GPU package mounting pattern. Position errors compound across multiple mounting points.
Overall envelopeUp to 1,000 × 500 mm · fin height to 180 mmCovers GPU/CPU heat sinks, accelerator card sinks, and rack-level cold plates.
Material certificationMill certs for every Cu/Al batchThermal conductivity depends on alloy purity. Mill cert with chemistry + mechanical properties prevents surprises.

Inspection Flow for Every Heat Sink

QC.01
Material & FAI

Mill cert check + first-piece CMM inspection against drawing before any lot proceeds.

QC.02
Base Flatness

Surface profilometer / dial indicator across mating area. Pass criterion ±0.01 mm.

QC.03
Geometry & Finish

CMM dimensional, optical projector for fin profile, Ra tester on critical surfaces.

QC.04
Final OQC

Cosmetic inspection, deburr/clean verified, masking confirmed, packed with FAI + lot report.

08 Why Richconn

Why Server OEMs Choose Richconn for Heat-Sink Machining

Six concrete reasons — each backed by something we actually do for thermal customers. We don't oversell what we are. We're a precision CNC shop specialized in the machining steps of heat-sink production — and we coordinate the rest.

01 — INTERFACE FIRST

We Hold Base Flatness Like It's the Whole Spec

Because for a heat sink it almost is. We measure every mating-face heat sink on a surface profilometer, not just a dial indicator, and we record the result per lot. ±0.01 mm across the mating area is the production standard — not a best-case sample.

02 — CU + AL DUAL FLUENCY

Equal Comfort in Copper and Aluminum

Many CNC shops are aluminum-specialists who avoid copper because it gums up tools and burr-control is harder. We run dedicated tooling and chip-control strategy for C1100/C1020 copper — so cold-plate bases and Cu-base hybrids come off the machine clean and ready for finish.

03 — CFD-AWARE DFM

We Read Your Thermal Spec, Not Just Your Drawing

If you send a TDP, ambient, and airflow envelope, our engineers will flag fin density, base thickness, or boss-position decisions that look fine on the drawing but trade away thermal performance. Free DFM with every quote, no commitment.

04 — HONEST SCOPE

We Tell You What We Don't Do

Skiving, vacuum brazing, heat-pipe insertion, vapor-chamber assembly — these go to vetted partner workshops, and we say so. If a partner step is part of your job, we coordinate it and ship a finished unit. No silent sub-contracting to unknown shops.

05 — PROTO TO VOLUME

5-Day Prototypes, Series Production Ready

Prototype lead time of 5–10 working days for standard Cu / Al sinks. Pilot lots of 10–500 pcs validate process. Series production runs in the 500–5,000+ pcs/month range with stable setup, documented FAI, and ISO 9001 lot traceability.

06 — RECORDS YOU CAN AUDIT

Material Certs & Inspection Records, Every Lot

Mill cert for every Cu and Al lot. CMM and profilometer records for every FAI. Lot traceability — batch, machine, operator, date — kept on file for 3 years. The documentation hyperscaler and OEM programs require, without you having to ask.

09 Quoting Workflow

How to Quote a Custom AI Server Heat Sink

Sending us your design does not commit you to an order. NDAs on request before any file is uploaded. Here's what gets you an accurate quote within 2 business hours.

1
Send Drawings or Thermal Spec
You send: 3D CAD (STEP, X_T, IGES) + 2D drawing with flatness/Ra callouts, material spec, qty — OR a thermal spec (TDP, ambient, airflow, envelope) if you want a design proposal first
We confirm: Receipt within 30 minutes in business hours
2
DFM Review
You wait: Free, no commitment
We check: Manufacturability, base-flatness method, fin aspect feasibility, material choice, surface treatment route, cost-driver flags
3
Quotation
You receive: Per-unit price by quantity tier, lead time, finishing options, partner-step coordination if any
We commit: Within 2 business hours of complete files
4
Prototype
You issue: PO for sample run (1–10 pcs)
We deliver: Sample in 5–10 working days with full FAI report (CMM + flatness)
5
Pilot & Approval
You test: Thermal validation on your bench / in-system
We hold: Setup ready for production sign-off; iterate if thermal data calls for fin/base changes
6
Production & Ship
You receive: Lots with FAI + per-lot CMM record, material cert, anti-corrosion packaging, export docs
We pack: Cleaned, deburred, mating faces protected, ready for assembly
NDA Available

We sign NDAs before any file is uploaded — standard for AI server OEM and hyperscaler programs.

Send Drawings or Specs
10 FAQ

AI Server Heat Sinks — FAQ

The questions thermal engineers and procurement teams ask most often. Each answer is written to give decision-useful detail, not marketing copy.

Q.01What is a custom heat sink for an AI server?+
A custom AI server heat sink is a thermal component designed to a specific server's GPU/CPU footprint, mechanical envelope, airflow direction, mounting pattern, and target junction temperature — rather than pulled from a catalogue. "AI server" implies sustained dissipation at or near TDP for hours, across processors that range from 350 W (A100) to 1,400 W (B300). Custom designs are needed because chip packages, BGA pitches, chassis envelopes, and fan curves keep changing — off-the-shelf parts rarely fit current AI server programs.
Q.02Which GPUs can your heat sinks be designed for?+
We've machined heat sinks and cold-plate bases for the full current AI accelerator landscape: NVIDIA A100 (350–400 W), H100 (700 W), H200 (700 W), B200 (1,000–1,200 W), B300 (1,100–1,400 W), GB200 Superchip components, and AMD MI300X/MI300A. We've also done CPU server sinks (Xeon, EPYC), accelerator card sinks (custom inference cards), edge AI module sinks (NVIDIA Jetson), and OSFP / QSFP-DD optical-module sinks. If your design is targeted at a specific package and TDP, send the spec — we'll tell you what construction makes sense.
Q.03Copper vs. aluminum — which should I use for AI servers?+
Rule of thumb: aluminum below 500 W TDP, copper or hybrid above 700 W, and copper-based for heat flux above ~30 W/cm². Copper has ~2× the thermal conductivity of aluminum (391 vs. ~200 W/m·K) and achieves equivalent cooling in 40–60% less volume, but costs 3–4× more and is 3.3× denser. For most mainstream AI servers, a Cu base + Al fin hybrid gives ~85% of copper performance at ~45% of copper weight, which is why it's the dominant construction for 500–1,000 W GPU heat sinks today.
Q.04What base flatness do you hold, and why does it matter?+
We hold ±0.01 mm flatness across the mating area in production, measured with a surface profilometer and recorded per lot. Base flatness matters because the thermal interface material (TIM) between the die and heat sink works best at a uniform, thin bond line. Out-of-flat bases create air gaps that act as insulators — even with premium TIM. On a 1 kW GPU, a few microns of unintended gap can move junction temperature several degrees, which compounds across the rack.
Q.05Can you make liquid cold plates for direct-to-chip cooling?+
Yes — we CNC-machine cold-plate bases and housings in copper or copper with nickel plating, including milled coolant channels, inlet/outlet ports, and the precision mating face that sits against the GPU. For vacuum-brazed micro-channel cold plates (where multiple copper layers are sealed into a single sub-millimeter-channel structure for the highest-density liquid cooling), the brazing step is executed by a vetted partner — we machine all the precision metal parts and coordinate the brazing-and-leak-test process to ship a finished, tested cold plate.
Q.06Do you do skived-fin or extruded heat sinks?+
We CNC-machine bases and finishing features for skived-fin and extruded heat sinks; the skiving and extrusion themselves are produced by partner workshops we've vetted on quality. Skiving is the right process when you need fin aspect ratios above ~15:1 (CNC's practical limit) or fin pitch below 1 mm — typically for very high-density forced-air cooling. Extrusion is the right process at volumes above ~5,000 pcs with constant linear fin geometry. We're transparent about which steps we do in-house and which go through partners; the customer always knows the supply chain on their parts.
Q.07Can you support heat-pipe or vapor-chamber integration?+
For heat-pipe-ready bases and fin blocks — yes. We CNC-machine the base with precise pipe grooves (typically ⌀6 mm or ⌀8 mm), and the fin block with matching pipe holes, sized for solder or thermal-epoxy assembly. Pipe insertion and the final solder/braze step are done by the customer or by a partner workshop we coordinate. For vapor chambers (sealed two-phase phase-change heat spreaders) — these are a specialized supplier category; we machine the housings and integration interfaces, but the vapor chamber itself is sourced via partner.
Q.08What surface finish should I specify for the mating face?+
For an AI GPU / CPU mating face, specify Ra 0.4–0.8 µm. Below Ra 0.4 µm starts to get expensive (lapping or polishing) without significantly improving thermal contact, because the TIM fills the remaining micro-roughness. Above Ra 0.8 µm, you start trading away interface performance. For nickel-plated copper cold plates, specify the plating thickness (typically 5–15 µm) and whether the mating face is masked or plated — we handle both. We can also leave a "machine to size after plate" allowance if your plating adds dimensional growth.
Q.09What lead time should I expect?+
Standard prototype lead time is 5–10 working days for CNC-machined Cu or Al heat sinks of moderate complexity, from drawing approval. Pilot lots (10–500 pcs) typically run 2–3 weeks. Series production schedules are committed in advance — for an ongoing program we hold setup and can ship weekly or monthly batches. Hybrid Cu+Al constructions or parts requiring partner steps (skiving, brazing, plating) add typical partner lead times — we tell you the full schedule with the quote, not after the PO.
Q.10Do you sign NDAs and protect our designs?+
Yes, NDAs are signed on request before any file is uploaded — standard for AI server OEM, hyperscaler, and accelerator card customers. Drawings are restricted internally to assigned engineers and operators. We don't upload customer files to public cloud quoting platforms, and we don't share heat-sink designs with other customers. For programs that span partner workshops (skiving, brazing), partner shops sign back-to-back NDAs before any drawing is shared.
Q.11What inspection reports do you provide?+
Every order includes a FAI report (first-article CMM inspection against drawing), material certificate (mill cert with chemistry and mechanical properties), and a base flatness record measured on a surface profilometer. On request we provide full CMM dimensional per lot, Ra measurement on critical surfaces, and statistical process control data for ongoing batches. Records are retained for 3 years and fully traceable — batch, machine, operator, inspection date.
Q.12How do I send a request — drawing only, or thermal spec also?+
Either works. If you have a finished design, send 3D CAD (STEP / X_T / IGES) + 2D drawings with flatness, Ra, material, finish, and quantity callouts — we'll quote machining directly. If you're at the design stage and want a proposal, send a thermal spec: target TDP, ambient, airflow, mechanical envelope, mounting pattern, and any GPU package info — our engineers will propose a construction (all-Cu, all-Al, hybrid, cold plate), fin geometry, and base spec, with the quote. Either way you get free DFM and no commitment before you place an order.

Send Us the Thermal Job Your AI Server Depends On

Upload a heat-sink drawing or a thermal spec — our engineers will return a manufacturable design proposal and quote with free DFM feedback within 2 business hours. No commitment, NDA on request, and your design stays confidential.

Quick quote within 2 hours

Fill out our contact form below. By providing us with details about your needs, you enable our experts to tailor solutions that perfectly align with your specifications.

*If you have any design files that need to be sent, please email them to sales@richconn.com