Products

AI Inference Infrastructure
Product Matrix

For AI inference and compute deployments at different scales, IFT provides a product matrix from standardized delivery units to server product lines, helping customers build sustainable inference infrastructure.

Token Factory Servers

TOKEN FACTORY

Token Factory Systems

IFT turns the AI Factory concept into standardized Token Factory solutions for inference workloads. Based on each customer’s workload, site conditions, power resources, and deployment timeline, we integrate cluster planning, rack layout, networking, storage, power distribution, cooling, system integration, on-site tuning, and troubleshooting to deliver operational and scalable inference infrastructure with lower TCO and faster rollout.

Standardized

Breaks compute clusters, racks, networking, power distribution, and cooling into repeatable delivery modules.

End-to-End

Covers solution design, system integration, rack deployment, on-site tuning, troubleshooting, and project acceptance.

Fast Scale-Up

Supports expansion from local enterprise inference to 32-node standard units, scaled clusters, and IDC-level compute factories.

Low TCO

Optimizes around cost per token, long-term energy use, deployment cycle, and O&M complexity.

EDGE starts local enterprise inference, POD32 standardizes scale-up, SCALE supports large-scale expansion, SITE enables low-TCO site conversion, BOX supports fast deployment and multi-site replication.

IFT-EDGE

IFT-EDGEToken Factory

Enterprise Edge Inference

For enterprise knowledge bases, AI agents, internal search, and workflow automation, enabling customers to deploy usable local inference quickly.

Best For

Private enterprise deployment / Data stays in-domain / Internal AI pilots

8–16 nodes

Local deployment

Data stays in-domain

Low-barrier launch

View Positioning+

Designed for enterprises moving from cloud token calls to local inference without building a large compute center yet. IFT-EDGE focuses on fast deployment, data security, and application pilots, helping customers get internal AI use cases running first.

IFT-POD32

IFT-POD32Token Factory

32-Node Token Factory

Uses a 32-node inference cluster as the base module to build a repeatable, scalable, and fast-deploying token production unit.

Best For

Mid-to-large enterprises / Cloud service providers / Campus compute nodes / AI application companies

32-node unit

Standardized module

Air / liquid cooling

Fast expansion

View Positioning+

Built for mid-to-large customers that already have site, power, or rack resources and need production-grade inference capacity quickly. IFT-POD32 turns complex cluster delivery into standardized modules for fast deployment, replication, and scale-out.

IFT-SCALE

IFT-SCALEToken Factory

Scaled Inference Platform

For model companies, internet platforms, and compute operators, supporting growth from validation clusters to thousand-GPU and ten-thousand-GPU inference clusters.

Best For

Model companies / Internet platforms / Compute operators / Large-scale inference

Multi-Pod expansion

Thousand-GPU / ten-thousand-GPU clusters

Architecture adaptation

On-site troubleshooting

View Positioning+

Designed for customers with defined inference workloads and high requirements for cluster stability, scalability, and on-site problem solving. IFT-SCALE focuses on architecture adaptation, performance validation, stability tuning, and closed-loop field issue resolution in large cluster deployments.

IFT-SITE

IFT-SITEToken Factory

Low-TCO Site Conversion

Converts existing IDCs, factory buildings, or campus spaces into low-TCO compute factories based on site conditions, power, cooling, and workload requirements.

Best For

Existing IDCs / Factory buildings / Campus spaces / Energy-side sites

Site-level design

Low-TCO optimization

Power & cooling

Integrated delivery

View Positioning+

Designed for customers that already own site, power, or rack resources but lack AI inference infrastructure design and deployment capability. IFT-SITE reuses existing assets, lowers buildout barriers and long-term operating costs, and turns traditional sites into infrastructure for sustainable token production.

IFT-BOX

IFT-BOXToken Factory

Containerized Token Factory

For overseas delivery, temporary expansion, and multi-site replication, using pre-integrated modules to form operational inference compute units quickly.

Best For

Overseas deployment / Temporary expansion / Energy-side sites / Multi-site replication

Containerized deployment

Factory pre-integration

Fast connection

Movable expansion

View Positioning+

Designed for customers with tight site construction schedules or a need to build inference capacity quickly in overseas, energy-side, or temporary scenarios. IFT-BOX reduces on-site complexity through pre-integration and modular delivery, accelerates compute go-live, and supports later relocation or multi-site replication.

Server Products

Servers

Covers 6U, 5U, 4U, and 2U server form factors for GPU inference, private enterprise clusters, high-density compute nodes, and data center delivery.

6U AI Servers

4 Products

For high-density GPUs, liquid cooling, high-power AI servers, and data center delivery.

View 6U Products ↓

5U GPU Servers

2 Products

For high-power GPUs, complex cooling, and high-density compute deployment.

View 5U Products ↓

4U GPU Servers

4 Products

For multi-GPU inference, enterprise GPU pools, and mid-to-large inference nodes.

View 4U Products ↓

2U Servers

5 Products

For general compute, storage expansion, virtualization, high-concurrency workloads, and lightweight AI inference.

View 2U Products ↓

6U AI Servers

6U Servers

For high-density GPUs, liquid cooling, high-power AI servers, and data center delivery.

IFT0034A

IFT0034A6U

6U Intel AI Server

A 6U dual-socket server for Intel Xeon platforms, suitable for high-power GPUs, complex cooling, and enterprise AI inference node deployment. The 6U chassis provides greater thermal headroom for long-term stability.

Best For

AI inference nodes / High-power GPU deployment / Private clusters / Data center delivery

6U dual-socket form factor

Dual-socket Intel Xeon platform

32 DDR5 5600MT/s DIMMs

CRPS redundant power support

View Full Specs+

Core Configuration

Form Factor

6U dual-socket server.

Processor

Supports two 4th / 5th Gen Intel Xeon Scalable processors, up to 64 cores per CPU and up to 385W TDP.

Memory

Intel platform supports up to 32 DDR5 5600MT/s DIMMs.

Chassis Size

850mm(D) × 438mm(W) × 263.6mm(H).

Storage & I/O

Internal Storage

Supports one M.2 SATA / NVMe device.

Front Storage

Supports 4 Anybay drives and 2 NVMe drives.

Right Ear

Supports 2 USB 3.0 / USB 2.0 ports.

BMC

Supports RJ45 management port, USB 3.0, Mini-DP, and Micro USB.

Power, Cooling & Management

Power

Supports 2000W / 2700W / 3200W CRPS power supplies with 3+ redundancy, 220V AC, and 240V HVDC.

Cooling

Default 2 groups of 5 × 056 fans, with N+1 redundancy.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

IFT0034L

IFT0034L6U

6U Intel NVLink GPU Server

A 6U Intel platform server for high-density GPU and NVLink interconnect scenarios. Designed for AI infrastructure projects with higher requirements for GPU-to-GPU communication, system cooling, and power redundancy.

Best For

High-density GPU inference / NVLink interconnect / LLM serving / Data center deployment

Supports 16 H20 GPUs

NVL16 Carrier Board

15 NVLink connections per GPU

Optional liquid-cooling specification

View Full Specs+

Core Configuration

Form Factor

6U dual-socket high-density GPU server.

Processor

Supports two 4th / 5th Gen Intel Xeon Scalable processors, up to 64 cores per CPU and up to 385W TDP.

Memory

Intel platform supports up to 32 DDR5 5600MT/s DIMMs.

Chassis Size

850mm(D) × 438mm(W) × 263.6mm(H).

GPU, Interconnect & Storage

GPU Configuration

Supports 16 H20 GPUs and BF3.

NVLink

NVL16 Carrier Board integrates 4 NVLink Switch chips with 240 NVLinks in total; each GPU connects to 15 NVLinks, with bandwidth up to 800GB/s.

Internal Storage

Supports one M.2 device.

Front Storage

Supports 4 Anybay drives and 2 NVMe drives.

Power, Cooling & Management

Power

Default one CRPS power supply; supports 6+ redundancy, 2700W / 3200W, and optional titanium-grade or higher PSU.

Air Cooling

Default 5 × 8038 fans, with 4+1 redundancy.

Liquid Cooling

Supports up to 36kW maximum power and 10PSI maximum pressure drop; supports independent outlet specifications for system, CPU, switch, and GPU cold plates.

Operating Systems

Supports Ubuntu, CentOS, VMware, KVM, Docker, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

IFT5134A

IFT5134A6U

6U AMD AI Server

A 6U dual-socket AI server for AMD EPYC platforms, suited for inference cluster nodes that require high CPU core counts, high memory bandwidth, and strong stability.

Best For

AI inference / High-performance computing / Private deployment / Compute clusters

Dual-socket AMD EPYC platform

Up to 192 cores per CPU

24 DDR5 6400MT/s DIMMs

5 × 8080 fan modules

View Full Specs+

Core Configuration

Form Factor

6U dual-socket server.

Processor

Supports two AMD 4th / 5th Gen EPYC processors, up to 192 cores per CPU and up to 500W TDP.

Memory

Supports up to 24 DDR5 6400MT/s DIMMs, 1DPC.

Chassis Size

850mm(D) × 438mm(W) × 263.6mm(H).

Storage & I/O

Front Storage

Supports 4 Anybay drives and 2 NVMe storage devices.

Right Ear

Supports one VGA and two USB 3.0 / USB 2.0 ports.

BMC

Supports RJ45 management port, USB 3.0, Mini-DP, and Micro USB.

Power, Cooling & Management

Power

Supports 2000W / 2700W / 3200W CRPS power supplies with 4+ redundancy, optional titanium-grade PSU, 220V AC, and 240V HVDC.

Cooling

Default 2 groups of 5 × 8080 standard fans, with N+1 redundancy.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

IFT0034AL

IFT0034AL6U

6U Liquid-Cooled AI Server

A 6U heterogeneous-platform server for high-power AI systems and liquid-cooled data center deployments. It supports Intel or AMD dual-socket platforms and provides both air-cooling and liquid-cooling options for high-density, high-power, long-running environments.

Best For

Liquid-cooled AI servers / High-power GPU nodes / Data center deployment / Heterogeneous platform delivery

Intel or AMD platform support

Air and liquid cooling options

Up to 3200W CRPS power

Built for high-power GPU nodes

View Full Specs+

Core Configuration

Form Factor

6U dual-socket heterogeneous-platform server.

Processor

Supports Intel Xeon or AMD EPYC dual-socket platforms, configurable by GPU, memory capacity, and delivery scenario.

Memory

Intel platform supports up to 32 DDR5 5600MT/s DIMMs; AMD platform supports up to 24 DDR5 6400MT/s DIMMs.

Chassis Size

850mm(D) × 438mm(W) × 263.6mm(H).

Storage & I/O

Internal Storage

Supports one M.2 SATA / NVMe device.

Front Storage

Supports 4 Anybay drives and 2 NVMe drives.

Right Ear

Supports 2 USB 3.0 / USB 2.0 ports.

BMC

Supports RJ45 management port, USB 3.0, Mini-DP, and Micro USB.

Power, Cooling & Management

Power

Supports 2000W / 2700W / 3200W CRPS power supplies with 3+ redundancy, optional titanium-grade PSU, 220V AC, and 240V HVDC.

Air Cooling

Default 5 × 8038 standard fans, with N+1 redundancy.

Liquid Cooling

Supports liquid cooling for CPU and GPU components, with independent outlet specifications for system, CPU, and GPU cold plates.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

5U GPU Servers

5U Servers

For high-power GPUs, complex cooling, and high-density compute deployment.

IFT0073H

IFT0073H5U

5U Intel GPU Server

A 5U GPU server for Intel Xeon platforms, offering more chassis space and multiple GPU configuration options for high-power GPUs, complex cooling, and high-density inference nodes.

Best For

High-density inference / Multi-GPU servers / Model serving / Mixed training and inference

5U dual-socket form factor

Dual-socket Intel Xeon platform

Supports 8 dual-width GPU configurations

2200W CRPS power support

View Full Specs+

Core Configuration

Form Factor

5U dual-socket GPU server.

Processor

Supports two 4th / 5th Gen Intel Xeon Scalable processors, up to 64 cores per CPU, HBM support, and up to 350W TDP.

Memory

Supports up to 32 DDR5 DIMMs, up to 5600MT/s.

Motherboard Size

380mm × 427mm.

Dimensions

220mm × 435.5mm × 850mm.

GPU & Storage

Front Storage

Supports 12 × 3.5-inch SAS / SATA / NVMe drives.

GPU Configuration

Supports 8 dual-width GPUs and 5 single-width NICs through switch expansion; supports 8 dual-width GPUs and 4 single-width NICs through direct connection.

I/O, Power & Cooling

Front I/O

Left ear supports power, UID, and system status indicators; right ear supports VGA, USB 3.0 Type-A, and USB 2.0 Type-A.

Rear I/O

Supports RJ45-MNG, MicroUSB-UART, USB Type-A, and Mini-DP.

Power

CRPS specification; supports AC / HVDC, 1300W / 1600W / 2000W / 2200W, N+1 redundancy, and hot-swap.

Cooling

Lower 4 × 056 fan modules and upper 4 × 056 fan modules, with N+1 redundancy.

Operating Environment

Temperature: 5°C–40°C.

IFT5173H

IFT5173H5U

5U AMD GPU Server

A 5U high-performance GPU server for dual-socket AMD EPYC platforms. Built for high-density inference, GPU batch delivery, and enterprise AI infrastructure.

Best For

Large-scale inference / GPU clusters / Enterprise AI infrastructure / Batch server delivery

Dual-socket AMD EPYC platform

24 DDR5 6400MT/s DIMMs

5U expanded cooling space

3200W CRPS power support

View Full Specs+

Core Configuration

Form Factor

5U dual-socket GPU server.

Processor

Supports two AMD 4th / 5th Gen EPYC processors, up to 192 cores per CPU and up to 500W TDP.

Memory

Supports up to 24 DDR5 6400MT/s DIMMs, 1DPC.

Chassis Size

219.5mm × 447mm × 892.6mm.

Storage & I/O

Front Storage

Supports up to 12 × 3.5-inch Anybay drives.

Front I/O

Supports DP and USB 3.0.

BMC

Supports RJ45-MNG, MicroUSB-UART, USB Type-A, and Mini-DP.

Power, Cooling & Management

Power

Supports 2700W / 3200W CRPS power supplies with 2+ redundancy, 220V AC, and 240V HVDC.

Cooling

Lower 4 × 056 fan modules and upper 4 × 056 fan modules, with N+1 redundancy and hot-swap support.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

4U GPU Servers

4U Servers

For multi-GPU inference, enterprise GPU pools, and mid-to-large inference nodes.

IFT5171G

IFT5171G4U

4U GPU Compute Server

A 4U GPU server for AI inference and multi-GPU compute. It supports high-power GPUs, PCIe 5.0 expansion, and redundant power for stable enterprise-scale inference deployments.

Best For

Multi-GPU inference / AI agent clusters / Model serving / GPU compute

4U GPU server form factor

Supports 4 dual-width GPUs

PCIe 5.0 expansion

Dual CRPS redundant power

View Full Specs+

Core Configuration

Form Factor

4U GPU server.

Processor

Single-node C2 module with dual-CPU architecture; up to 72 cores per CPU, 144 cores per system, and up to 500W TDP.

Memory

C2 module supports LPDDR5x, up to 960GB memory and approx. 1TB/s maximum bandwidth.

Dimensions

Approx. 88mm × 438mm × 900mm.

GPU, Storage & Expansion

GPU

Supports 4 dual-width FHFL GPUs, up to 400W TDP per GPU.

Front Storage

Supports 6 U.2 NVMe SSD slots.

Internal Storage

Two onboard M.2 interfaces.

PCIe

Supports up to 7 PCIe 5.0 x16 slots, including 4 DW slots and 3 SW slots.

I/O, Power & Cooling

BMC

Supports RJ45 BMC, USB 3.0, and mini-DP.

Power

Supports two CRPS power supplies, up to 3200W per PSU, with 1+1 redundancy.

Cooling

Supports 6 × 6056 fans with 5+1 redundancy.

IFT0073G

IFT0073G4U

4U AMD GPU Server

A 4U GPU server based on AMD EPYC, supporting multiple dual-width GPUs and high-density fan modules. Suitable for mid-to-large inference nodes, model serving, and enterprise GPU pools.

Best For

GPU inference / Model serving / Private GPU pools / Batch delivery

AMD EPYC Genoa / Turin platform

Supports up to 6 dual-width GPUs

12 DDR5 6400MT/s DIMMs

N+1 redundant cooling

View Full Specs+

Core Configuration

Form Factor

4U GPU server.

Processor

Supports one AMD EPYC 4th Gen Genoa / 5th Gen Turin processor, up to 192 cores and up to 500W TDP.

Memory

Supports up to 12 DDR5 6400MT/s DIMMs.

Dimensions

175.2mm × 447mm × 877mm.

GPU, Storage & Expansion

GPU

Supports up to 6 dual-width GPUs.

Front Storage

Supports up to 12 × 2.5-inch SAS / SATA / NVMe drives.

BMC

Pluggable design with onboard BIOS / BMC ROM and dual-Flash support.

I/O, Power & Cooling

Front I/O

Left ear supports VGA; right ear supports power button LED, UID button LED, system status LED, and USB 3.0.

Rear I/O

Supports power button LED, UID button LED, system status LED, Type-A USB 3.0, RJ45 management port, Mini-DP, and Mini USB.

Power

CRPS specification; supports AC / HVDC, 1300W / 1600W / 2000W / 2200W / 2700W, and N+1 redundancy.

Cooling

Lower 4 × 056 fan modules and upper 4 × 056 fan modules, with N+1 redundancy and hot-swap support.

Operating Environment

Temperature: 5°C–40°C.

IFT0073S

IFT0073S4U

4U Intel GPU Server

A 4U dual-socket Intel Xeon GPU expansion server with up to 24 front drive bays, rear storage expansion, and multiple PCIe GPU options. Designed for inference and compute workloads that need balanced CPU and GPU resources.

Best For

GPU inference / High-concurrency workloads / Multi-GPU compute / Data processing

Dual-socket Intel Xeon platform

32 DDR5 5600MT/s DIMMs

Up to 24 front 2.5-inch drive bays

RAID 0 / 1 / 10 / 5 / 50 / 60 support

View Full Specs+

Core Configuration

Form Factor

4U dual-socket GPU server.

Processor

Supports two 4th / 5th Gen Intel Xeon Scalable processors, up to 64 cores per CPU, HBM support, and up to 350W TDP.

Memory

Supports up to 32 DDR5 DIMMs, up to 5600MT/s.

Motherboard Size

380mm × 427mm.

Dimensions

175.2mm × 447mm × 877mm.

Storage & Expansion

Front Storage

Supports up to 24 × 2.5-inch SAS / SATA / NVMe drives.

Rear Storage

Rear side supports up to 12 × 2.5-inch SAS / SATA / NVMe drives, plus an additional 4 × 2.5-inch SAS / SATA / NVMe drives.

PCIe

Supports up to 6 rear PCIe SW slots and 1 OCP slot.

RAID

Supports RAID 0, 1, 10, 5, 50, 60, and related storage configurations.

I/O, Power & Management

I/O

Left ear supports power, UID, and system status indicators; right ear supports VGA, USB 3.0 Type-A, and USB 2.0 Type-A.

BMC

Supports RJ45-MNG, MicroUSB-UART, USB 2.0 Type-A, USB 3.0 Type-A, and Mini-DP.

Power

Supports AC / HVDC, 1300W / 1600W / 2000W / 2200W, and 3+1 redundancy.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

IFT5173G

IFT5173G4U

4U Dual-Socket AMD GPU Server

A 4U GPU server for dual-socket AMD EPYC platforms. It is suited for large-scale inference, private AI clusters, and high-density GPU resource deployment, with high-core-count CPUs, DDR5 memory, and redundant power and cooling.

Best For

LLM inference / Private AI clusters / GPU resource pools / Batch delivery

Dual-socket AMD EPYC platform

Up to 192 cores per CPU

24 DDR5 6400MT/s DIMMs

3200W CRPS power support

View Full Specs+

Core Configuration

Form Factor

4U dual-socket GPU server.

Processor

Supports two AMD 4th / 5th Gen EPYC processors, up to 192 cores per CPU and up to 500W TDP.

Memory

Supports up to 24 DDR5 6400MT/s DIMMs, 1DPC.

Chassis Size

175.2mm × 447mm × 892.6mm.

Storage & Expansion

Front Storage

Supports up to 12 × 3.5-inch Anybay drives.

Front I/O

Supports DP and USB 3.0.

BMC

Supports RJ45-MNG, MicroUSB-UART, USB Type-A, and Mini-DP.

Power, Cooling & Management

Power

Supports 2000W / 2700W / 3200W CRPS power supplies with 2+ redundancy, 220V AC, and 240V HVDC.

Cooling

Lower 4 × 056 fan modules and upper 4 × 056 fan modules, with N+1 redundancy and hot-swap support.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

Operating Environment

Operating temperature: 5°C–35°C; RH 8%–85%.

2U Servers

For general compute, storage expansion, virtualization, high-concurrency workloads, and lightweight AI inference.

IFT0011

IFT00112U

2U Single-Socket Server

A 2U single-socket server for enterprise compute, storage expansion, and lightweight inference. Designed for stable operation, standard rack deployment, and flexible drive configurations.

Best For

General compute / Storage nodes / Lightweight inference / Private enterprise deployment

Standard 2U rack design

DDR5 6400MT/s memory support

Multiple HDD / SSD cage options

IPMI / Redfish management

View Full Specs+

Core Configuration

Form Factor

2U single-socket rack server.

Processor

Supports one 6th Gen Intel Xeon Scalable processor, up to 288 cores per CPU and up to 550W TDP.

Memory

Supports up to 12 DDR5 6400MT/s DIMMs.

Chassis Size

839.5mm(D) × 447mm(W) × 87mm(H).

Storage & Expansion

Front Storage

Supports multiple HDD cage options, including 12LFF, 24SFF, and 25SFF.

Rear Storage

Supports GPU / PCIe slot expansion, with optional 3.5-inch HDD cages and 2.5-inch SSD cages.

PCIe Resources

Supports multiple MCIO and PCIe expansion slots.

I/O, Power & Management

System I/O

Right ear supports power button, UID button, system status LED, and USB 3.0; left ear supports VGA.

BMC Interface

Supports mini-DP, GbE, USB 3.0, and Mini USB.

Power

Supports 1300W / 1600W / 2000W / 3200W CRPS power supplies.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

Operating Systems

Supports Ubuntu, CentOS, VMware, KVM, Docker, and related environments.

IFT5161

IFT51612U

2U AMD Compute Server

A 2U single-socket server based on the AMD EPYC platform for enterprise compute, storage expansion, virtualization, and basic inference workloads. It delivers strong CPU, memory, and PCIe expansion capacity within a standard 2U chassis.

Best For

Enterprise compute / Virtualization / Storage expansion / CPU inference

AMD EPYC Genoa / Turin platform

Up to 192 CPU cores

12 DDR5 RDIMMs

RAID and OCP expansion support

View Full Specs+

Core Configuration

Form Factor

2U single-socket rack server.

Processor

Supports one AMD EPYC 4th Gen Genoa / 5th Gen Turin processor, up to 192 cores and up to 500W TDP.

Memory

Supports 12 DDR5 RDIMMs, up to 6400MT/s.

Chassis Size

Approx. 87mm × 447mm × 850mm.

Storage & Expansion

Front Storage

Supports 12 × 3.5-inch SATA / NVMe hot-swap drives, or 2 × 2.5-inch NVMe hot-swap drives.

Internal Storage

Supports M.2 2280.

OCP Expansion

Supports two OCP 3.0 SFF slots for NIC expansion.

RAID

Supports RAID 0 / 1 / 10 / 5 / 50 / 60 through PCIe RAID cards.

I/O, Power & Management

Front I/O

Supports Type-A USB 3.0, USB 2.0, and VGA.

Rear I/O

Supports Type-A USB 3.0, Type-A USB 2.0, Mini-DP, RJ45 management port, and Micro USB UART.

Power

Supports two CRPS power supplies with 1+1 redundancy; optional 800W / 1300W / 1600W / 2000W / 2700W.

Cooling

Supports 4 × 8056 fans with N+1 redundancy.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

IFT0063

IFT00632U

2U Dual-Socket Intel Server

A 2U dual-socket server for Intel Xeon Scalable platforms. It provides more memory slots, a flexible rear I/O zone, and multiple PCIe 5.0 expansion options for enterprise compute and high-expansion workloads.

Best For

Enterprise compute / High-speed I/O / Virtualization / High-performance storage nodes

Dual-socket Intel Xeon platform

32 DDR5 RDIMMs

PCIe 5.0 expansion

Intel VROC RAID support

View Full Specs+

Core Configuration

Form Factor

2U dual-socket rack server.

Processor

Supports two 4th / 5th Gen Intel Xeon Scalable processors, up to 350W TDP per CPU.

Memory

Supports 32 DDR5 RDIMMs, up to 5600MT/s, 1DPC.

Storage & Expansion

Front Storage

Optional 12 × 3.5-inch SAS / SATA / NVMe hot-swap drives, or 24 × 2.5-inch SAS / SATA / NVMe hot-swap drives.

Rear I/O Zone

Supports configurations including PCIe 5.0 x16, 3.5-inch hot-swap storage, 2.5-inch hot-swap storage, PCIe 5.0 x8 LP, and PCIe 5.0 x16 FHFL.

OCP

Rear OCP zone supports two OCP 3.0 SFF expansion cards with multi-host support.

RAID

Supports Intel VROC and PCIe RAID cards, including RAID 0 / 1 / 10 / 5 / 50 / 60.

I/O, Power & Management

USB Ports

Right ear supports USB 3.0 and USB 2.0; rear BMC supports USB 3.0, USB 2.0, and Micro USB.

Network & Display

Supports a dedicated 100Mbps management port, front VGA, and rear BMC Mini-DP.

Power

Supports 800W / 1300W / 1600W / 2000W / 2700W CRPS power supplies, with 220V AC and 240V HVDC input.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

IFT5163

IFT51632U

2U Dual-Socket AMD Server

A 2U high-performance server for dual-socket AMD EPYC platforms. It is built for high-concurrency inference, enterprise compute, and storage-intensive workloads, with dual CPUs, 24 DDR5 DIMMs, and flexible front / rear storage.

Best For

High-concurrency inference / Enterprise compute / Data processing / Storage expansion

AMD EPYC 9004 / 9005 platform

Dual-socket CPU architecture

24 DDR5 6400MT/s DIMMs

3200W CRPS power support

View Full Specs+

Core Configuration

Form Factor

2U dual-socket rack server.

Processor

Supports two AMD EPYC 9004 / 9005 Series processors, up to 192 cores per CPU and up to 400W TDP.

Memory

Supports up to 24 DDR5 6400MT/s DIMMs, 1DPC.

Chassis Size

19-inch rackmount chassis, approx. 87.3mm × 438mm × 799mm.

Storage & Expansion

Internal Storage

Supports one M.2 SATA / NVMe device.

Front Storage

Supports up to 12 × 3.5-inch Anybay drives, or 24 × 2.5-inch NVMe SSDs.

Rear Storage

Supports up to 4 × 2.5-inch Anybay drives.

I/O, Power & Management

Front I/O

Right ear supports DP and USB 3.0 / USB 2.0; left ear supports power, UID, system status, and OCP indicators.

BMC

Supports power, UART, USB 3.0, RJ45-MNG, Mini-DP, UID, and system status interfaces.

Power

Supports 2000W / 2700W / 3200W CRPS power supplies with 1+1 redundancy, 220V AC, and 240V HVDC.

Cooling

Supports 4 × 6056 fan modules with N+1 redundancy.

Operating Systems

Supports Ubuntu, CentOS, Red Hat, Rocky, Windows Server, and related environments.

IFT0414

IFT04142U

2U High-Density AMD Server

A 2U high-density server for dual-socket AMD EPYC platforms. It is suited for enterprise compute, data processing, storage expansion, and high-concurrency inference, with dual CPUs, 24 DDR5 DIMMs, and flexible Anybay / NVMe storage.

Best For

High-concurrency inference / Enterprise compute / Data processing / High-density storage

AMD EPYC 9004 / 9005 platform

2U dual-socket architecture

24 DDR5 6400MT/s DIMMs

2000W / 2700W / 3200W CRPS power support

View Full Specs+

Core Configuration

Platform Model

Reech MGX R723.

Form Factor

2U dual-socket rack server.

Processor

Supports two AMD 9004 / 9005 Series EPYC processors, up to 192 cores per CPU and up to 400W TDP.

Memory

Supports up to 24 DDR5 6400MT/s DIMMs, 1DPC.

Chassis Size

19-inch rackmount chassis, approx. 87.3mm × 438mm × 799mm.

Storage & Expansion

Internal Storage

Supports one M.2 SATA / NVMe device.

Front Storage

Supports up to 12 × 3.5-inch Anybay drives, or 24 × 2.5-inch NVMe SSDs.

Rear Storage

Supports up to 4 × 2.5-inch Anybay drives.

I/O, Power & Management

Right Ear

Supports one DP and two USB 3.0 / USB 2.0 ports.

Left Ear

Supports PWR button LED, UID button LED, system status LED, and OCP indicator.

BMC

Supports PWR button LED, UART, USB 3.0, RJ45-MNG, Mini-DP, UID button LED, and system status LED.

Power

Supports 2000W / 2700W / 3200W CRPS power supplies with 1+1 redundancy, optional titanium-grade PSU, 220V AC, and 240V HVDC.

Cooling

4 × 6056 fan modules with N+1 redundancy and hot-swap support.

Management

Supports web-based management UI, KVM over IP, IPMI, and Redfish.

Server Infrastructure

From Server Selection to Batch Delivery

IFT supports server selection, configuration planning, system testing, and batch delivery for real AI infrastructure deployment needs. Customers can choose the right model based on GPU count, CPU platform, cooling method, power conditions, and rack density.

6U / 5U / 4U / 2UIntel / AMD PlatformsGPU Inference ServersAir / Liquid CoolingIPMI / Redfish ManagementBatch Delivery Support

Talk to Us →

AI Inference InfrastructureProduct Matrix

Token Factory Systems

Standardized

End-to-End

Fast Scale-Up

Low TCO

Enterprise Edge Inference

32-Node Token Factory

Scaled Inference Platform

Low-TCO Site Conversion

Containerized Token Factory

Servers

6U Servers

6U Intel AI Server

Core Configuration

Storage & I/O

Power, Cooling & Management

6U Intel NVLink GPU Server

Core Configuration

GPU, Interconnect & Storage

Power, Cooling & Management

6U AMD AI Server

Core Configuration

Storage & I/O

Power, Cooling & Management

6U Liquid-Cooled AI Server

Core Configuration

Storage & I/O

Power, Cooling & Management

5U Servers

5U Intel GPU Server

Core Configuration

GPU & Storage

I/O, Power & Cooling

5U AMD GPU Server

Core Configuration

Storage & I/O

Power, Cooling & Management

4U Servers

4U GPU Compute Server

Core Configuration

GPU, Storage & Expansion

I/O, Power & Cooling

4U AMD GPU Server

Core Configuration

GPU, Storage & Expansion

I/O, Power & Cooling

4U Intel GPU Server

Core Configuration

Storage & Expansion

I/O, Power & Management

4U Dual-Socket AMD GPU Server

Core Configuration

Storage & Expansion

Power, Cooling & Management

2U Servers

2U Single-Socket Server

Core Configuration

Storage & Expansion

I/O, Power & Management

2U AMD Compute Server

Core Configuration

Storage & Expansion

I/O, Power & Management

2U Dual-Socket Intel Server

Core Configuration

Storage & Expansion

I/O, Power & Management

2U Dual-Socket AMD Server

Core Configuration

Storage & Expansion

I/O, Power & Management

2U High-Density AMD Server

Core Configuration

Storage & Expansion

I/O, Power & Management

From Server Selection to Batch Delivery

AI Inference Infrastructure
Product Matrix