TCPIP-1G/10G
1G/10G TCP/IP Hardware Stack

The TCPIP-1G/10G core implements a complete TCP/IP Hardware Protocol Stack. More capable than many offloading engines, it allows systems to connect to an Internet Protocol (IP) network and exchange data using the TCP protocol without requiring assistance from — or even the presence of — a system processor.

The core acts either as a server or a client and, without assistance from the host system, it autonomously opens, maintains, and closes TCP connections. The system integrating the TCPIP-1G/10G core can configure network parameters and preferences by accessing its control registers, and the core is then able to receive and send data via streaming data interfaces. 

The highly configurable core can adapt to different applications and diverse system requirements. The maximum number of simultaneous TCP sessions is configurable at synthesis time; it can be as high as 32,768 for devices like data servers, or as small as 1 for edge devices requiring minimum silicon area and power. Further user options include implementing a DHCP client that allows the core to automatically be assigned an IP address, enabling or disabling support of the reassembly of out-of-order TCP packets data, and integrating a UDP hardware stack with multicast support (IGMPv3). Finally, users can choose the packet processing mode, either cut-through or store-and-forward. In cut-through mode, the payload data are delivered to the host system as they arrive without any internal packet buffering and before the packets’ integrity can be validated. As a result, the core operating in cut-through mode features extremely low latency and requires less memory, but it cannot reassemble out-of-order packets and it may deliver data that will subsequently be marked as corrupted. Under the store-and-forward mode of operation, the core will always deliver verified, in-order packets, but will have higher latency and require more memory resources.

The TCPIP-1G/10G core is rigorously verified and available in RTL source or as a targeted FPGA netlist. Its deliverables include a testbench, synthesis, and simulation scripts, and comprehensive user documentation.

The TCPIP core receives and transmits TCP packet data and forwards other traffic from the Ethernet MAC to the application and vice versa. It also receives and transmits ARP requests and responses, and responds to ICMP echo reply messages. Optionally the core can also support the UDP and DHCP protocols.

The core consists of the following modules:

  • The Ethernet module sends/receives Ethernet frames to/from an external Ethernet MAC, detects the frame type and multiplexes frames between the protocol stack subsystems.
  • The VLAN Tx/Rx module receives Ethernet frames from an external Ethernet MAC, and when enabled detects and compares VLAN tag ; filters frames to the correct VLAN tag. In the transmission direction, the module receives Ethernet frames from the Ether-net Frame module and adds the VLAN Tag to the frames when enabled.
  • The ARP module sends and receives ARP packets and handles the packets according to command in the packet. It provides the IP-to-MAC address translation services to the IPv4 layers in the protocol stacks.
  • The IPv4 module processes the received frames and distributes decoded frames to the related protocol stack modules. Frames from the upper protocol layers are encapsulated into the IPv4 frames and passed to the Ethernet module.
  • The TCP module provides the primary functionality of the TCPIP-1G/10G core, i.e. the TCP stack. The TCP stack is fully implemented in hardware. The transmit and receive buffers are located in an external memory accessed via the AXI bus. 
  • The ICMP module supports the echo request response (ping).
  • The UDP module is responsible for processing UDP packets for the DHCP module. The module can also provide user UDP channels to the user's design.
  • The IGMP module adds the IGMP protocol stack needed for UDP multicast support. The module is optional and requires user UDP channels feature enabled.
  • The DHCP module automatically requests and acquires an IP address from a DCHP server. The DHCP feature is optional and requires the UDP module.
  •  The Control and Status Registers control the core functionality and report its status.

The TCPIP-1G/10G can be mapped in any AMD FPGA device, provided sufficient resources are available. The FPGA resources required for the implementation of the core depend on its configuration. The following table provides sample implementation data for a few out of the many possible configurations of the core.

Configuration FPGA Resources
Store &
Forward
Max
Sessions
Max
Server Sessions
Max
Ports
Max
Rules
DHCP
Client
Logic
(LUTs)
Memory
(RAMBs)
No 4 2 4 4 Off 19,082 32
No 4 2 4 4 Off 20,589 32
Yes 4 2 4 4 Off 22,422 38
Yes 64 32 64 32 Off 23,586 41
Yes 256 128 128 64 Off 26,003 49
Yes 1,024 512 256 128 Off 34,325 109

Table 1. FPGA resources requirements for the TCPIP-1G/10G core with one UDP channel per direction, no OoO packet reordering, and clocked at 156 MHz on an ARTIX™ Ultrasclale+ device

The above table does not list all configuration options. Please contact CAST to get characterization data for your configuration and target device.

The TCPIP-1G/10G can be mapped in any Intel FPGA device, provided sufficient resources are available. The FPGA resources required for the implementation of the core depend on its configuration. The following table provides sample implementation data for a few out of the many possible configurations of the core.

Configuration FPGA Resources
Store &
Forward
Max
Sessions
Max
Server Sessions
Max
Ports
Max
Rules
DHCP
Client
Logic
(ALMs)
Memory
(M20Ks)
No 4 2 4 4 Off 16,034 98
No 4 2 4 4 Off 17,112 98
Yes 4 2 4 4 Off 18,969 118
Yes 64 32 64 32 Off 19,348 128
Yes 256 128 128 64 Off 21,325 152
Yes 1,024 512 256 128 Off 27,984 291

Table 1. FPGA resources requirements for the TCPIP-1G/10G core with one UDP channel per direction, no OoO packet reordering, and clocked at 156 MHz on an Agilex device.

The above table does not list all configuration options. Please contact CAST to get characterization data for your configuration and target device.

The TCPIP-1G/10G can be mapped in any Lattice FPGA device, provided sufficient resources are available. The FPGA resources required for the implementation of the core depend on its configuration. The following table provides sample implementation data for a few out of the many possible configurations of the core.

Configuration FPGA Resources
Store &
Forward
Max
Sessions
Max
Server Sessions
Max
Ports
Max
Rules
DHCP
Client
Logic
(LUT4s)
Memory
(RAMBs)
No 4 2 2 2 Off 35,871 82
Yes 4 3 2 2 Off 41,719 105
No 64 32 32 32 Off 43,489 96
Yes 64 32 32 32 Off 50,068 120

Table 1. FPGA resources requirements for the TCPIP-1G/10G core with one UDP channel per direction, no OoO packet reordering, and clocked at 62.5 MHz on a CertusPro NX device.

The above table does not list all configuration options. Please contact CAST to get characterization data for your configuration and target device.

Related Content

Features List

Complete TCP/IP Hardware Stack

  • 10/100/1000, 2.5G, and 10G Ethernet Transmit and Receive
  • TCP server or client with up to 32k simultaneous TCP sessions
  • Autonomous and highly efficient TCP connection establishment, maintenance and teardown, retransmission, flow and congestion control
  • Optional out-of-order TCP packet reassembly
  • Checksums generation and validation
  • IPv4 support
  • Jumbo and Super Jumbo Frames
  • ARP with Cache, ICMP (Ping Reply), VLAN (IEEE 802.1Q),
  • UDP, IGMP, and DHCP (optional)

Trouble-Free Operation

  • Highly flexible thanks to runtime programmable options, including:
    • Local MAC address, IP address, Gateway IP address, and IP subnet mask
    • IP Address, & Port Filters
    • Retransmission timeouts, limits, and enable/disable of fast retransmission
    • Flow control and congestion control
    • Window sizes, MSS
  • Non-TCP/UDP IP traffic is optionally forwarded to the system via a dedicated port

Easy SoC Integration

  • Standardized AMBA interfaces:
    • 64-bit AXI4-Stream for packet data
    • 32-bit AXI4-Lite for CSR (APB3 optional)
    • 256-bit AXI4 for external memory
  • Separate clock domains for packet processing and CSR interfaces
  • Independent from external memory type and controller, and from the ethernet MAC or PHY.
  • Optionally pre-integrated with Intel, Xilinx, or other third-party eMAC core

Companion Cores from CAST

Versions & Configuration Options

  • Maximum number of simultaneous TCP sessions (up to 32K)
  • Cut-through or store & forward processing

Deliverables

  • Synthesizable RTL or FPGA netlist
  • UVM Testbench & sample test cases
  • Simulation & synthesis scripts
  • Documentation

 

Resources

Software Tools

  • Wireshark - Open-source packet-analyzer
  • Netcat  - Utility for sending and receiving messages using TCP or UDP
  • lwIP - A Lightweight TCP/IP stack

Related IP Protocols

 

Let's talk about your project and our IP solutions

Request Info