FPGA-Powered AI Runs Karpathy's microGPT at 50,000 Tokens Per Second on DE1-SoC Hardware
TALOS-V2 runs Karpathy's microGPT on a DE1-SoC Cyclone V FPGA board at 50,000 tokens per second. The design is written in SystemVerilog RTL, uses fixed-point arithmetic in place of floating point, and provides real-time controls and token output on the board's onboard displays.
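
To make the fixed-point idea concrete, here is a minimal Python sketch of how a real number can be stored as a scaled integer and multiplied using only integer operations, the kind of arithmetic that maps naturally onto FPGA logic. The Q8.8 format (16-bit signed, 8 fractional bits) and the helper names `saturate`, `to_fixed`, `fixed_mul`, and `to_float` are assumptions chosen for illustration; the number format TALOS-V2 actually uses is not stated here.

```python
# Illustrative fixed-point arithmetic in an ASSUMED Q8.8 format.
# This is not TALOS-V2's code; the real design's bit widths are not given.

FRAC_BITS = 8                      # assumed number of fractional bits
SCALE = 1 << FRAC_BITS             # 256: one unit in the integer encoding is 1/256
INT_MIN = -(1 << 15)               # lower bound of a signed 16-bit word
INT_MAX = (1 << 15) - 1            # upper bound of a signed 16-bit word


def saturate(value: int) -> int:
    """Clamp an integer to the signed 16-bit range instead of wrapping."""
    return max(INT_MIN, min(INT_MAX, value))


def to_fixed(x: float) -> int:
    """Encode a real number as a Q8.8 integer, rounding to the nearest step."""
    return saturate(round(x * SCALE))


def fixed_mul(a: int, b: int) -> int:
    """Multiply two Q8.8 values.

    The raw product carries 2 * FRAC_BITS fractional bits, so it is shifted
    right by FRAC_BITS to return to Q8.8 before saturating.
    """
    return saturate((a * b) >> FRAC_BITS)


def to_float(q: int) -> float:
    """Decode a Q8.8 integer back to a real number."""
    return q / SCALE


a = to_fixed(1.5)
b = to_fixed(-2.25)
product = fixed_mul(a, b)
print(a, b, to_float(product))
```

In this sketch, values outside the representable range are clamped rather than wrapped, which is one common design choice for neural-network arithmetic; whether TALOS-V2 saturates, wraps, or uses wider accumulators is not specified.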