New FinMCP-Bench Benchmark Tests AI Models on Real-World Financial Problem-Solving With 613 Samples and 65 Financial Tools
A new benchmark called FinMCP-Bench launches to rigorously test AI models on real-world financial problem-solving, featuring 613 samples, 65 real financial tools, and 33 sub-scenarios designed to measure both tool invocation accuracy and reasoning capabilities across mainstream large language models.