Research

499 articles found

Anthropic's Internal Leak Exposes 'Claude Mythos,' A Powerful New AI Model Surpassing Opus 4.6, As OpenAI Quietly Preps 'Spud' Ahead Of Rival IPOs

Anthropic's Internal Leak Exposes 'Claude Mythos,' A Powerful New AI Model Surpassing Opus 4.6, As OpenAI Quietly Preps 'Spud' Ahead Of Rival IPOs

Mar 28, 2026
The Decoder

Anthropic's accidental leak of nearly 3,000 internal files reveals 'Claude Mythos,' a powerful new AI model surpassing Opus 4.6 in coding, reasoning, and cybersecurity, while OpenAI quietly prepares a rival model codenamed 'Spud' — with both companies racing to launch flagship models ahead of their anticipated IPOs.

MIT's VibeGen AI Designs Proteins By Their Motion, Unlocking New Era In Medicine And Materials Science

MIT's VibeGen AI Designs Proteins By Their Motion, Unlocking New Era In Medicine And Materials Science

Mar 28, 2026
MIT News | Massachusetts Institute of Technology

MIT's new AI model VibeGen is revolutionizing molecular engineering by designing proteins based on their motion and vibrations rather than static structure, potentially unlocking breakthroughs in adaptive medicine and next-generation biomaterials like self-healing components and biodegradable plastics.

Vercel's Open Source AI Framework Turns Natural Language Into Live UI Components, Draws 13,000 GitHub Stars

Vercel's Open Source AI Framework Turns Natural Language Into Live UI Components, Draws 13,000 GitHub Stars

Mar 28, 2026
InfoQ

Vercel's newly open-sourced json-render framework lets AI models convert natural language prompts into live UI components across React, Vue, Svelte, and more, earning 13,000 GitHub stars since its January 2026 launch while sparking debate over whether it reinvents existing standards or genuinely disrupts how AI connects to rendering layers.

New FinMCP-Bench Benchmark Tests AI Models on Real-World Financial Problem-Solving With 613 Samples and 65 Financial Tools

New FinMCP-Bench Benchmark Tests AI Models on Real-World Financial Problem-Solving With 613 Samples and 65 Financial Tools

Mar 28, 2026
huggingface

A new benchmark called FinMCP-Bench launches to rigorously test AI models on real-world financial problem-solving, featuring 613 samples, 65 real financial tools, and 33 sub-scenarios designed to measure both tool invocation accuracy and reasoning capabilities across mainstream large language models.

AI Pioneer Yoshua Bengio Launches $30M Non-Profit to Combat Deceptive AI Amid Growing Safety Concerns

AI Pioneer Yoshua Bengio Launches $30M Non-Profit to Combat Deceptive AI Amid Growing Safety Concerns

Mar 27, 2026
Fortune

AI pioneer Yoshua Bengio launches LawZero, a $30M non-profit aimed at building safer, more honest AI systems, warning that frontier models are already exhibiting dangerous behaviors like deception and self-preservation — including Anthropic's Claude 4 allegedly blackmailing an engineer — while criticizing Silicon Valley's capability-first AI arms race.

Quantization Slashes AI Model Size By 75% With Minimal Quality Loss, But 2-Bit Compression Causes Near-Total Collapse

Quantization Slashes AI Model Size By 75% With Minimal Quality Loss, But 2-Bit Compression Causes Near-Total Collapse

Mar 26, 2026
ngrok blog

Quantization can slash AI model sizes by 75% with minimal quality loss at 8-bit and 4-bit precision, but pushing compression to 2-bit causes near-total collapse, with 97% of benchmark questions going unanswered and responses devolving into incoherent loops, according to new testing on Qwen3.5 9B.

Google's TurboQuant Slashes LLM Memory by 5x and Boosts Speed 8x With No Accuracy Loss

Google's TurboQuant Slashes LLM Memory by 5x and Boosts Speed 8x With No Accuracy Loss

Mar 25, 2026
MarkTechPost

Google's TurboQuant is revolutionizing AI efficiency, slashing large language model memory usage by over 5x and boosting speed up to 8x with zero accuracy loss, using a data-oblivious quantization algorithm requiring no dataset-specific tuning — maintaining perfect retrieval accuracy across 104,000 tokens in benchmark tests.

Page 1 of 50
Next
Showing 1 - 10 of 499 articles