Machine Learning

625 articles found

New 'Bayesian Teaching' Framework Trains LLMs To Reason Probabilistically With 80% Accuracy, Transfers Across Domains

New 'Bayesian Teaching' Framework Trains LLMs To Reason Probabilistically With 80% Accuracy, Transfers Across Domains

Mar 06, 2026
research

A groundbreaking 'Bayesian Teaching' framework is training LLMs to reason probabilistically with 80% accuracy by fine-tuning them on interactions with an optimal Bayesian model, and the learned skills are successfully transferring across entirely different domains like hotels and web shopping.

Microsoft's New 15B AI Model Matches Larger Rivals Using One-Fifth the Training Data

Microsoft's New 15B AI Model Matches Larger Rivals Using One-Fifth the Training Data

Mar 05, 2026
Venturebeat

Microsoft's new 15-billion-parameter AI model, Phi-4-reasoning-vision-15B, matches and even outperforms much larger rivals while using only one-fifth the training data, thanks to a 'mixed reasoning' design that intelligently switches between deep analytical thinking and fast direct responses depending on task complexity.

Apple Unveils AI System That Pinpoints Exact Words Where Models Hallucinate

Apple Unveils AI System That Pinpoints Exact Words Where Models Hallucinate

Mar 05, 2026
The Deep View

Apple unveils groundbreaking AI research that pinpoints the exact words where AI models hallucinate, transforming detection from a simple yes-or-no judgment into a precise, multi-step process that outperforms conventional methods — a critical breakthrough as the tech giant faces mounting pressure to ensure accuracy for its 2.5 billion devices worldwide.

New RL4HS Framework Outperforms Existing Models in Detecting Hallucinated Spans in AI-Generated Text

New RL4HS Framework Outperforms Existing Models in Detecting Hallucinated Spans in AI-Generated Text

Mar 05, 2026
Apple Machine Learning Research

A new reinforcement learning framework called RL4HS is outperforming existing AI models in detecting hallucinated spans in large language model outputs, using Group Relative Policy Optimization and a novel Class-Aware Policy Optimization technique to deliver superior results across summarization, question answering, and data-to-text tasks.

Cursor's Support Team Uses Its Own AI Tool to Handle 75% of Interactions, Achieving Up to 10x Engineer Productivity

Cursor's Support Team Uses Its Own AI Tool to Handle 75% of Interactions, Achieving Up to 10x Engineer Productivity

Mar 04, 2026
Cursor

Cursor's support team is using its own AI tool to handle over 75% of interactions, achieving up to 10x engineer productivity by unifying code, logs, Slack threads, and real-time database context into a single session powered by MCP servers, slash commands, and parallel-running subagents.

OpenAI Codex Success Rates Soar to 90% as Companies Integrate AI Coding Tool Into Daily Production Workflows

OpenAI Codex Success Rates Soar to 90% as Companies Integrate AI Coding Tool Into Daily Production Workflows

Mar 04, 2026
Zachary Proser

OpenAI Codex is transforming software development as task success rates skyrocket from 40-60% to 85-90%, with companies like WorkOS now relying on it as core production infrastructure for daily maintenance, CRUD operations, and API endpoints — freeing developers to tackle complex architectural challenges.

Page 1 of 63
Next
Showing 1 - 10 of 625 articles