Machine Learning

723 articles found

New 'Bayesian Teaching' Framework Trains LLMs To Reason Probabilistically With 80% Accuracy, Transfers Across Domains

New 'Bayesian Teaching' Framework Trains LLMs To Reason Probabilistically With 80% Accuracy, Transfers Across Domains

Mar 06, 2026
research

A groundbreaking 'Bayesian Teaching' framework is training LLMs to reason probabilistically with 80% accuracy by fine-tuning them on interactions with an optimal Bayesian model, and the learned skills are successfully transferring across entirely different domains like hotels and web shopping.

Microsoft's New 15B AI Model Matches Larger Rivals Using One-Fifth the Training Data

Microsoft's New 15B AI Model Matches Larger Rivals Using One-Fifth the Training Data

Mar 05, 2026
Venturebeat

Microsoft's new 15-billion-parameter AI model, Phi-4-reasoning-vision-15B, matches and even outperforms much larger rivals while using only one-fifth the training data, thanks to a 'mixed reasoning' design that intelligently switches between deep analytical thinking and fast direct responses depending on task complexity.

Apple Unveils AI System That Pinpoints Exact Words Where Models Hallucinate

Apple Unveils AI System That Pinpoints Exact Words Where Models Hallucinate

Mar 05, 2026
The Deep View

Apple unveils groundbreaking AI research that pinpoints the exact words where AI models hallucinate, transforming detection from a simple yes-or-no judgment into a precise, multi-step process that outperforms conventional methods — a critical breakthrough as the tech giant faces mounting pressure to ensure accuracy for its 2.5 billion devices worldwide.

New RL4HS Framework Outperforms Existing Models in Detecting Hallucinated Spans in AI-Generated Text

New RL4HS Framework Outperforms Existing Models in Detecting Hallucinated Spans in AI-Generated Text

Mar 05, 2026
Apple Machine Learning Research

A new reinforcement learning framework called RL4HS is outperforming existing AI models in detecting hallucinated spans in large language model outputs, using Group Relative Policy Optimization and a novel Class-Aware Policy Optimization technique to deliver superior results across summarization, question answering, and data-to-text tasks.

Previous
Page 10 of 73
Next
Showing 91 - 100 of 723 articles