Trust

184 articles found

Base LLMs Show Strong Semantic Confidence Accuracy, But Fine-Tuning and Chain-of-Thought Reasoning Destroy It

Base LLMs Show Strong Semantic Confidence Accuracy, But Fine-Tuning and Chain-of-Thought Reasoning Destroy It

Mar 25, 2026
Apple Machine Learning Research

New research reveals that base large language models possess strong semantic confidence accuracy, but popular techniques like fine-tuning and chain-of-thought reasoning actively destroy this calibration, raising urgent questions about the reliability of widely deployed AI systems.

Y Combinator-Backed Compliance Startup Delve Faces Whistleblower Fraud Allegations, Security Vulnerabilities Amid $300M Valuation

Y Combinator-Backed Compliance Startup Delve Faces Whistleblower Fraud Allegations, Security Vulnerabilities Amid $300M Valuation

Mar 22, 2026
TechCrunch

Y Combinator-backed compliance startup Delve, valued at $300 million, is under fire from an anonymous whistleblower alleging it fabricates compliance evidence, potentially exposing hundreds of clients to criminal HIPAA liability and GDPR fines, while a security researcher simultaneously uncovers critical vulnerabilities in its systems.

NHS AI Mammography System Detects 25% More Missed Cancers While Cutting Radiologist Workload by Up to 44%

NHS AI Mammography System Detects 25% More Missed Cancers While Cutting Radiologist Workload by Up to 44%

Mar 18, 2026
research

A groundbreaking NHS AI mammography system detects 25% more missed cancers while cutting radiologist workload by up to 44%, offering a powerful solution to the UK's radiologist shortage, though human panels incorrectly overruled the AI on 93 cancer cases, highlighting urgent need for improved human-AI collaboration.

Previous
Page 6 of 19
Next
Showing 51 - 60 of 184 articles