Big Data

346 articles found

AllenAI's olmOCR Converts PDFs and Images to Markdown for Under $200 Per Million Pages

AllenAI's olmOCR Converts PDFs and Images to Markdown for Under $200 Per Million Pages

Jul 02, 2026
GitHub

AllenAI's open-source olmOCR toolkit converts PDFs and images into clean Markdown text — including equations, tables, and handwriting — for under $200 per million pages, with its latest v0.4.0 release scoring 82.4 on a 7,000+ test benchmark, rivaling top OCR tools while supporting GPU inference, Docker, and multi-node cloud processing.

Fluree DB v4.1.0 Launches With 2M+ Facts Per Second Import Speed and 10x Performance Edge Over Competitors

Fluree DB v4.1.0 Launches With 2M+ Facts Per Second Import Speed and 10x Performance Edge Over Competitors

Jun 24, 2026
GitHub

Fluree DB v4.1.0 launches with blazing 2M+ facts per second import speeds and a 10.4x performance edge over competitors, completing all 850 WGPB queries on a 21.5-billion-triple Wikidata dataset with a 43ms geometric mean, while introducing AI memory integration for tools like Claude Code and Cursor.

NVIDIA and AWS Supercharge Cloud AI with New Blackwell GPUs, 10x Faster Vector Search, and Elite Training Certification

NVIDIA and AWS Supercharge Cloud AI with New Blackwell GPUs, 10x Faster Vector Search, and Elite Training Certification

Jun 24, 2026
NVIDIA Blog

NVIDIA and AWS are rolling out a powerful trio of AI advancements: new EC2 G7 instances with Blackwell GPUs delivering 4.6x faster inference, a 10x faster vector search engine now default in Amazon OpenSearch, and AWS earning elite NVIDIA Exemplar Cloud status for large-scale AI training workloads.

Page 1 of 35
Next
Showing 1 - 10 of 346 articles