ainewsblitz.com

Breaking

Ai2 Releases olmOCR 2, Turning PDFs Into Markdown and Scoring 82.4 on Its Own Benchmark

  • Open Source
  • Foundation Models
  • AI Agents

The Allen Institute for Artificial Intelligence (Ai2) has released olmOCR, an open-source OCR toolkit that converts PDFs and scanned images into structure-preserving Markdown. Its latest version, olmOCR 2, scored 82.4 on the project's own benchmark after reinforcement-learning tuning. The aim is to make the knowledge locked in PDFs (papers, contracts, financial reports, historical archives) readable by LLMs and RAG systems.
