AI Dynamics

Global AI News Aggregator

New Document AI Course: OCR to Agentic Document Extraction

New course: Document AI: From OCR to Agentic Doc Extraction, built with @LandingAI, where I'm executive chairman, and taught by David Park and Andrea Kropp. Much of the world's data is locked in PDFs, JPEGs, and other documents. This short course shows you how to build agentic workflows that process documents accurately: breaking them into parts, examining each piece carefully, and extracting information through multiple iterations. Traditional Optical Character Recognition (OCR) captures text but loses context from table headers, chart captions, or reading order of columns. After exploring OCR's limitations, you’ll use LandingAI's Agentic Document Extraction (ADE) framework to process documents. ADE treats pages as visually — as images — to parse information and extract fields. Skills you'll gain: – Build agents to convert unstructured files into structured Markdown/HTML and JSON – Use ADE to parse complex data like forms, handwriting, or equations – Map extracted information to named fields using a specified schema, with bounding boxes for grounding and validation – Deploy RAG applications with event-driven document processing Come learn about the best tools for processing documents like financial invoices, medical records, or academic papers intelligently: deeplearning.ai/short-course…

→ View original post on X — @andrewyng, 2026-01-14 17:42 UTC

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *