New Document AI Course: OCR to Agentic Document Extraction - AI Dynamics


AI Dynamics

Global AI News Aggregator


New Document AI Course: OCR to Agentic Document Extraction


14 January 2026, 18:42

Posted on X by Andrew Ng (@AndrewYNg), 14 January 2026:

New course: Document AI: From OCR to Agentic Doc Extraction, built with @LandingAI, where I'm executive chairman, and taught by David Park and Andrea Kropp.

Much of the world's data is locked in PDFs, JPEGs, and other documents. This short course shows you how to build agentic workflows that process documents accurately: breaking them into parts, examining each piece carefully, and extracting information through multiple iterations.

Traditional Optical Character Recognition (OCR) captures text but loses context from table headers, chart captions, or the reading order of columns. After exploring OCR's limitations, you'll use LandingAI's Agentic Document Extraction (ADE) framework to process documents. ADE treats pages visually, as images, to parse information and extract fields.

Skills you'll gain:
- Build agents to convert unstructured files into structured Markdown/HTML and JSON
- Use ADE to parse complex data like forms, handwriting, or equations
- Map extracted information to named fields using a specified schema, with bounding boxes for grounding and validation
- Deploy RAG applications with event-driven document processing

Come learn about the best tools for intelligently processing documents like financial invoices, medical records, or academic papers: deeplearning.ai/short-course…
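To make the schema-mapping skill above more concrete, here is a minimal sketch of what "named fields with bounding boxes for grounding and validation" could look like as plain data structures. It does not use or reproduce LandingAI's ADE API: the `BoundingBox`, `GroundedField`, `INVOICE_SCHEMA`, and `validate` names, the normalized coordinate convention, and the sample values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Hypothetical box in normalized page coordinates (0..1, top-left origin)."""
    page: int
    left: float
    top: float
    right: float
    bottom: float

    def is_valid(self) -> bool:
        # A usable box lies inside the page and has positive width and height.
        return (0.0 <= self.left < self.right <= 1.0
                and 0.0 <= self.top < self.bottom <= 1.0)


@dataclass(frozen=True)
class GroundedField:
    """An extracted value plus the page region it was read from."""
    name: str
    value: str
    box: BoundingBox


# Hypothetical schema: field name -> type the raw string must convert to.
INVOICE_SCHEMA = {"invoice_number": str, "total": float}


def validate(fields, schema):
    """Map extracted fields onto the schema; return (record, problems)."""
    record, problems = {}, []
    by_name = {f.name: f for f in fields}
    for name, cast in schema.items():
        field = by_name.get(name)
        if field is None:
            problems.append(f"{name}: missing")
            continue
        if not field.box.is_valid():
            problems.append(f"{name}: bounding box out of range")
            continue
        try:
            record[name] = cast(field.value)
        except ValueError:
            problems.append(
                f"{name}: cannot parse {field.value!r} as {cast.__name__}"
            )
    return record, problems


# Made-up sample output standing in for an extraction step.
extracted = [
    GroundedField("invoice_number", "INV-0042",
                  BoundingBox(0, 0.62, 0.08, 0.90, 0.11)),
    GroundedField("total", "1280.50",
                  BoundingBox(0, 0.70, 0.85, 0.90, 0.88)),
]
record, problems = validate(extracted, INVOICE_SCHEMA)
print(record, problems)
```

Keeping the box next to each value means a reviewer, or a downstream check, can trace any field back to the region of the page it came from instead of trusting the extracted string alone.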

→ View original post on X — @andrewyng, 2026-01-14 17:42 UTC


AGENTS AI AUTOMATION CODE DATA EDUCATION ENTERPRISE AI GENERATIVE AI MACHINE LEARNING MULTIMODAL AI TOOLS


