Gemini AI OCR
Google Gemini 1.5 Flash Vision extracts Sinhala text from scanned PDF pages with high accuracy.
An AI-powered proof of concept that converts Sri Lanka Ministry of Education Sinhala PDF textbooks into WCAG 2.1 AA compliant digital textbooks — with TTS, keyboard navigation, and EPUB3 output.
UNICEF's Accessible Digital Textbooks (ADT) initiative works to ensure that children with disabilities have equal access to quality education in their native language. In Sri Lanka, over 80,000 children with visual or hearing impairments have severely limited access to accessible Sinhala educational materials.
This proof of concept demonstrates how AI — specifically Google's Gemini 1.5 Flash for OCR and MiniMax M2.5 for text structuring — can automate the conversion of existing Ministry of Education PDF textbooks into fully accessible WCAG 2.1 AA compliant digital formats at scale.
Three steps from any Ministry of Education Sinhala PDF to a fully accessible digital textbook
Upload any Sri Lanka Ministry of Education Sinhala textbook — text-based or scanned. Supports grades 1–13.
Gemini 1.5 Flash OCR extracts Sinhala text from scanned pages. MiniMax M2.5 structures chapters and generates accessible summaries.
Download WCAG 2.1 AA EPUB3 for screen readers, or read instantly in our accessible web reader with TTS and keyboard navigation.
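Because uploads can be text-based or scanned, the processing step needs some way to tell which pages actually require OCR. The sketch below shows one possible heuristic, not necessarily the one this project uses: count the characters from the Unicode Sinhala block (U+0D80 to U+0DFF) in a page's embedded text layer, and route the page to OCR when too few are found. The names `countSinhalaChars` and `needsOcr` and the default threshold of 20 characters are illustrative assumptions.

```typescript
// Hypothetical helper: counts UTF-16 code units that fall inside the
// Unicode Sinhala block (U+0D80..U+0DFF). The block sits in the Basic
// Multilingual Plane, so charCodeAt is sufficient here.
function countSinhalaChars(text: string): number {
  let count = 0;
  for (let i = 0; i < text.length; i++) {
    const code = text.charCodeAt(i);
    if (code >= 0x0d80 && code <= 0x0dff) {
      count++;
    }
  }
  return count;
}

// Assumed heuristic: a page whose embedded text layer holds fewer than
// `minSinhalaChars` Sinhala characters is treated as scanned and sent to OCR.
// The default threshold is a guess and would need tuning on real textbooks.
function needsOcr(embeddedText: string, minSinhalaChars: number = 20): boolean {
  return countSinhalaChars(embeddedText) < minSinhalaChars;
}
```

A page with an empty text layer always goes to OCR under this rule, while a page with a healthy Sinhala text layer skips it and goes straight to structuring.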
Built to WCAG 2.1 Level AA — every feature serves a real accessibility need
Uses the Web Speech API with the si-LK voice to read Sinhala text aloud. Falls back gracefully to the default voice if si-LK is unavailable.
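The fallback described here can be sketched as a pure selection function over the voice list. This is an assumed approach rather than the reader's actual code: `VoiceLike` and `pickSinhalaVoice` are hypothetical names, and the intermediate step of accepting any other Sinhala (`si`) voice is an addition of this sketch. `VoiceLike` lists only the fields of the browser's `SpeechSynthesisVoice` that the function reads.

```typescript
// Minimal shape shared with the browser's SpeechSynthesisVoice objects.
interface VoiceLike {
  name: string;
  lang: string;
  default: boolean;
}

// Preference order (an assumption of this sketch):
//   1. the first voice tagged si-LK
//   2. the first voice in any other Sinhala locale
//   3. the first voice flagged as the default
// Returns null when the list offers none of these.
function pickSinhalaVoice<T extends VoiceLike>(voices: T[]): T | null {
  let anySinhala: T | null = null;
  let fallback: T | null = null;
  for (let i = 0; i < voices.length; i++) {
    const voice = voices[i];
    // Normalise "si_LK" style tags to "si-lk" before comparing.
    const lang = voice.lang.toLowerCase().replace("_", "-");
    if (lang === "si-lk") {
      return voice;
    }
    if (anySinhala === null && lang.split("-")[0] === "si") {
      anySinhala = voice;
    }
    if (fallback === null && voice.default) {
      fallback = voice;
    }
  }
  return anySinhala !== null ? anySinhala : fallback;
}
```

In a browser, the input would typically come from `speechSynthesis.getVoices()` and the result would be assigned to the `voice` property of a `SpeechSynthesisUtterance`. The voice list can be empty until the `voiceschanged` event fires, so a null result is worth handling by leaving `voice` unset.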
Full keyboard control — Space to play/pause TTS, arrow keys for chapters, +/- for font size. No mouse required.
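One way to keep these shortcuts easy to test is to separate the key lookup from the event handler. The sketch below is illustrative only: `ReaderAction` and `actionForKey` are hypothetical names, mapping the right and left arrows to next and previous chapter is an assumption about which arrows are used, and accepting `=` as an alias for `+` (the same physical key without Shift on many layouts) is an addition of this sketch.

```typescript
type ReaderAction =
  | "toggleSpeech"
  | "nextChapter"
  | "previousChapter"
  | "increaseFont"
  | "decreaseFont";

// Maps a KeyboardEvent.key value to a reader action.
// Returns null for keys the reader does not bind.
function actionForKey(key: string): ReaderAction | null {
  switch (key) {
    case " ": // KeyboardEvent.key reports Space as a single space character
      return "toggleSpeech";
    case "ArrowRight": // assumed direction: right means next chapter
      return "nextChapter";
    case "ArrowLeft":
      return "previousChapter";
    case "+":
    case "=": // assumed alias for "+" typed without Shift
      return "increaseFont";
    case "-":
      return "decreaseFont";
    default:
      return null;
  }
}
```

A `keydown` listener would call `actionForKey(event.key)` and, when the result is not null, call `event.preventDefault()` so that Space does not also scroll the page. Skipping the lookup while focus is inside a form field avoids hijacking normal typing.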
Toggle high contrast for users with low vision. White text on a black background gives a 21:1 contrast ratio, well above the 7:1 that WCAG AAA requires for body text.
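The contrast claim can be checked with the relative luminance and contrast ratio formulas defined in WCAG 2.1. The sketch below implements those formulas directly; the names `Rgb`, `relativeLuminance`, and `contrastRatio` are illustrative and not taken from the project.

```typescript
// An sRGB colour as [red, green, blue], each channel in the range 0..255.
type Rgb = [number, number, number];

// Relative luminance as defined by WCAG 2.1: linearise each sRGB channel,
// then weight the channels by their contribution to perceived brightness.
function relativeLuminance(rgb: Rgb): number {
  const linear = rgb.map((value) => {
    const c = value / 255;
    return c <= 0.03928 ? c / 12.92 : Math.pow((c + 0.055) / 1.055, 2.4);
  });
  return 0.2126 * linear[0] + 0.7152 * linear[1] + 0.0722 * linear[2];
}

// Contrast ratio (L1 + 0.05) / (L2 + 0.05), where L1 is the lighter colour.
// Argument order does not matter. The result ranges from 1 (identical
// colours) up to about 21 (white against black).
function contrastRatio(a: Rgb, b: Rgb): number {
  const la = relativeLuminance(a);
  const lb = relativeLuminance(b);
  const lighter = Math.max(la, lb);
  const darker = Math.min(la, lb);
  return (lighter + 0.05) / (darker + 0.05);
}
```

For normal-size text, WCAG sets the minimum ratio at 4.5:1 for Level AA and 7:1 for Level AAA, so a theme check could compare `contrastRatio(foreground, background)` against whichever threshold applies.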
Four font sizes (S/M/L/XL) set in Noto Sans Sinhala, a font designed for Sinhala script.
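Stepping through the four sizes can be modelled as a clamped move along an ordered list, so that repeated presses at either end simply stay put. This is a minimal sketch under that assumption; `FONT_SIZES`, `FontSize`, and `stepFontSize` are hypothetical names, and whether the real reader clamps or wraps around is not stated in the source.

```typescript
// The four size labels, ordered from smallest to largest.
const FONT_SIZES = ["S", "M", "L", "XL"] as const;
type FontSize = typeof FONT_SIZES[number];

// Moves one step larger (+1) or smaller (-1).
// The result is clamped, so "XL" + 1 stays "XL" and "S" - 1 stays "S".
function stepFontSize(current: FontSize, delta: 1 | -1): FontSize {
  const index = FONT_SIZES.indexOf(current);
  const next = Math.min(FONT_SIZES.length - 1, Math.max(0, index + delta));
  return FONT_SIZES[next];
}
```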
Generates WCAG 2.1 AA compliant EPUB3 with accessibility metadata for screen readers like JAWS and NVDA.
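In EPUB3, accessibility metadata lives in the package document as `meta` elements that use schema.org properties, plus a `dcterms:conformsTo` statement. The sketch below generates such elements as plain strings. It is an assumed shape, not the project's generator: `AccessibilityMetadata`, `metaElement`, and `buildAccessibilityMeta` are hypothetical names, the conformance string follows the pattern used by EPUB Accessibility 1.1, and which modes, features, and hazards a given textbook should declare depends on its actual content.

```typescript
// Hypothetical input shape for the values a publisher wants to declare.
interface AccessibilityMetadata {
  accessModes: string[]; // e.g. "textual", "visual"
  features: string[];    // e.g. "structuralNavigation", "tableOfContents"
  hazards: string[];     // e.g. "none"
  summary: string;       // human-readable accessibility summary
}

// Escapes the characters that are unsafe inside XML element content.
function escapeXml(text: string): string {
  return text.replace(/&/g, "&amp;").replace(/</g, "&lt;").replace(/>/g, "&gt;");
}

// Builds one <meta property="...">value</meta> element.
function metaElement(property: string, value: string): string {
  return '<meta property="' + property + '">' + escapeXml(value) + "</meta>";
}

// Returns the accessibility <meta> elements, one per line, ready to be
// placed inside the <metadata> section of the package document.
function buildAccessibilityMeta(meta: AccessibilityMetadata): string {
  const lines: string[] = [];
  meta.accessModes.forEach((mode) => lines.push(metaElement("schema:accessMode", mode)));
  meta.features.forEach((feature) => lines.push(metaElement("schema:accessibilityFeature", feature)));
  meta.hazards.forEach((hazard) => lines.push(metaElement("schema:accessibilityHazard", hazard)));
  lines.push(metaElement("schema:accessibilitySummary", meta.summary));
  lines.push(metaElement("dcterms:conformsTo", "EPUB Accessibility 1.1 - WCAG 2.1 Level AA"));
  return lines.join("\n");
}
```

The sketch is not exhaustive. A complete package document would normally also declare `schema:accessModeSufficient` and set the publication language to Sinhala so that reading systems can pick a matching voice.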
Built-in Sinhala demo content — works without a backend
අපේ රටේ ලස්සන ස්වභාවය ගැන ඉගෙන ගනිමු... ("Let's learn about the beautiful nature of our country...")
Sample content included — no PDF upload required to explore the reader