Arabic End-to-End Structured OCR for textbooks

This is the official demo for the Arabic Nougat models. It is an end-to-end Markdown Extraction model that extracts text from images or PDFs and write them in Markdown.

There are three models available:

Disclaimer: These models hallucinate text and are not perfect. They are trained on a mix of synthetic and real data and may not work well on all types of images.

Model
Examples