Arabic End-to-End Structured OCR for textbooks

This is the official demo for the Arabic Nougat models. It is an end-to-end Markdown Extraction model that extracts text from images or PDFs and write them in Markdown.

There are three models available:

Disclaimer: Models can hallucinate text and are not perfect. Please double check the output if you care about accuracy the most.

Model
Examples