Oryx ORYX Translation

Oryx Translation · Document Studio

Arabic PDFs, restored
to .docx Word — faithfully.

A precise, RTL-aware converter built by translators. We honour your typography, your script, and your layout — whether the source is digital text or a scanned page.

Files auto-deleted within 60 minutes No account · No tracking Free up to 5 pages
01
Upload

Drop your PDF

II
Method

How a page travels through the studio

01أ

Detect the source

Every PDF is read first by PyMuPDF. Pages with embedded text route to the digital pipeline; image-only pages route to OCR.

02ب

Convert with care

Digital pages keep their layout via pdf2docx. Scans pass through Tesseract 5 trained on Arabic, with paragraph-level RTL preserved.

03ج

Deliver a .docx

You receive a Word file ready for editing — with Noto Naskh Arabic embedded, right-to-left flow, and clean paragraph structure.

III
What you receive

Built by translators, for translators

RTL-native typography

Right-to-left flow, Arabic ligatures, and proper paragraph direction set on every line of output.

Layout-aware

Columns, tables, and embedded fonts are preserved on digital PDFs via pdf2docx — not just plain text dumps.

OCR for scanned pages

Tesseract 5 with the Arabic + English language models. Clean modern print yields 90 %+ character accuracy.

Privacy by default

Files are removed from disk within 60 minutes. No accounts, no logs of file content, no third parties.

Free up to 5 pages

No signup. Drop a file, get the .docx. Bring your own API key for premium quality and unlimited pages.

Made by Oryx

Crafted by Oryx Translation — a studio specialising in Arabic content for technical, legal, and editorial work.