Get started with Surya-OCR. Multilingual document OCR toolkit(… | by Ayush

What’s OCR?

OCR stands for Optical Character Recognition is a method to detect and acknowledge texts in photos, paperwork and many others.

Surya is a multilingual doc OCR toolkit which may carry out correct line-level textual content detection.

Surya is called after the Hindu sun god, who has common imaginative and prescient.

Stipulations

You’ll want python 3.9+ and PyTorch in your machine to run surya-ocr.

Get Began

First, let’s set up the Surya OCR-

pip set up surya-ocr

You possibly can detect textual content traces in a picture, pdf, or folder of photos/pdfs with the next command. This can write out a json file with the detected bboxes, and optionally save photos of the pages with the bboxes.

surya_detect DATA_PATH --images

DATA_PATH might be a picture, pdf, or folder of photos/pdfs
--images will save photos of the pages and detected textual content traces (non-obligatory)
--max specifies the utmost variety of pages to course of for those who do not wish to course of all the things
--results_dir specifies the listing to avoid wasting outcomes to as a substitute of the default

In case you use it for the primary time, it’ll obtain a mannequin roughly about 120M. That’s it.

You possibly can learn extra about this via the github hyperlink present under.

Limitations

That is specialised for doc OCR. It would possible not work on pictures or different photos. It would additionally not work on handwritten textual content.
Doesn’t work effectively with photos that seem like adverts or different components of paperwork which can be normally ignored.

Citations

Thanks for studying, for those who discovered it useful do give a like and comply with for extra such content material.

Source link

Speech Recognition and Synthesis: Bridging the Gap Between Speech and Text | by Avadhoot Tavhare | May, 2024

Joyce’s picks: musings and readings in AI/ML, May 20, 2024 | by joyce shen | May, 2024

Exploring the Benefits of FP16 in Model Weights Quantization | by Mithilesh Biradar | May, 2024

ASUS’ first Copilot+ PC locks when you walk away and unlocks when you return

Speech Recognition and Synthesis: Bridging the Gap Between Speech and Text | by Avadhoot Tavhare | May, 2024

3 AI features coming to Copilot+ PCs that I wish were on my MacBook

Microsoft rebuilt Windows 11 around AI and Arm chips

Joyce’s picks: musings and readings in AI/ML, May 20, 2024 | by joyce shen | May, 2024

Most Popular

The Hamas Threat of Hostage Execution Videos Looms Large Over Social Media

Revolutionizing the Way We Find Love

Federal Investigators Widen Tesla Inquiry, Company Says

Our Picks

ASUS’ first Copilot+ PC locks when you walk away and unlocks when you return

Speech Recognition and Synthesis: Bridging the Gap Between Speech and Text | by Avadhoot Tavhare | May, 2024

3 AI features coming to Copilot+ PCs that I wish were on my MacBook

Get started with Surya-OCR. Multilingual document OCR toolkit(… | by Ayush | Jan, 2024

Related Posts