Onlineocr.net might be a bit more popular than Apertium. We know about 4 links to it since March 2021 and only 3 links to Apertium. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
This is very cool, looking forward to it! I've been doing the same thing with Spanish Wikipedia articles for a while, using a few lines of Bash + Regex. I was using Apertium for it. https://apertium.org/ It's definitely worse than most ML-based solutions, but it works reliably and fast; you can run it entirely offline. With Spanish translations, the main problem I was facing is lack of vocabulary, so I created - Source: Hacker News / 10 months ago
I used to keep track of the state of machine translation some years back. I think the way you measure the success of an automated translation is edit distance, i.e. How many manual edits you need to make to a translated text before you reach some acceptable state. I suppose it's somewhat subjective, but it is possible to construct a benchmark and allow for multiple correct results. The best resources I knew back... - Source: Hacker News / about 2 years ago
Apertium is one of them. We make open-source rule-based machine translation systems, and our core tools are in C++. A few of our proposed ideas involve modifying those C++ tools with new features or improvements to existing features. Source: over 3 years ago
Hey! I know exactly what you mean: de-scrambling sentences and lists because of those damn columns sucks, but being blind, and with relatively few pdf, text, doc, etc, or interactive cyoas, it's complicated. My experience is, the drive to doc technique is probably the best, but onlineocr.net isn't bad either: on really hard conversions, I use both. I know thatt my reply comes a bit late, but still, good luck and... Source: over 1 year ago
Look for a website that can use OCR to make the text selectable in ur pdf. U can try onlineocr.net. Source: almost 2 years ago
The best OCR I have come across on the internet is the one on onlineocr.net however its page limit makes its paid version not worth buying. Are there any other OCRs on the internet with similar quality, paid or not, my goal here is to make searchable word documents of textbooks. Source: about 2 years ago
📎34. onlineocr.net: Recognize text from scanned PDFs and images “ see other OCR tools. Source: over 2 years ago
Google Translate - Google's free service instantly translates words, phrases, and web pages between English and over 100 other languages.
Tesseract - Tesseract is an optical character recognition engine for various operating systems
DeepL Translator - DeepL Translator is a machine translator that currently supports 42 language combinations.
ABBYY FineReader - ABBYY's latest PDF editor software, FineReader 16 you can easily convert files like PDF to Excel, PDF to Word, edit, share, collaborate & more with this PDF editor!
Microsoft Translator - Microsoft Translator is your door to a wider world.
GOCR - GOCR homepage. GOCR is an OCR (Optical Character Recognition) program, developed under the GNU Public License.