Quality Magazine logo
search
cart
facebook twitter linkedin youtube
  • Sign In
  • Create Account
  • Sign Out
  • My Account
Quality Magazine logo
  • NEWS
  • PRODUCTS
    • FEATURED PRODUCTS
    • SUBMIT YOUR PRODUCT
  • CHANNELS
    • AUTOMATION
    • MANAGEMENT
    • MEASUREMENT
    • NDT
    • QUALITY 101
    • SOFTWARE
    • TEST & INSPECTION
    • VISION & SENSORS
  • MARKETS
    • AEROSPACE
    • AUTOMOTIVE
    • ENERGY
    • GREEN MANUFACTURING
    • MEDICAL
  • MEDIA
    • A WORD ON QUALITY PUZZLE
    • EBOOKS
    • PODCASTS
    • VIDEOS
    • WEBINARS
  • EVENTS
    • EVENT CALENDAR
    • IMTS
  • DIRECTORIES
    • BUYERS GUIDE >
      • Supplier Insights
    • NDT SOURCEBOOK
    • VISION & SENSORS
    • TAKE A TOUR
  • INFOCENTERS
    • Digital Quality Management Systems
    • NEXT GENERATION SPC & QUALITY ANALYTICS
  • AWARDS
    • ROOKIE OF THE YEAR
    • PLANT OF THE YEAR
    • PROFESSIONAL OF THE YEAR
  • MORE
    • Expert Columns
    • NEWSLETTERS
    • QUALITY STORE
    • INDUSTRY LINKS
    • SPONSOR INSIGHTS
  • EMAG
    • eMAGAZINE
    • ARCHIVES
    • CONTACT
    • ADVERTISE
  • SIGN UP!
Vision & Sensors

Vision & Sensors | Optical Character Recognition

Choosing Cutting-Edge OCR

It’s not the AI that makes the difference, but how the AI is used.

By Heiko Seitz
IDS Denknet OCR press play

All Images Source: IDS Imaging Development Systems GmbH

June 6, 2024
✕
Image in modal.

The expectations of OCR (Optical Character Recognition) are high. Nowadays, users expect it to recognize and correctly interpret all characters out-of-the-box, much like a human can. In addition, accuracy should remain consistent, regardless of changes in lighting or other environmental conditions. Rule-based approaches have the disadvantage that they lack the scope for interpretation that is acquired in neural network approaches through training with many different examples and explains their enormous performance. However, the use of state-of-the-art technology alone is not enough to ensure the success of a project - it is much more important to make it easy to use, performant and user-friendly to maintain. Because even with OCR systems based on deep learning, quality, execution speed and user-friendliness are anything but a given.

Optical character recognition is still one of the most difficult disciplines in image processing and machine intelligence. The pure variety of possible characters and methods of applying characters to different surfaces gives an idea of the challenges involved. The difficulties of converting such complex visual data into clear, structured text include dirt, reflections and shape errors caused by scratching, embossing or laser engraving on solid materials. In addition, overlapping or incomplete characters, as well as a generally low pixel resolution of the image data, can lead to characters quickly becoming almost impossible to distinguish from each other. For example, the number 8 quickly becomes a 3. The image processing market is constantly evolving to improve the accuracy and reliability of text recognition. But what are the decisive factors when choosing an OCR system?

Comprehensive database with reproducible accuracy

To be convincing from the outset, OCR must function simply and offer high reading performance. This requires a well-developed network architecture that has been pre-trained with many varying sample images. Here, situations from real applications are just as indispensable as the use of synthetic data. On the one hand, many additional special cases and variations can be learned and, on the other, this also ensures far more robust recognition of the relevant features. After all, nothing should be left to chance, especially in industrial automation or quality assurance.

This is where an AI vision solution for individual image analysis must take action. In addition to the use of leading AI technology, users should not only have access to a high-performance OCR model, but also one that is constantly evolving. Strict versioning is essential so that application developers can access all development steps, but also have the option of updating to a new improved version to ensure versatile and robust reading at all times. The possibility of testing and verifying the performance and reproducibility of the trained networks with sample data sets in a kind of quality center should also not be missing. This is essential for quality assurance before a production system is upgraded with new software.

Heavily deformed and small numbers on crown caps.
Tire markings with little contrast.
Information on separating discs with considerable overprinting.
Figure 1: OCR systems should read reliably from the outset in different applications, even without fine-tuning. For example, tire markings with little contrast, heavily deformed and small numbers on crown caps or information on separating discs with considerable overprinting, even with a highly inhomogeneous background.

Of transformers & large language models

Another positive characteristic of a good OCR model is its ability to recognize not only individual characters, but also the relationships between them - in the case of character sequences, such as serial numbers or words - and to take this knowledge into account when recognizing characters. The better the OCR can predict subsequent characters and weight the reading result accordingly, the more robustly and precisely special applications can be solved. The generative and combinatorial properties of transformer networks or large language models (LLM), such as those used in ChatGPT, can have a further positive influence on such predictions and thus on the reading quality. However, it should be considered that these architectures are rather slow in execution and require a lot of system resources. This makes it all the more important that such cutting-edge technologies are used to the right extent in order to optimally support the requirements of customer use cases. This is because image processing does not operate in seconds, but rather in a low millisecond range, especially in the automation sector. A trained neural network should therefore remain fast and lightweight so that it can run on “normal” hardware. If high recognition accuracy and speed in productive use are only possible with almost infinite system performance, applications would hardly be economically viable. 

Simple correction and retraining

If the OCR fails to read characters, regardless of whether the reason was an error or an unknown character, font or language, it is important that the user can correct the reading results or train any new characters with little effort. However, this fine-tuning is not simply a matter of “continuing to train” the network. Imagine, for example, that the OCR model has been trained with 2 million images and the user now wants to introduce something new to the OCR model with a few more images of his own. What weighting should be given to such information in the model in order to make a difference without changing everything that is already there? And this is precisely where the provider’s expertise is required to expand the AI in such a way that this type of adaptation does not negatively impact previous stable recognitions. An example: An OCR has problems with numbers for some reason and the user only annotates numbers in subsequent training processes, never letters. The aim is to achieve an intelligent “knowledge backup” to prevent this network from only being able to read numbers successfully at some point because it thinks it does not need to read letters.

A well-designed training system therefore generates additional synthetic data for all new image data during fine-tuning in order to further train and weight the network to the right extent. This prevents the OCR from losing its previous capabilities, no matter how long it continues to be trained. However, such complex processes during “retraining” are better hidden from the user under a simple user interface and are processed quickly and efficiently in the background using sufficient system resources so that the user is not kept waiting for long periods of time. In the best case scenario, however, the basic capabilities of the OCR model are so good that little or no additional training is required.

Figure 2: Fine-tuning is designed to quickly improve the reading quality of an OCR system with less user interaction or expert knowledge required.
Figure 2: Fine-tuning is designed to quickly improve the reading quality of an OCR system with less user interaction or expert knowledge required.

Advantages of cloud training

A cloud-based AI system with a direct connection to a large data center can provide the necessary resources for complex training routines much more easily and quickly than would be possible with self-hosted hardware. And only when absolutely necessary. This means that training performance can be quickly increased and decreased again as required.

If all the functions and services of an OCR training system are executed entirely in the cloud, any fine-tuning with your own image data always works on an up-to-date and controlled software basis and not with any software release on any local hardware system. Continuous further development in the technical backend also makes the basic OCR model increasingly resistant to difficulties that have already been solved. As a result, more and more customer applications can be realized without major adjustments or additional training. 

The cloud solution is also an added value for the user in a support scenario. If there are difficulties with data in a use case, e.g. with unknown characters, technical support in the backend can quickly provide a remedy and positively influence recognition performance. Without having to export/import data and without the risk of different build systems or software versions leading to different results, changes can be made to the network architecture, for example, or the generation of synthetic additional data can be optimized. In such instances, this can be done directly in the customer use case without any loss of time. Not sending sensitive data also minimizes the risk of unauthorized access by third parties.

In the best case scenario, executing a training command for the cloud system is comparable to a software-as-a-service service that trains a large number of suitable network models with different architectures in the background and can ultimately provide the user with the best result.


Cutting-edge OCR system makes the difference

  • Synthetic data - Each time new images are uploaded, image variants are automatically generated to expand and strengthen model capabilities in a systematic way.
  • Ease of use + time savings - Intuitive tools such as "Autoprediction" and "1-Click Annotation" require no prior knowledge and reduce testing, preparation and maintenance time.
  • Cutting-edge technology - Knowledge of the latest network architectures, such as Transformer or Large Language Models, is continuously incorporated into the development of OCR. 
  • Smart Architecture - Fully automated training independently selects the most suitable architecture for the task
  • Cloud training - Always up to date with latest edge technology and continuous improvement of the network base
  • Fast and economical local execution - The goal is an optimally accurate, lean and fast model for local execution in a closed application environment

OCR simply and economically from a single source

There are many providers of OCR solutions in the AI vision environment and there is a veritable race for the best networks. For experienced users, there are also many open source tools and public network architectures available that can be used to quickly gain initial experience and achieve results. However, without in-depth technical knowledge of how AI technology or cutting edge networks and large vision models can be used and combined economically and efficiently, many OCR tasks remain unsolved. 

Here it is important to have an experienced partner who, at best, can offer all the necessary image processing components for fast, reliable and economical OCR tasks from a single source and adapt them to special requirements. Some customers already benefit from all these advantages. And for those who are still searching -- it costs nothing to try it out.

KEYWORDS: AI-based visual inspection Artificial Intelligence (AI) machine vision manufacturing OCR, Optical Character Recognition quality assurance

Share This Story

Looking for a reprint of this article?
From high-res PDFs to custom plaques, order your copy today!

Heiko Seitz, technical author, IDS Imaging Development Systems GmbH. For more information, call +49 7134 96196-0, email [email protected] or visit www.ids-imaging.de.

Recommended Content

JOIN TODAY
to unlock your recommendations.

Already have an account? Sign In

  • 2024 Quality Rookie of the Year Justin Wise 1440x750px banner with "Quality Rookie of the Year" logo inset

    Meet the 2024 Quality Rookie of the Year: Justin Wise

    Justin Wise is an exceptional individual who has been...
    Aerospace
    By: Michelle Bangert
  • Man with umbrella and coat stands outside while it rains at night looking at a building.

    Nondestructive Testing: Is there an ethics problem?

    I was a whistleblower who exposed fraudulent activities...
    NDT
    By: Dale Norwood
  • Unraveling Deflategate: Football stadium with closeup of football on field

    Unraveling the Tom Brady Deflategate

    The Deflategate scandal erupted following the 2014 AFC...
    Measurement
    By: Greg Cenker and Henry Zumbrun
Manage My Account
  • eMagazine Subscriptions
  • Newsletters
  • Online Registration
  • Subscription Customer Service
  • Manage My Preferences

More Videos

Sponsored Content

Sponsored Content is a special paid section where industry companies provide high quality, objective, non-commercial content around topics of interest to the Quality audience. All Sponsored Content is supplied by the advertising company and any opinions expressed in this article are those of the author and not necessarily reflect the views of Quality or its parent company, BNP Media. Interested in participating in our Sponsored Content section? Contact your local rep!

close
  • Key Takeaways for Quality Leaders
    Sponsored byComplianceQuest

    Key Takeaways for Quality Leaders from the 2026 Gartner Magic Quadrant™ for QMS

  • This image shows a person seated next to a Bobcat T66 compact track loader.
    Sponsored byPolyWorks by InnovMetric

    Supercharging Digital Gauging at Bobcat North America

  • Dorsey Calibration Lab photo by Tom LaBarbera Picture this Studios
    Sponsored byDorsey Metrology International

    Ensuring Product Quality in a Competitive Manufacturing Landscape

Popular Stories

iStock-1352825159-jpg.jpg

U.S. Should Substantially Boost Support for Manufacturing USA Program, Issue National Industrial Manufacturing Strategy, Says New Report

Dorsey Calibration Lab photo by Tom LaBarbera Picture this Studios

Ensuring Product Quality in a Competitive Manufacturing Landscape

Visions Sensors Ebook

eBook | How AI-driven Vision Systems Are Transforming Automotive Quality Control

2026 Quality Professional of the Year!

Events

June 22, 2026

Automate 2026

Automate is North America's largest robotics and automation event — and the best place to take your ideas from insight to impact.
 
Our show floor features the world’s leading automation solutions, from AI and robotics to motion control, vision systems, and more. Plus, our educational conference is second to none, led by the brightest minds in automation today.
 
Ready to transform the way you work? Take the next step at Automate.
July 14, 2026

Quality Leaders Forum: Better Communication, Better Quality Data

The Quality Leaders Forum is a quarterly, editor-moderated fireside chat series hosted by Quality Magazine, featuring candid conversations with senior manufacturing and operations executives shaping enterprise-level quality.

View All Submit An Event

Products

Lean Manufacturing and Service Fundamentals, Applications, and Case Studies

Lean Manufacturing and Service Fundamentals, Applications, and Case Studies

See More Products
Quality Podcast Channel Custom Content

Related Articles

  • VS 0323 System Integration AI-based Quality Inspection

    Quality Inspection with AI Vision

    See More
  • Higher speed acquisition and processing in machine vision

    Practical VGR: Cutting-Edge Technologies and Applications

    See More
  • Smiling warehouse worker in a reflective vest and hard hat checking inventory data on a tablet device in an organized storage facility.

    ASQ’s Publications Support Novice Practitioners to Cutting Edge Research

    See More

Related Products

See More Products
  • 9781260108385_20.jpg

    Manufacturing Planning And Control For Supply Chain Management: The CPIM Reference, Second Edition

  • temp_6351_1_5705_1_8318_1_17674.jpg

    How To Implement Lean Manufacturing, 2E

See More Products

Events

View AllSubmit An Event
  • November 11, 2025

    Cutting the Cost of Quality With Industrial CT

    On Demand This webinar explores how industrial CT scanning transforms quality management by revealing internal flaws, reducing downtime, and eliminating costly scrap and rework.
View AllSubmit An Event
×

Stay in the know with Quality’s comprehensive coverage of
the manufacturing and metrology industries.

Newsletters | Website | eMagazine

JOIN TODAY!
  • RESOURCES
    • Advertise
    • Contact Us
    • Directories
    • Manufacturing Division
    • Store
    • Want More
  • SIGN UP TODAY
    • Create Account
    • eMagazine
    • Newsletters
    • Customer Service
    • Manage Preferences
  • SERVICES
    • Marketing Services
    • Market Research
    • Reprints
    • List Rental
    • Survey/Respondent Access
  • STAY CONNECTED
    • LinkedIn
    • Facebook
    • YouTube
    • X (Twitter)
  • PRIVACY
    • PRIVACY POLICY
    • TERMS & CONDITIONS
    • DO NOT SELL MY PERSONAL INFORMATION
    • PRIVACY REQUEST
    • ACCESSIBILITY

Copyright ©2026. All Rights Reserved BNP Media, Inc. and BNP Media II, LLC.

Design, CMS, Hosting & Web Development :: ePublishing