OPTICAL CHARACTER RECOGNITION TECHNOLOGY GUIDE FOR BUSINESS OWNERS

Written by Evgeniy Krasnokutsky, AI/ML Team Leader at MobiDev, PhD

Table of Contents

What is OCR and How Does It Work?
    STEP 1. CHECKING THE DOCUMENT TYPE & IMAGE PRE-PROCESSING
    STEP 2. CHARACTER RECOGNITION
    STEP 3. POST-PROCESSING
OCR is a Machine Learning and Computer Vision Task
OCR Business Cases
    OCR IN FINANCIAL SERVICES
    OCR IN HEALTHCARE
    OCR IN RETAIL
    OCR IN SECURITY AND LAW ENFORCEMENT
Hardware for OCR
Out-of-the-box Solutions vs Custom OCR Development
    COMMERCIAL VS OPEN-SOURCE OCR SOLUTIONS
Limitations of OCR Technology and How to Overcome Them
Key Takeaways

With the growing interest in OCR and machine learning, more and more business owners are looking for ways to apply this killer combination to optimize their business processes. If you are one of them, this article is for you.

Let's find out what OCR is, how OCR powered by machine learning differs from the original technology, and how it can be used in business.

What is OCR and How Does It Work?
Optical character recognition (OCR) converts any kind of image containing written text into machine-readable text data. OCR allows you to digitize a document quickly and automatically, without manual data entry. That's why OCR is commonly used for business flow optimization and automation. The output of OCR is further used for electronic document editing and compact data storage, and it also forms the basis for cognitive computing, machine translation, and text-to-speech technologies.

There are different types of OCR depending on the tasks they solve:

• Intelligent Word Recognition (IWR) recognizes unconstrained handwritten words as a whole instead of recognizing individual characters.
• Intelligent Character Recognition (ICR) is a more advanced form of OCR for hand-printed characters.
• Optical Word Recognition (OWR) scans typewritten text word by word.
• Optical Mark Recognition (OMR) identifies the information that people mark on surveys, tests, etc.

Let's find out how OCR works. A traditional optical character recognition system operates in three stages: image pre-processing, character recognition, and post-processing.

[Figure: document processing flow: pipeline selection, pre-processing, OCR, result]

STEP 1. CHECKING THE DOCUMENT TYPE & IMAGE PRE-PROCESSING

The main challenge of text recognition is that each document template has its own set of entities, values, and entity locations within the document. For OCR software to work accurately, it must be able to identify different types of documents and run the correct predefined pipeline for each type. For example, PDF documents may or may not contain a text layer. If the PDF does not contain a text layer, we must process it differently than if it did.

After the right pipeline is chosen, the image goes to the pre-processing step. This preparation step affects the quality of the outcome. Image pre-processing helps remove image noise and increase the contrast between the background and the text, which improves text recognition. At this step, the OCR program converts the document to a black-and-white version and then analyzes it for light and dark areas. Light areas are identified as the background, while dark areas are identified as characters to be processed.
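
The black-and-white conversion described above can be illustrated with a minimal sketch. It assumes the page is already a grayscale image stored as a nested list of values from 0 (black) to 255 (white), and it picks the cut-off with Otsu's method. The guide does not name a specific thresholding technique, so Otsu's method and the function names `otsu_threshold` and `binarize` are assumptions made for illustration only.

```python
from typing import List

# Assumption: a grayscale page as rows of pixels, 0 (black) to 255 (white).
Image = List[List[int]]


def otsu_threshold(image: Image) -> int:
    """Pick the level that best separates dark and light pixels (Otsu's method)."""
    histogram = [0] * 256
    for row in image:
        for pixel in row:
            histogram[pixel] += 1

    total = sum(histogram)
    weighted_sum = sum(level * count for level, count in enumerate(histogram))

    best_threshold, best_variance = 0, -1.0
    dark_count, dark_sum = 0, 0
    for level in range(256):
        dark_count += histogram[level]
        if dark_count == 0:
            continue  # no dark pixels yet, nothing to separate
        light_count = total - dark_count
        if light_count == 0:
            break  # every pixel is already in the dark class
        dark_sum += level * histogram[level]
        dark_mean = dark_sum / dark_count
        light_mean = (weighted_sum - dark_sum) / light_count
        # Between-class variance (up to a constant factor).
        variance = dark_count * light_count * (dark_mean - light_mean) ** 2
        if variance > best_variance:
            best_variance, best_threshold = variance, level
    return best_threshold


def binarize(image: Image) -> List[List[int]]:
    """Return a mask: 1 for dark (candidate text) pixels, 0 for light background."""
    threshold = otsu_threshold(image)
    return [[1 if pixel <= threshold else 0 for pixel in row] for row in image]
```

Calling `binarize(page)` on a scanned page yields a mask in which dark pixels are marked as candidate characters and light pixels as background. Production systems would more likely rely on an image-processing library and add noise removal before thresholding.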

STEP 2. CHARACTER RECOGNITION

Individual characters are detected using pattern recognition or feature detection algorithms. The detected characters are then assembled into words and sentences.

• Pattern recognition is a method based on finding matches between the text in the image and text samples embedded in the system in various fonts and formats. This method works best with typescript, and it doesn't work well when it encounters new fonts that are not included in the system.
• The feature detection algorithm makes it possible to recognize characters in new fonts by analyzing their distinctive features. Such features may include the number of slanted lines, intersecting lines, or curves in the symbol being compared.

Most often, OCR programs with feature detection use classifiers based on machine learning or neural networks to process characters. Classifiers compare image features with examples stored in the system and select the closest match. The feature detection algorithm works well for unusual fonts or low-quality images where the font is distorted.

STEP 3. POST-PROCESSING

Once a symbol is identified, it is converted into a code that computer systems can use for further processing. We should mention that the output of any OCR or OCR-related technology or algorithm contains a lot of noise and false positives. This makes it difficult to use OCR output directly, so we have to:

• Filter out noisy outputs and false positives
• Combine recognized entities with their extracted meaning
• Check for possible mistakes and withhold the output from the user if any are found

Based