Subject: optical character recognition


Title: Analiza porównawcza wybranych programów do optycznego rozpoznawania tekstu
Comparative analysis of selected programs for optical text recognition
Authors: Łukasik, Edyta
Zientarski, Tomasz
Subjects: text recognition
OCR
Tesseract
Ocrad
GOCR
optical character recognition
Publisher: Politechnika Lubelska. Instytut Informatyki
Links: https://bibliotekanauki.pl/articles/98410.pdf
Description: The aim of the article is to compare three programs for optical text recognition. The problem of optical text recognition is defined and its main applications are presented. The operation of the technology is described, and the most important programs available on the market are briefly characterized. The selected programs were then tested on two samples of machine-printed text in Polish. The speed of the recognition process was measured, and the correctness of character and word recognition in the analyzed text was determined.
Content provider: Biblioteka Nauki

Article


Title: Deep convolutional neural network using a new data set for berber language
Authors: Mokrane, Kemiche
Sadou, Malika
Subjects: optical character recognition
handwritten character recognition
CNN
Berber-MNIST data set
EMNIST
Tifinagh
Latin characters
Publisher: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Links: https://bibliotekanauki.pl/articles/27312869.pdf
Description: Handwritten character recognition (HCR) has become an interesting and immensely useful technology, and it has been explored with impressive performance in many languages. However, few HCR systems have been proposed for the Amazigh (Berber) language. Furthermore, validating any Amazigh handwritten character recognition system remains a major challenge because no robust Amazigh database is available. To address this problem, we first created two new data sets for Tifinagh and Amazigh Latin characters by extending the well-known EMNIST database with the Amazigh alphabet. We then proposed a handwritten character recognition system based on a deep convolutional neural network (CNN) to validate the created data sets. The proposed CNN was trained and tested on our data sets, and the experimental tests showed that it achieves satisfactory results in terms of accuracy and recognition efficiency.
Content provider: Biblioteka Nauki

Article


Title: Preprocessing Photos of Receipts for Recognition
Przetwarzanie wstępne zdjęć paragonów do celów rozpoznawania
Authors: Korobacz, W.
Tabędzki, M.
Subjects: digital image processing
character recognition
OCR
optical character recognition
Publisher: Politechnika Białostocka. Oficyna Wydawnicza Politechniki Białostockiej
Links: https://bibliotekanauki.pl/articles/88364.pdf
Description: The subject of this work is methods of image pre-processing applied to photos of receipts. The purpose is to improve their quality and thereby increase the effectiveness of conventional text recognition (OCR) software. The authors had mainly difficult cases in mind: photos taken freehand in poor lighting. The work describes the analyzed methods of filtering, binarization, edge detection, image straightening, region-of-interest marking, and thinning. Preliminary results of tests with OCR software on a small image set are also presented. Pre-processing improved character recognition by 25%. The final part presents conclusions and plans for future work.
Content provider: Biblioteka Nauki

Article
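
To illustrate the binarization step named in the abstract above, here is a minimal sketch using Otsu's global threshold. The abstract does not say which binarization method the authors applied, so the choice of Otsu's method, the function names `otsu_threshold` and `binarize`, and the sample pixel values are all assumptions made for illustration only.

```python
def otsu_threshold(pixels):
    """Return the grey level (0-255) that maximizes between-class variance.

    `pixels` is a flat list of 8-bit grey values. Otsu's method is an
    assumption here; the catalogued paper does not name its method.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    weight_dark = 0      # number of pixels at or below the candidate level
    sum_dark = 0         # sum of grey values at or below the candidate level
    best_level = 0
    best_variance = -1.0

    for level in range(256):
        weight_dark += hist[level]
        sum_dark += level * hist[level]
        if weight_dark == 0:
            continue
        weight_bright = total - weight_dark
        if weight_bright == 0:
            break
        mean_dark = sum_dark / weight_dark
        mean_bright = (sum_all - sum_dark) / weight_bright
        # Between-class variance, up to a constant factor of 1 / total**2.
        variance = weight_dark * weight_bright * (mean_dark - mean_bright) ** 2
        if variance > best_variance:
            best_variance = variance
            best_level = level
    return best_level


def binarize(pixels, threshold):
    """Map pixels above the threshold to 1 (paper) and the rest to 0 (ink)."""
    return [1 if p > threshold else 0 for p in pixels]


if __name__ == "__main__":
    # Hypothetical sample: four dark "ink" pixels and four bright "paper" pixels.
    sample = [20, 25, 30, 22, 200, 210, 205, 198]
    level = otsu_threshold(sample)
    print(level, binarize(sample, level))
```

A single global threshold tends to struggle with uneven lighting, which is the difficult case the abstract highlights, so a locally adaptive threshold may be a better fit for real receipt photos.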


Title: Rozpoznawanie wzorców cyfrowych za pomocą robota edukacyjnego
Digits recognition using an educational robot
Authors: Dimitrova-Grekow, T.
Grodzki, D.
Subjects: optical character recognition
Region of Interest (ROI) algorithm
educational robot
Publisher: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Links: https://bibliotekanauki.pl/articles/156236.pdf
Description: The article presents a system that recognizes Roman numerals using the Mindstorms NXT educational kit. The OCR algorithm chosen for character recognition is based on feature classification. The adapted version of the Region of Interest (ROI) algorithm and the feature classification are the main strengths of this work. The system was successfully tested in many respects. The resulting construction can scan an A4 sheet, and the program that controls it correctly interprets the scanned Roman numerals.
Pattern recognition is always associated with intensive computation [1, 2]. A specific branch of this area is Optical Character Recognition [3, 4, 5], where one of the most popular techniques is Feature Extraction, also known as Intelligent Character Recognition (ICR) [6]. All ICR algorithms are topological [7, 8, 9]. This paper presents an implementation of a Roman number recognition system realized on the LEGO Mindstorms NXT educational robot. The main point is the successful minimalistic realization of an on-board pattern recognition system. The NXT platform also allows easy reconfiguration of the hardware and more building freedom without extra costs (Fig. 1). An adapted version of the ROI algorithm is implemented [10]. Based on the extracted features (Fig. 2), a classification of the Roman digits is proposed (Fig. 3). The final stage of the program includes segmentation, calculation of the end result, and its visualization on the robot screen. The experimental tests showed 100% efficiency for digit and number recognition under optimal conditions and quite good stability against optical noise (Fig. 4) and color changes (Tab. 1). In spite of many drawbacks of the hardware, the implemented system seems very promising and suggests many ideas for pattern recognition techniques.
Content provider: Biblioteka Nauki

Article


Title: Cross-task code reuse in genetic programming applied to visual learning
Authors: Jaśkowski, W.
Krawiec, K.
Wieloch, B.
Subjects: genetic programming
code reuse
knowledge sharing
visual learning
multi-task learning
optical character recognition
Publisher: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Links: https://bibliotekanauki.pl/articles/330367.pdf
Description: We propose a method that enables effective code reuse between evolutionary runs that solve a set of related visual learning tasks. We start by introducing a visual learning approach that uses genetic programming individuals to recognize objects. The process of recognition is generative, i.e., it requires the learner to restore the shape of the processed object. This method is extended with a code reuse mechanism by introducing a crossbreeding operator that allows importing genetic material from other evolutionary runs. In the experimental part, we compare the performance of the extended approach to the basic method on a real-world task of handwritten character recognition, and conclude that code reuse leads to better results in terms of fitness and recognition accuracy. Detailed analysis of the crossbred genetic material also shows that code reuse is most profitable when the recognized objects exhibit visual similarity.
Content provider: Biblioteka Nauki

Article


Title: Development of Extensive Polish Handwritten Characters Database for Text Recognition Research
Authors: Tokovarov, Mikhail
Kaczorowska, Monika
Miłosz, Marek
Subjects: OCR
Handwriting character samples
Database for optical character recognition
Polish handwritten characters database
Publisher: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Links: https://bibliotekanauki.pl/articles/102832.pdf
Description: In the modern world, fast and efficient processing of non-digital (handwritten or typed) texts is a task of extreme importance. Like many other fields, optical character recognition (OCR) benefits from the application of machine learning (ML), which allows effective and accurate methods to be developed. To achieve good performance, a machine learning algorithm requires a great amount of data. A large database of handwritten characters prepared by the National Institute of Standards and Technology (NIST), USA, can be used for training an ML model. However, significant differences exist between the manners of handwriting in the US and Poland. That fact, along with the absence of Polish diacritical marks, makes the NIST database less useful for developing an OCR model for the Polish language. To the best of the authors' knowledge, no database with samples of Polish handwriting exists. The present research is focused on filling this gap, i.e. gathering and preparing an extensive database of Polish handwritten characters. The paper presents the very first database of Polish handwriting samples. The database is far larger than all the datasets used in previous attempts to implement OCR for Polish handwriting. It is also the first fully publicly accessible database of Polish handwriting of this scale. The same method and the developed tools can be used to build handwritten character databases for other languages.
Content provider: Biblioteka Nauki

Article


Title: Osmanlı Türkçesi araştırmalarında optik karakter tanıma teknolojisinin kullanımı
The use of optical character recognition technology in Ottoman Turkish research
Authors: Altun, Hilal Oytun
Publisher: İksad
Description: With the increasing availability of electronic archives of Ottoman text images, computer-aided study of those texts is attracting more interest as a field. While some historical languages have been the subject of more computational studies than others, Ottoman Turkish is still a low-resource language in terms of machine-readable texts, despite its rich archives of document images. The first step towards computational processing of Ottoman Turkish texts is building an electronic text corpus. Achieving this requires AI-powered Optical Character Recognition (OCR) tools for Ottoman Turkish. Current OCR tools have limitations: they can only be used on printed book pages set in Naskh type with a simple layout. The metrics for evaluating OCR performance target both physical segmentation and textual content recognition. In this paper I focus on the content recognition performance of the selected tool. I stored the OCR output and the reference text in separate txt files and wrote a small script to compare the content of the files in terms of word matching. Even setting aside the complexities of page layout, problems with word-level segmentation mean that the tool's over 90% performance in letter recognition does not guarantee that the text is rendered accurately. Producing high-quality machine-readable texts therefore still depends on manual correction, checking the texts character by character. Improving the performance of AI-powered OCR tools depends heavily on high-quality training data. Because of the language characteristics of the documents, preparing pre-processed parallel data sets for the training and testing phases requires linguistic expertise in Ottoman Turkish. Therefore, the data and methods of related fields should be integrated when developing AI-powered tools in order to make progress in computerized historical linguistics.
Content provider: Repozytorium Uniwersytetu Jagiellońskiego

Article
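
The abstract above mentions a small script that compares OCR output with a reference text by word matching, and notes that high letter-level accuracy does not guarantee accurate words. The sketch below shows one way such a comparison could look. It is not the author's script: the function names, the use of Levenshtein distance for the letter-level score, the bag-of-words matching rule, and the sample strings are all assumptions.

```python
from collections import Counter


def levenshtein(a, b):
    """Edit distance between two strings (insert, delete, substitute cost 1)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def char_accuracy(reference, ocr_output):
    """Letter-level score: 1 - edit distance / reference length (assumed metric)."""
    if not reference:
        return 1.0 if not ocr_output else 0.0
    return max(0.0, 1.0 - levenshtein(reference, ocr_output) / len(reference))


def word_match_rate(reference, ocr_output):
    """Share of reference words that also occur in the OCR output.

    Words are split on whitespace and matched as a multiset, so word order
    is ignored (an assumed simplification).
    """
    ref_words = Counter(reference.split())
    ocr_words = Counter(ocr_output.split())
    total = sum(ref_words.values())
    if total == 0:
        return 1.0
    matched = sum((ref_words & ocr_words).values())
    return matched / total


if __name__ == "__main__":
    # Hypothetical strings: one segmentation error and one misread letter.
    reference = "the quick brown fox jumps"
    ocr_output = "the qu ick brown f0x jumps"
    print(char_accuracy(reference, ocr_output))
    print(word_match_rate(reference, ocr_output))
```

In this made-up sample, two character-level errors leave the letter score above 90% while only three of the five reference words survive intact, which is the kind of gap the abstract describes.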


Title: Preprocessing methods for handwriting character recognition
Metody przetwarzania wstępnego przy rozpoznawaniu pisma odręcznego
Authors: Chyczyński, Mikołaj
Description: The work describes selected pre-processing methods that can be used to improve the accuracy of handwriting recognition, including binarization, size normalization, morphological transformations, and skeletonization. All methods were applied to images containing single, separated characters, ignoring the problem of extracting letters from words. Experiments were carried out to measure the effectiveness of these methods. The experiments, based on a database of 1000 digits, are discussed together with the conclusions drawn in chapter 4. Two classifiers were used in the experiments, K Nearest Neighbors and Support Vector Machine, whose basic features and behavior are described in chapter 1. A C++ implementation using the OpenCV library is provided in chapter 3.
Content provider: Repozytorium Uniwersytetu Jagiellońskiego

Other
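
The abstract above names K Nearest Neighbors as one of the two classifiers applied to single, separated characters. As a rough illustration of how such a classifier works, the sketch below votes among the closest training glyphs. It is written in Python, not the C++/OpenCV implementation the work describes, and the function name `knn_predict`, the 3x3 binary glyphs, and the value of `k` are hypothetical.

```python
from collections import Counter


def knn_predict(train, sample, k=3):
    """Classify `sample` by majority vote among its k nearest training items.

    `train` is a list of (feature_vector, label) pairs. Distance is squared
    Euclidean, which on 0/1 vectors equals the number of differing pixels.
    """
    def distance(item):
        vector, _label = item
        return sum((a - b) ** 2 for a, b in zip(vector, sample))

    nearest = sorted(train, key=distance)[:k]
    votes = Counter(label for _vector, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical training set: 3x3 binary glyphs flattened row by row.
TRAIN = [
    ([0, 1, 0, 0, 1, 0, 0, 1, 0], "1"),
    ([0, 1, 0, 0, 1, 0, 0, 1, 1], "1"),
    ([1, 1, 0, 0, 1, 0, 0, 1, 0], "1"),
    ([1, 1, 1, 0, 0, 1, 0, 0, 1], "7"),
    ([1, 1, 1, 0, 0, 1, 0, 1, 0], "7"),
    ([0, 1, 1, 0, 0, 1, 0, 0, 1], "7"),
]

if __name__ == "__main__":
    noisy_one = [0, 1, 0, 0, 1, 0, 1, 1, 0]    # vertical bar plus one stray pixel
    noisy_seven = [1, 1, 1, 0, 0, 1, 0, 0, 0]  # seven with a missing pixel
    print(knn_predict(TRAIN, noisy_one), knn_predict(TRAIN, noisy_seven))
```

Pre-processing steps such as binarization and size normalization matter for this kind of classifier because the distance is computed pixel by pixel, so glyphs need a common scale and value range before they can be compared.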


Title: Automatic tax form recognition
Automatyczne rozpoznawanie formularzy podatkowych
Authors: Wolny, Maciej
Description: This thesis is a proof of concept of how to automate tax settlement in Poland, using the PIT-11 and PIT-37 documents as examples. The motivation is to introduce the reader to office automation by showing how to automate a small piece of accountancy work. Artificial intelligence is conquering areas related to a variety of white-collar jobs and provides tools capable of replacing most repetitive human work. The thesis reviews the difficulties faced when converting information stored on paper into digital data and proposes a new way of reading forms while maintaining context. The reader is introduced to image processing, optical character recognition, and text matching. All of these topics are presented using the example of a modern tax-settling application running in Google Cloud.
Content provider: Repozytorium Uniwersytetu Jagiellońskiego

Other
