0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
Spotting of a Particular Printed Word in Farsi Handwritten Forms Using Deep Learning
نویسندگان :
Mohammad jafar Gholami Kenari
1
Ehsanollah Kabir
2
1- دانشگاه تربیت مدرس تهران
2- دانشگاه تربیت مدرس تهران
کلمات کلیدی :
Word Spotting،Mask R-CNN،Anchors،Data Augmentation،Farsi،Persian
چکیده :
Keyword spotting in documents plays a crucial role in information retrieval and document analysis. Recent years have witnessed significant progress in keyword spotting through deep learning methods. This paper introduces a method that utilizes a pre-trained Mask R-CNN with transfer learning to spot the printed keyword “تاریخ” in the printed forms filled in handwriting. To address data scarcity and enhance the network's performance, data augmentation methods are employed. Additionally, specific to the keyword “تاریخ”, adjustments such as changes in the dimensions and aspect ratio of anchors, are implemented in the region proposal network (RPN). The results illustrate that the proposed method achieves a mean Average Precision (mAP) of 98.1 percent
لیست مقالات
لیست مقالات بایگانی شده
A New High Voltage Gain Full Bridge Resonant Switched-Capacitor Converter
Sajad AfsharZarandi - Reza Beiranvand
Speech Emotion Recognition Using Transfer Learning and Self-Supervised Speech Representation Learning
Marziye Azad - Babak Nasersharif
Efficient Full Adders for Approximate Arithmetic Units in the Image Processing Applications
Bahram Rashidi
بهینه سازی تزویج فیبر نوری باریک شده و موجبر نوری بر بستر پلیمر
مهتاب حسینعلی زاده - مونا ثریا - غلام محمد پارسا نسب - شکراله کریمیان
A New Method on Failure Detection of Fixed and Moving Contacts of Circuit Breakers
Hassan Hamidi - Ali Asghar Razi Kazemi
Secret Sharing Implementation of Predictive Functional Control
Enayat Amiri - Mohammad Haeri - Saeed Adelipour
E-RESO: An Enhanced Time Redundancy-based Error Detection Approach for Arithmetic Operations
Sina Shahoveisi - Athena Abdi
A COMPREHENSIVE DEEP LEARNING METHOD for SHORT-TERM LOAD FORECASTING
Mohammad Sayadlou - Mahdi Salay naderi - Mehrdad Abedi - Sajad Esmaeili - Mohammad Amini
Source Seeking Via Circular Formation of n-Nonholonomic Agents in a 2-D Environment
Milad Ghane - Mohsen Mojiri - Mohammad Ali Ghadiri-Modarres - Elaheh Zadhoosh
Large Scale Indoor VLC Positioning Using Image Sensor with Limited Field of View
Arezoo Kabiri - Foroogh Sadat Tabataba
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4