Movatterモバイル変換


[0]ホーム

URL:


US20210034907A1 - System and method for textual analysis of images - Google Patents

System and method for textual analysis of images
Download PDF

Info

Publication number
US20210034907A1
US20210034907A1US16/940,578US202016940578AUS2021034907A1US 20210034907 A1US20210034907 A1US 20210034907A1US 202016940578 AUS202016940578 AUS 202016940578AUS 2021034907 A1US2021034907 A1US 2021034907A1
Authority
US
United States
Prior art keywords
text
mathematical model
regions
image
text regions
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/940,578
Inventor
Pranay Dugar
Anirban Chatterjee
Saswata Sahoo
Rajesh Shreedhar Bhat
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Walmart Apollo LLC
Original Assignee
Walmart Apollo LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Walmart Apollo LLCfiledCriticalWalmart Apollo LLC
Priority to US16/940,578priorityCriticalpatent/US20210034907A1/en
Assigned to WALMART APOLLO, LLCreassignmentWALMART APOLLO, LLCASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: SAHOO, SASWATA, BHAT, Rajesh Shreedhar, CHATTERJEE, ANIRBAN, DUGAR, PRANAY
Publication of US20210034907A1publicationCriticalpatent/US20210034907A1/en
Priority to US18/152,271prioritypatent/US11861669B2/en
Priority to US18/510,811prioritypatent/US12125080B2/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

Segmentation first breaks the images into segments or regions, with the segments of the region having text or symbols. The segmented image is separately applied to two different CNN-based models. Each model produces text boxes where potential text might exist. Then, a selective NMS algorithm is applied to the output of each model to produce a final group of text regions. These text regions are analyzed and actions taken.

Description

Claims (14)

What is claimed is:
1. A system, comprising:
a data storage unit including a trained first mathematical model and a trained second mathematical model, wherein the first mathematical model is different and distinct from the second mathematical model;
an electronic communication network;
an electronic server coupled to the electronic communication network that hosts a web-based catalog ordering system that receives electronic orders from customers;
a control circuit that is coupled to the electronic communication network and the data storage unit, wherein the control circuit is configured to:
receive an image of a product from a vendor via the electronic communication network, the product proposed by the vendor to be sold to retail customers;
perform segmentation on the image to divide the image into individual regions of homogeneous pixels, wherein the segmentation is effective to create a segmented image;
apply the segmented image to the first mathematical model to produce a first group of text regions and apply the segmented image to the second mathematical model to obtain a second group of text regions, wherein each of the text regions are regions includes potential text or symbols;
apply a selective non-maximal suppression (sNMS) algorithm to the first group of text regions and the second group of text regions to obtain a final group of text regions, the selective NMS algorithm being effective to remove overlapping regions at the same location or general location in the image, the selective NMS algorithm selecting text regions most likely to include text;
analyze informational content of the text regions and perform an action that utilizes the informational content of the text regions, the action being one or more of:
applying the informational content to the web-based ordering catalog, receiving a customer order from a customer as a result of the informational content, and physically fulfilling the received customer orders using an automated order fulfillment system to ship items in the order to the customer;
scanning the informational content for offensive content, and sending a message to a vendor via the electronic network to remove the offensive content or removing the item from a retail store or warehouse when an item including the offensive content exists in the retail store or warehouse.
2. The system ofclaim 1, wherein the item that is removed from the retail store or warehouse is removed using an automated vehicle to navigate to the item and remove the item from a display unit or storage unit.
3. The system ofclaim 3, wherein the automated vehicle is an automated ground vehicle or an aerial drone.
4. The system ofclaim 1, wherein the first group of text regions, the second group of text regions, and the final group of text regions comprise text boxes.
5. The system ofclaim 1, wherein the first mathematical model and the second mathematical model are convolutional neural networks (CNNs).
6. The system ofclaim 1, wherein the first mathematical model and the second mathematical model are trained using training images.
7. The system ofclaim 1, further comprising a camera, the camera coupled to the electronic communication network, the camera configured to obtain the image.
8. A method, the method comprising:
providing a data storage unit that includes a trained first mathematical model and a trained second mathematical model, wherein the first mathematical model is different and distinct from the second mathematical model;
providing an electronic communication network and an electronic server that is coupled to the electronic communication network, the server hosting a web-based catalog ordering system that receives electronic orders from customers;
providing a control circuit that is coupled to the electronic communication network and the data storage unit;
at the control circuit, receiving an image of a product from a vendor via the electronic communication network, the product proposed by the vendor to be sold to retail customers;
at the control circuit, performing segmentation on the image to divide the image into individual regions of homogeneous pixels, wherein the segmentation is effective to create a segmented image;
at the control circuit, applying the segmented image to the first mathematical model to produce a first group of text regions and apply the segmented image to the second mathematical model to obtain a second group of text regions, wherein each of the text regions are regions includes potential text or symbols;
at the control circuit, applying a selective non-maximal suppression (sNMS) algorithm to the first group of text regions and the second group of text regions to obtain a final group of text regions, the selective NMS algorithm being effective to remove overlapping regions at the same location or general location in the image, the selective NMS algorithm selecting text regions most likely to include text;
at the control circuit, analyzing informational content of the text regions and perform an action that utilizes the informational content of the text regions;
wherein the action being one or more of:
applying the informational content to the web-based ordering catalog, receiving a customer order from a customer as a result of the informational content, and physically fulfilling the received customer orders using an automated order fulfilment system to ship items in the order to the customer;
scanning the informational content for offensive content, and sending a message to a vendor via the electronic network to remove the offensive content or removing the item from a retail store or warehouse when an item including the offensive content exists in the retail store or warehouse.
9. The method ofclaim 8, wherein the item that is removed from the retail store or warehouse is removed using an automated vehicle to navigate to the item and remove the item from a display unit or storage unit.
10. The method ofclaim 9, wherein the automated vehicle is an automated ground vehicle or an aerial drone.
11. The method ofclaim 8, wherein the first group of text regions, the second group of text regions, and the final group of text regions comprise text boxes.
12. The method ofclaim 8, wherein the first mathematical model and the second mathematical model are convolutional neural networks (CNNs).
13. The method ofclaim 8, wherein the first mathematical model and the second mathematical model are trained using training images.
14. The method ofclaim 8, further comprising a camera, the camera coupled to the electronic communication network, the camera configured to obtain the image.
US16/940,5782019-07-292020-07-28System and method for textual analysis of imagesAbandonedUS20210034907A1 (en)

Priority Applications (3)

Application NumberPriority DateFiling DateTitle
US16/940,578US20210034907A1 (en)2019-07-292020-07-28System and method for textual analysis of images
US18/152,271US11861669B2 (en)2019-07-292023-01-10System and method for textual analysis of images
US18/510,811US12125080B2 (en)2019-07-292023-11-16System and method for textual analysis of images

Applications Claiming Priority (4)

Application NumberPriority DateFiling DateTitle
IN2019410304772019-07-29
IN2019410304772019-07-29
US201962902745P2019-09-192019-09-19
US16/940,578US20210034907A1 (en)2019-07-292020-07-28System and method for textual analysis of images

Related Child Applications (1)

Application NumberTitlePriority DateFiling Date
US18/152,271ContinuationUS11861669B2 (en)2019-07-292023-01-10System and method for textual analysis of images

Publications (1)

Publication NumberPublication Date
US20210034907A1true US20210034907A1 (en)2021-02-04

Family

ID=74258227

Family Applications (3)

Application NumberTitlePriority DateFiling Date
US16/940,578AbandonedUS20210034907A1 (en)2019-07-292020-07-28System and method for textual analysis of images
US18/152,271ActiveUS11861669B2 (en)2019-07-292023-01-10System and method for textual analysis of images
US18/510,811ActiveUS12125080B2 (en)2019-07-292023-11-16System and method for textual analysis of images

Family Applications After (2)

Application NumberTitlePriority DateFiling Date
US18/152,271ActiveUS11861669B2 (en)2019-07-292023-01-10System and method for textual analysis of images
US18/510,811ActiveUS12125080B2 (en)2019-07-292023-11-16System and method for textual analysis of images

Country Status (1)

CountryLink
US (3)US20210034907A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113409289A (en)*2021-06-292021-09-17南开大学Ancient building brick surface defect detection method based on Faster R-CNN
US11631116B1 (en)*2021-02-052023-04-18Gatekeeper Financial LlcShopping method
CN117326557A (en)*2023-09-282024-01-02连云港市沃鑫高新材料有限公司 A method for preparing high-purity silicon carbide powder for reaction-sintered ceramic structural parts
US11861669B2 (en)2019-07-292024-01-02Walmart Apollo, LlcSystem and method for textual analysis of images
CN117875906A (en)*2024-03-062024-04-12青岛冠成软件有限公司Electronic bill auditing method based on artificial intelligence

Citations (14)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO1999066426A1 (en)*1998-06-151999-12-23Commissariat A L'energie AtomiqueImage search method, based on image invariant indexing
CA2736750A1 (en)*2008-09-152010-03-18James A. AmanSession automated recording together with rules based indexing, analysis and expression of content
US8150166B2 (en)*2006-09-062012-04-03Sharp Laboratories Of America, Inc.Methods and systems for identifying text in digital images
CA2815944A1 (en)*2010-11-042012-05-10Digimarc CorporationSmartphone-based methods and systems
EP2521071A2 (en)*2011-05-062012-11-07King Abdulaziz City for Science & Technology (KACST)Method and system for text segmentation
CA2888153A1 (en)*2012-10-192014-04-24Digimarc CorporationMethods and arrangements for identifying objects
CA2846273A1 (en)*2013-03-142014-09-14Nordstrom, Inc.System and methods for order fulfillment, inventory management, and providing personalized services to customers
US9008435B2 (en)*2005-05-092015-04-14Google Inc.System and method for search portions of objects in images and features thereof
US9256618B2 (en)*2011-02-152016-02-09Ebay Inc.Identifying prohibited information within and item image
US20170169315A1 (en)*2015-12-152017-06-15Sighthound, Inc.Deeply learned convolutional neural networks (cnns) for object localization and classification
US10410261B2 (en)*2017-05-252019-09-10Walmart Apollo, LlcSystems and methods for determining facet rankings for a website
WO2019199719A1 (en)*2018-04-102019-10-17Ebay Inc.Dynamically generated machine learning models
US20210019566A1 (en)*2019-07-172021-01-21Walmart Apollo, LlcSystems and methods for identifying items in a digital image
US10922957B2 (en)*2008-08-192021-02-16Digimarc CorporationMethods and systems for content processing

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8271336B2 (en)*1999-11-222012-09-18Accenture Global Services GmbhIncreased visibility during order management in a network-based supply chain environment
US7263506B2 (en)*2000-04-062007-08-28Fair Isaac CorporationIdentification and management of fraudulent credit/debit card purchases at merchant ecommerce sites
US20060041484A1 (en)*2004-04-012006-02-23King Martin TMethods and systems for initiating application processes by data capture from rendered documents
US20070300142A1 (en)*2005-04-012007-12-27King Martin TContextual dynamic advertising based upon captured rendered text
US7817855B2 (en)*2005-09-022010-10-19The Blindsight CorporationSystem and method for detecting text in real-world color images
US8077930B2 (en)*2007-04-132011-12-13Atg Advanced Swiss Technology Group AgMethod for recognizing content in an image sequence
US8009928B1 (en)*2008-01-232011-08-30A9.Com, Inc.Method and system for detecting and recognizing text in images
US11461785B2 (en)*2008-07-102022-10-04Ron M. RedlichSystem and method to identify, classify and monetize information as an intangible asset and a production model based thereon
WO2010022185A1 (en)*2008-08-192010-02-25Digimarc CorporationMethods and systems for content processing
US8732014B2 (en)*2010-12-202014-05-20Yahoo! Inc.Automatic classification of display ads using ad images and landing pages
US9367770B2 (en)*2011-08-302016-06-14Digimarc CorporationMethods and arrangements for identifying objects
US9483732B1 (en)*2013-02-082016-11-01Marko MilakovichHigh value information alert and reporting system and method
US11257143B2 (en)*2014-12-302022-02-22Hand Held Products, Inc.Method and device for simulating a virtual out-of-box experience of a packaged product
WO2018200866A1 (en)*2017-04-262018-11-01UHV Technologies, Inc.Material sorting using a vision system
US11687800B2 (en)*2017-08-302023-06-27P Tech, LlcArtificial intelligence and/or virtual reality for activity optimization/personalization
US11775926B2 (en)*2018-01-292023-10-03Maplebear, Inc.Machine-learned model for optimizing selection sequence for items in a warehouse
US11475404B2 (en)*2018-09-052022-10-18Trax Technology Solutions Pte Ltd.Aggregating product shortage information
WO2020194004A1 (en)*2019-03-282020-10-01The Nielsen Company (Us), LlcMethods and apparatus to detect a text region of interest in a digital image using machine-based analysis
US10832083B1 (en)*2019-04-232020-11-10International Business Machines CorporationAdvanced image recognition for threat disposition scoring
US20210034907A1 (en)2019-07-292021-02-04Walmart Apollo, LlcSystem and method for textual analysis of images
US20220292543A1 (en)*2021-03-092022-09-15Alexandra Valentina HendersonPop-up retial franchising and complex econmic system

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO1999066426A1 (en)*1998-06-151999-12-23Commissariat A L'energie AtomiqueImage search method, based on image invariant indexing
US9008435B2 (en)*2005-05-092015-04-14Google Inc.System and method for search portions of objects in images and features thereof
US8150166B2 (en)*2006-09-062012-04-03Sharp Laboratories Of America, Inc.Methods and systems for identifying text in digital images
US10922957B2 (en)*2008-08-192021-02-16Digimarc CorporationMethods and systems for content processing
CA2736750A1 (en)*2008-09-152010-03-18James A. AmanSession automated recording together with rules based indexing, analysis and expression of content
CA2815944A1 (en)*2010-11-042012-05-10Digimarc CorporationSmartphone-based methods and systems
US9256618B2 (en)*2011-02-152016-02-09Ebay Inc.Identifying prohibited information within and item image
US20120281919A1 (en)*2011-05-062012-11-08King Abdul Aziz City For Science And TechnologyMethod and system for text segmentation
EP2521071A2 (en)*2011-05-062012-11-07King Abdulaziz City for Science & Technology (KACST)Method and system for text segmentation
CA2888153A1 (en)*2012-10-192014-04-24Digimarc CorporationMethods and arrangements for identifying objects
CA2846273A1 (en)*2013-03-142014-09-14Nordstrom, Inc.System and methods for order fulfillment, inventory management, and providing personalized services to customers
US20170169315A1 (en)*2015-12-152017-06-15Sighthound, Inc.Deeply learned convolutional neural networks (cnns) for object localization and classification
US10410261B2 (en)*2017-05-252019-09-10Walmart Apollo, LlcSystems and methods for determining facet rankings for a website
WO2019199719A1 (en)*2018-04-102019-10-17Ebay Inc.Dynamically generated machine learning models
US20210019566A1 (en)*2019-07-172021-01-21Walmart Apollo, LlcSystems and methods for identifying items in a digital image

Cited By (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US11861669B2 (en)2019-07-292024-01-02Walmart Apollo, LlcSystem and method for textual analysis of images
US12125080B2 (en)2019-07-292024-10-22Walmart Apollo, LlcSystem and method for textual analysis of images
US11631116B1 (en)*2021-02-052023-04-18Gatekeeper Financial LlcShopping method
CN113409289A (en)*2021-06-292021-09-17南开大学Ancient building brick surface defect detection method based on Faster R-CNN
CN117326557A (en)*2023-09-282024-01-02连云港市沃鑫高新材料有限公司 A method for preparing high-purity silicon carbide powder for reaction-sintered ceramic structural parts
CN117875906A (en)*2024-03-062024-04-12青岛冠成软件有限公司Electronic bill auditing method based on artificial intelligence

Also Published As

Publication numberPublication date
US20240104617A1 (en)2024-03-28
US11861669B2 (en)2024-01-02
US12125080B2 (en)2024-10-22
US20230169555A1 (en)2023-06-01

Similar Documents

PublicationPublication DateTitle
US11861669B2 (en)System and method for textual analysis of images
EP3327617B1 (en)Object detection in image data using depth segmentation
US11657602B2 (en)Font identification from imagery
EP3327616B1 (en)Object classification in image data using machine learning models
US20210312206A1 (en)System and method for classifier training and retrieval from classifier database for large scale product identification
CN110929774B (en)Classification method, model training method and device for target objects in image
US9830704B1 (en)Predicting performance metrics for algorithms
Meier et al.Fully convolutional neural networks for newspaper article segmentation
US9342757B2 (en)Pattern recognition apparatus, method thereof, and program product therefor
EP2806374B1 (en)Method and system for automatic selection of one or more image processing algorithm
US20210166028A1 (en)Automated product recognition, analysis and management
US11532036B2 (en)Digital image ordering using object position and aesthetics
US11531994B2 (en)Electronic detection of products and arrangement of products in a display structure, electronic detection of objects and arrangement of objects on and around the display structure, electronic detection of conditions of and around the display structure, and electronic scoring of the detected product and object arrangements and of the detected conditions
US20170124618A1 (en)Methods and Systems for Image-Based Searching of Product Inventory
CN111460888B (en)Article identification method and device based on machine learning
CN114821102A (en)Intensive citrus quantity detection method, equipment, storage medium and device
US10235594B2 (en)Object detection in image data using color segmentation
WO2020022329A1 (en)Object detection/recognition device, method, and program
Yaman et al.Image processing and machine learning‐based classification method for hyperspectral images
Mankar et al.Classification of remote sensing image using SVM kernels
Sharath Kumar et al.Hand gesture recognition for sign language: a skeleton approach
Dugar et al.Semi-Bagging Based Deep Neural Architecture to Extract Text from High Entropy Images
US20250292578A1 (en)Dynamic mapping using image data
Ghoshal et al.Text extraction from scene images through local binary pattern and business features based color image segmentation
Patel et al.Design and Development of Vegetable Detection and Recognition Model Using Deep Learning in Market Environment

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:WALMART APOLLO, LLC, ARKANSAS

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DUGAR, PRANAY;CHATTERJEE, ANIRBAN;SAHOO, SASWATA;AND OTHERS;SIGNING DATES FROM 20190814 TO 20200723;REEL/FRAME:053405/0338

STPPInformation on status: patent application and granting procedure in general

Free format text:NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPPInformation on status: patent application and granting procedure in general

Free format text:NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO PAY ISSUE FEE


[8]ページ先頭

©2009-2025 Movatter.jp