CROSS-REFERENCE TO RELATED APPLICATIONThis application claims the benefit of provisional Patent Application Ser. No. 62/758,457, filed Nov. 9, 2018, assigned to the assignee of the present application, and incorporated herein by reference.
BACKGROUNDIn high stress and oftentimes hazardous work environments-including firefighting, search & rescue, oil and gas, surgery, fighter pilots, mining, special ops, and the like, one false step has critical consequences, but so do too many slow steps. Go too fast and something life-threatening may be missed; go too slow and the results could be doubly devastating. The challenges of effectively and safely performing critical work in harsh and obscured environments have always existed. These challenges combine the physical strain imposed by hazardous terrain with the mental distress placed upon the individual operating within them. Critical human performance in high-stress environments is limited by how rapidly and effectively the brain can process impoverished or jumbled sensory inputs. Until now technology has been leveraged primarily to increase the amount of information provided to the senses, but not designed to specifically enhance the brain's existing (and unmatched) cognitive ability to make sense of that information.
For example, several emergency response systems are centered on the use of thermal imaging cameras (TICs) and augmented reality (AR) optics to provide a hands-free thermal display to the user. Current systems are typically carried by a crewmember who must iteratively scan, mentally process and communicate what they perceive. Current handheld and hands-free TICs lack the computational resources and software required to unobtrusively offer advanced image processing and data visualization features to all crewmembers in real-time. This capability and time gap in the visual understanding of hazardous environments has been identified as a significant causative factor in responder line of duty deaths. Such systems cause crewmembers, such as first responders, to operate in a Stop, Look, Process and Remember paradigm, which is cumbersome and time consuming.
Accordingly, there is a need for improved methods and systems for effectively for providing information to the senses of first responders operating in high-stress environments in a manner that reduces cognitive load to enhance performance.
BRIEF SUMMARYThe exemplary embodiment provides a cognitive load reducing platform for first responders that incorporates contextual and physiological visualizations delivered via electronic communications designed to reduce cognitive load and elevate performance. Aspects of the cognitive load reducing platform include enhancing edges of objects in a thermal image by receiving a thermal image and generating a gradient magnitude image comprising a plurality of pixels having associated gradient magnitude values. The gradient magnitude image is partitioned into subregions and gradient magnitude statistics are calculated for each. Mapping parameters are calculated for each of the subregions that equalize and smooth a dynamic range of the corresponding gradient magnitude statistics across the subregions. The mapping parameters calculated for each of the subregions are applied to pixels in the subregions to generate enhanced gradient magnitude values having equalized luminosity and contrast, and a wireframe image is formed therefrom having enhanced edges of objects. The wireframe image is displayed on a display device, wherein the wireframe image appears as a decluttered line drawing where the enhanced edges have increased luminosity and contrast compared to the thermal image to reduce the cognitive load of the user.
According to the method and system disclosed herein, the cognitive load reducing platform changes first responders' use of these technologies from the cumbersome, and information overloaded, Stop, Look, Process and Remember paradigm to a state of continuous assisted perception for all crewmembers.
BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGSFIG. 1 is a diagram illustrating one embodiment of a cognitive load reducing platform according to one embodiment.
FIGS. 2A and 2B are diagrams illustrating components of the cognitive load reducing platform in a firefighting vision system embodiment.
FIGS. 3A and 3B are diagrams illustrating an example of the enhanced images displayed on the display unit.
FIG. 4 is a diagram illustrating an example command and control interface of the cognitive load reducing platform.
FIG. 5 is a flow diagram illustrating the process performed by the edge enhancement engine on the thermal images to enhance the edges of objects therein to improve user vision and navigation a user in obscure environments, while reducing cognitive load of the user.
FIG. 6 is an illustration an example thermal image received from the TIC.
FIG. 7 is an illustration of an example gradient magnitude image generated from the thermal image ofFIG. 6.
FIG. 8 is a diagram graphically illustrating the gradient magnitude image partitioned into a set of non-overlapping subregions.
FIG. 9 is a diagram illustrating an example subregion histogram.
FIG. 10 is a diagram illustrating an example cumulative histogram generated from the gradient magnitude value histogram of one particular subregion.
FIG. 11 is a diagram graphically illustrating an example oflinear mapping1100 of subregion gradient magnitude values represented by line formed between control points (L, U) to predetermined reference gradient magnitude values.
FIG. 12 is a diagram graphically illustrating an example of one embodiment of the weight function used to interpolate the subregion mapping parameters before applying the mapping parameters to pixels (p) in a current subregion.
DETAILED DESCRIPTIONThe exemplary embodiment relates to methods and systems for incorporating contextual and physiological visualizations into electronic communications via a cognitive load reducing platform having image edge enhancement. The following description is presented to enable one of ordinary skill in the art to make and use the invention and is provided in the context of a patent application and its requirements. Various modifications to the exemplary embodiments and the generic principles and features described herein will be readily apparent. The exemplary embodiments are mainly described in terms of particular methods and systems provided in particular implementations. However, the methods and systems will operate effectively in other implementations. Phrases such as “exemplary embodiment”, “one embodiment” and “another embodiment” may refer to the same or different embodiments. The embodiments will be described with respect to systems and/or devices having certain components. However, the systems and/or devices may include more or less components than those shown, and variations in the arrangement and type of the components may be made without departing from the scope of the invention. The exemplary embodiments will also be described in the context of particular methods having certain steps. However, the method and system operate effectively for other methods having different and/or additional steps and steps in different orders that are not inconsistent with the exemplary embodiments. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features described herein.
In many critical, high-stress activities, such as firefighting, specialized tools have been developed to support challenging environments and critical objectives of crewmembers engaged in the high-stress activities. For the most part, these tools have evolved to support the crewmembers' physical needs—heat protection, airway protection, forcible entry, fire suppression, and the like. In the past 10-15 years, a greater focus has been placed on supporting the crewmembers' informational needs, including hazardous environment detection, communication, and safety alerting. For example, hearing aids, binoculars, and seismic sensors all increase the collection of information, but don't increase crewmembers' abilities to process or critically discern that extra information. Polarized glasses, gas monitors, thermal imagers, and the like all collect information, but still do not address the time and stress penalty required to absorb and interpret all that information. This “more is better” approach is both distracting and inefficient.
Unfortunately, often times stress is the limiting factor to crewmembers successfully completing these critical and dangerous activities. These are, by definition, high-stress environments and the difficulty in absorbing more and more information is made worse by stress. The health of the crewmembers is also compromised by stress, and regrettably contributes to a majority of crewmember fatalities every year.
The exemplary embodiments provide a cognitive load reducing platform having edge enhancement that leverages the principles of neuroscience and the tools of computer vision to reduce the cognitive load of a user and elevate human performance in high stress environments. The principles of neuroscience are used to integrate sensor data into the natural senses in a manner that is optimized for the task at hand, e.g. search and rescue, and computer vision supplies the means in one embodiment. The cognitive load reducing platform significantly enhances the crewmembers' or user's ability to make well informed decisions rapidly when operating in complex environments where cognitive abilities decline. A premise of the cognitive load reducing platform is that if thinking and understanding are easier for crewmembers, then crewmember can achieve objectives more rapidly, spend less time in harsh conditions, and have potentially reduced stress levels because of the real-time assurance or reinforcement of a human sense, i.e., vision, hearing and or touch. Example users of the cognitive load reducing platform include, but are not limited to, firefighters, surgeons, soldiers, police officers, search and rescue and other types of first responders.
FIG. 1 is a diagram illustrating one embodiment of a cognitive load reducing platform according to one embodiment. In one embodiment, the cognitive load reducing platform is a wearable electronic system worn on the body of a user when the user is engaged in complex, high stress environments that reduce cognitive abilities.
The cognitiveload reducing platform10 comprises, one or more sensors12a-12n(collectively sensors12) that collect information about an environment as sensor data. The information collected about the environment refers primarily to sensor data that can be used for navigation and detecting hazards, but also to a user's health status. In one embodiment, the sensors are worn by the crewmembers. For example, multiple sensors may be incorporated into a sensor package that is worn by one or more crewmembers. In another embodiment, other sensors may be remote from the crewmembers, such as on a drone equipped with a camera, gas detector, and the like.
Example categories of sensors includes situational awareness sensors and biometric sensors for health status. The situational awareness sensors measure data about the user's external environment for hazard detection and navigation. Examples of situational awareness sensors for hazard detection may include, but are not limited to: cameras (e.g., a TIC, a radiometric thermal camera, a drone camera), a spectrometer, a photosensor, magnetometer, a seismometer, a gas detector, a chemical sensor, a radiological sensor, a voltage detector, a flow sensor, a scale, a thermometer, a pressure sensor, and an acoustic sensor (e.g., for selective active noise cancellation inside the mask to facilitate radio communication). Examples of situational awareness sensors used for user navigation may include, but are not limited to: an inertial measurement unit (IMU), a GPS sensor, a speedometer, a pedometer, an accelerometer, an altimeter, a barometer, attitude indicator, a depth gauge, a compass (e.g., a fluxgate compass), a gyroscope, and the like. Examples of biometric sensors that measure health conditions/status of the user may include, but are not limited to: a heart rate sensor, a blood pressure monitor, a glucose sensor, an electrocardiogram (EKG or ECG) sensor, an electroencephalogram (EEG) sensor, an electromyography (EMG) sensor, a respiration sensor, and a neurological sensor.
The platform also includes a high-speed processor complex14 coupled to the sensors12. The high-speed processor complex14 includes amemory16, acommunication interface19, and one or more processors18, such as graphics processor units (GPUs). The processor/GPUs18 execute one more software-basedcognitive enhancement engines20 to process the sensor data from the sensors12 into enhanced characterization data that incorporate contextual and physiological visuals, auditory and/or haptic cues. The cognitiveload reducing platform200 is sensor agnostic and as any type of sensor can be added to the platform as long a correspondingcognitive enhancement engine20 is provided to process and present that sensor data. Thememory16 may contain sensor data collected from the one or more sensors12, and may store thecognitive enhancement engines20 for implementing the processes described herein when executed by the one or more processors18. Although theedge enhancement engines20 are described as single modules, the functionality provided by theedge enhancement engines20 may be performed by multiple modules.
The cognitiveload reducing platform10 further includes one ormore output devices22 coupled to theprocessor complex14 to electronically communicate the enhanced characterization data to the wearer. In one embodiment, theoutput devices22 may be implemented as a visual display, headphones/ear buds and/or a haptic device. The enhanced characterization data is thus integrated into natural senses of the wearer in a manner that is optimized for the performance of a specific task of the user—navigating in limited, to no, view environments.
Prior solutions increase the amount of information provided to the user's senses without specifically enhancing the brain's existing (and unmatched) cognitive ability to make sense of that information. The cognitiveload reducing platform10, in contrast, filters, summarizes, and focuses sensor data into the enhanced characterization data comprising contextual and physiological visuals, audio and/or haptic cues to create a new category called “Assisted Perception” that significantly reduces complexity and cognitive load (and accompanying stress)—and decreases Time-To-Clarity required to save lives. The cognitiveload reducing platform10 is designed to reduce risk, improve human safety, and save lives.
The cognitive load reducing platform supports the introduction of life-saving, Assisted Perception solutions to high-stress environments. One example use of this new category of Assisted Perception is as a firefighting vision system. In this embodiment, the cognitive load reducing platform is a real-time computer vision engine designed to aid first responders as they navigate smoke filled, hazardous environments with little or no visible light. In this embodiment, the cognitive load reducing platform increases the speed and safety of first responders in the field with a focus upon navigation and visual communication applications. The Assisted Perception of the cognitive load reducing platform dramatically enhances one's ability to make well informed decisions rapidly when operating in complex environments where cognitive abilities decline. The platform has shown performance improvements of 267% (e.g., reducing the time to complete mission critical search and rescue tasks from 4.5 mins to 1.7 mins).
Several emergency response systems are based on the use of a thermal camera and AR optics to provide a hands-free imaging system to the user. However, the cognitive load reducing platform provides a novel integrated design of these hardware and software elements into a system that efficiently integrates into natural human visual perception in a manner that decreases stress in the field. In the first responder embodiment, the platform combines a unique combination of enhanced thermal imaging, augmented reality (AR) or virtual reality (VR), and environment visualization and mapping capabilities.
FIGS. 2A and 2B are diagrams illustrating components of the cognitive load reducing platform in the firefighting vision system embodiment. Referring toFIG. 2A, in one embodiment, the cognitiveload reducing platform200 comprises two components: i) an assistedperception module221 integrated with a face mask224 (or simply “mask”) of the SCBA worn by each user/crewmember; and ii) a command andcontrol interface226 displayed on adisplay device228 to a person of authority, such as an incident commander. The command andcontrol interface226 enables the person of authority to manage the incident and the crewmembers wearing respective assistedperception modules221 by displaying the enhanced characterization data from each of the respective assisted perception modules.
Each of the assistedperception modules221 comprises a modular set of components including aTIC212, aprocessor complex214 in communication with theTIC212 for executing anedge enhancement engine220, and adisplay unit222, which is removably attached to themask224. In relation toFIG. 1, theTIC212 inFIG. 2A comprises one of the sensors12 of theplatform200, thedisplay unit222 comprises theoutput device22, and theedge enhancement engine220 comprises thecognitive enhancement engine20 executed on theprocessor complex214.
In the embodiment shown, thedisplay unit222 may comprise an augmented reality (AR) display unit, a virtual reality (VR) display unit, or a head-mounted projection display unit. In the AR embodiment, the AR display unit may comprise optical see through glasses that can be either binocular or monocular.FIG. 2A shows an embodiment where thedisplay unit222 comprises an AR monocular display.
FIG. 2B is a diagram showing an exploded view of the assistedperception module221 showing additional components in a wireless embodiment. TheTIC212 captures thermal image data of obstacles and objects251 and sends the thermal image data via acable250 to theprocessor complex14. The exploded view of theprocessor complex14 shows that theprocessor complex14 is powered bybatteries252 and thecommunication interface19 ofFIG. 1 is coupled to anantenna254 to wirelessly transmit enhanced image data to thedisplay unit222. Thedisplay unit222 further includes aprocessing board256 with anantenna258 to receive the enhanced image data wirelessly transmitted from theprocessor complex14. Theprocessing board256 relays the enhanced image data to a digital signal processing board260, which processes the enhanced image data for display on thedisplay unit222 directly in front of the user's eye.
As stated above, in one embodiment, the cognitive load reducing platform is a wearable electronic system. As such, there are many placement embodiments for the components of the cognitive load reducing platform. In most embodiments, all components are located on, or otherwise carried by, a user. For example,FIGS. 2A and 2B illustrate an embodiment where the TIC212 (sensors12) and the display unit222 (output device22) are carried by the user by virtue of being integrated withmask224. Theprocessor complex214 is also worn by the user, such as being clipped to a belt or clothing, stowed in a pouch or a pocket, or attached to a back frame of the SCBA.
In some embodiments, however, the sensors12 and/or theprocessor complex14 may be located remote from the user. As an example, consider the use case where a remote gas sensor controlled by a third party sends gas data over a network (e.g., Internet) to the cognitiveload reducing platform10. In this embodiment, the cognitiveload reducing platform10 may be implemented at least in part as a website where theprocessor complex14 is implemented as one or more servers that wirelessly receives sensor data of various types. In this embodiment, the gas sensor data from the remote gas sensor is pushed to theprocessor complex14 where the sensor data is processed locally by a correspondingcognitive enhancement engine20, which converts and outputs a brain optimized visual format for display to the user on thedisplay unit22. In one embodiment, a third party could collect and push the gas sensor data into the cognitive load reducing platform in the cloud.
There are also many communication embodiments for the components of the cognitive load reducing platform. For example, in the embodiment shown inFIGS. 2A and 2B, theTIC212 communicates with theprocessor complex214 over a wired connection viacable250. However, in an alternative embodiment, theTIC212 wirelessly communicates with theprocessor complex14. In the embodiment shown inFIGS. 2A and 2B, theprocessor complex214 communicates with the AR display unit over a wireless connection. However, in an alternative embodiment, theprocessor complex214 communicates with the AR display unit over a wired connection. In yet a further embodiment, theTIC212 and theprocessor complex214 may be integrated together within a common housing, in which case theTIC212 is in direct communication with theprocessor complex214, and the communication between theprocessor complex214 and thedisplay unit222 could be over a wired or wireless connection. In either embodiment, thedisplay unit222 would still communicate wirelessly with the centralcommand display device228.
In one embodiment, the display unit222 (including digital signal processing board260, processingboard256, and antenna258) is mounted inside themask224. However, in an alternative embodiment, thedisplay unit222 is mounted outside themask224. For example, the display itself may be positioned outside themask224, while the digital signal processing board260, processingboard256 andantenna258, may be worn by the user, such as being clipped to a belt or clothing, stowed in a pouch or a pocket, or attached to a back frame of the SCBA.
According to one aspect of the disclosed embodiments, theedge enhancement engine220 executed by theprocessor complex214 in the firefighting embodiment performs high speed processing on the thermal images from theTIC212 in real time into enhanced characterization data comprising a stream of wireframe images where edges of objects appearing in the thermal images have increased luminosity and contrast in relation to the thermal images. The display unit is worn in a line of sight of the user and electronically displays the enhanced characterization data, where the stream of the wireframe images appear as decluttered, enhanced line drawings to enable the user to see and navigate in obscure conditions, while reducing the cognitive load of the user. In one embodiment, the enhanced characterization data comprises a stream of AR/VR or wireframe images having enhanced the edges or outlines of objects that are projected on the display device in the user's field of view, so the user can see and effectively navigate in dark or smoke-filled environments without overwhelming the user's ability to process the displayed information. In this embodiment, the enhanced line drawing images produced by the platform dramatically enhance the user's ability to make well-informed decisions rapidly when operating in complex environments where cognitive abilities decline, such as a first responder (e.g., a fire fighter or search and rescue personnel).
FIGS. 3A and 3B are diagram illustrating example enhanced wireframe images displayed on thedisplay unit222. The enhanced wireframe images are shown on a transparent AR or head mounted projection display placed directly within the user's line of sight to provide real time navigation. The enhanced wireframe images of the disclosed embodiments have enhanced edges and a decluttered appearance after processing of thermal images/video by the cognitive load reducing platform. In other words, the enhanced images clarify the TIC video by highlighting key details with bright edges and minimizing the rest. Notice that theedge enhancement engine220 is not optimized to recognize objects in the thermal images nor to create continuous lines outlining the objects, but is rather optimized to reduce visual complexity by filtering out faint elements and boosting contrast throughout the image. A close review of the enhanced wireframe images inFIGS. 3A and 3B reveals some or all the edges outlining objects may have small breaks and thus may not meet the mathematical definition of a continuous line, but when viewed by a human, the brain easily completes the scene and can recognize simple objects sufficiently enough to navigate. In an alternative embodiment, theedge enhancement engine220 may create wireframe images having continuous lines outlining the objects.
The enhanced wireframe images overlayed in front of a user's eye in real time are designed to save lives by providing the user (e.g., a firefighter) with just the information the user needs, instead of overloaded the user as can be easily done in stressful situations. As shown, the assistedperception module221 enables the user to see in dark, smoke-filled environments. However, seeing through smoke is a side benefit to the value of the cognitive load reducing platform, which is to reduce the visual complexity of hazardous environments, while allowing individuals to more easily make sense of their surroundings.
The Assisted Perception provided by the cognitive load reduction platform leverages the principles of neuroscience to enhance aggregated sensor data in real-time to allow first responders to do their jobs significantly faster and more safely. The closest competitor to an infrared sensor-based, extreme environment tool would be the handheld or helmet mounted infrared camera and display systems. However, none of these systems offer any context-specific interpretive processing of the output, nor are they designed as true augmented reality interfaces that reduce the cognitive load of the user.
Referring again toFIG. 2A, the wireless capabilities of theprocessor complex14 is also used create a shared experience between the crewmembers and the incident commander (IC) via the command andcontrol interface226. According to the present embodiment, the command andcontrol interface226 runs on thedisplay device228 to the incident commander and is associated or paired to a group of theAR modules221 worn by crewmembers. Through its wireless capabilities, each assistedperception module221 streams its enhanced image to the command andcontrol interface226 on thedisplay device228 so that the incident commander can see in real time what the crewmembers see. Thus, the platform improves situational awareness on the front lines as well as to create a shared operating picture with the commander on the backend through a cognitive load reducing command andcontrol interface226 that aggregates information and presents insights to the commander via a simplified GUI based on similar principals of neuroscience.
FIG. 4 is a diagram illustrating an example command and control interface of the cognitive load reducing platform. In one embodiment, thedisplay device228 comprises a tablet computer or large mobile phone, and the command andcontrol interface226 comprises multiple sub-panels or frames for displaying the streams from each of theAR modules221 of the crewmembers (e.g., up to 8). There are two communication modes. In the first communication mode, the incident commander can select any sub-panel to engage in audio or visual icon based communication. In the second communication mode, the incident commander can broadcast to all of theAR modules221 equipped crewmembers in the field.
Traditional emergency response tools to aid the incident commander focus upon the incident commander's ability to integrate information unavailable to the crewmembers, and to then communicate these insights via radio channels. In contrast, the cognitive load reducing platform allows the incident commander to see the moment to moment visual experience of their crewmembers and to communicate back to them using visual cues displayed to crewmembers equipped withAR modules221. Consequently, the connected nature of the platform (streaming visual data betweenAR modules221 to the central command display device228) elevates the safety of the entire workspace by providing a shared operating picture between individuals in the field and leaders monitoring workers from the periphery.
Edge Enhancement
Referring again toFIGS. 2A and 2B, in the firefighting embodiment, theedge enhancement engine220 performs high speed processing on the thermal images from theTIC212 to enhance the edges or outlines of objects therein. In one embodiment, theedge enhancement engine220 utilizes histogram equalization algorithms designed to facilitate the human brain's ability to make rapid and accurate inferences related to navigation. The histogram equalization algorithms reduce and declutter thermal image data to a simple “line drawing” or “cartoon image” that leverages the underlying neuro-structure of vision to aid in navigation.
FIG. 5 is a flow diagram illustrating the process performed by theedge enhancement engine220 on the thermal images to enhance the edges of objects therein to improve user vision and navigation a user in obscure environments, while reducing cognitive load of the user.
The process may begin by receiving a series of one or more thermal images and generating a gradient magnitude image from the thermal images, the gradient magnitude image comprising a plurality of pixels having associated gradient magnitude values (block500). A series of the thermal images form a video that is streamed out ofTIC212 and into theprocessor complex214 in real-time. In one embodiment, each pixel in thermal image may be 8-bits in length and have 256 possible intensities.
FIG. 6 is an illustration an example thermal image received from the TIC. Thermal images are normally grayscale where black objects represent cold temperatures, white objects represent hot temperatures, and the depth of gray indicates variations between the two. Some thermal cameras, such as radiometric thermal cameras that measure surface temperatures, add color to the images to aid users in identifying objects at different temperatures. In the example shown, thethermal image600 is an 8-bit grayscale image having grayed edges and faint contours.
The brightness or luminosity of object edges in thethermal image600 are not uniform, ranging from very bright to very faint. When displayed to a user in this form, thethermal image600 makes it difficult for first responders to quickly determine what they are seeing, and then determine a course of action, such as determining how to quickly and safely navigate the environment. One goal of the disclosed embodiments is to generate wireframe images were object edges throughout the image have uniform brightness. In one embodiment, the object edges may have uniform thickness as well. In a further embodiment, however, the object edges may have varying thickness related to a perceived distance of the object that from the TIC/user. For example, object edges closer to the user may be displayed with increased thickness compared with object edges farther from the user.
In one embodiment, the gradient magnitude of each pixel in the thermal image may be extracted using a 2D Gaussian filter to create a smoothed image. Next, x- and y-axis gradients (gxand gy) may be approximated by convolving the smoothed image with orthogonal (e.g., vertical and horizontal) Sobel filters to determine how much gray levels in the thermal image change in vertical and horizontal directions, respectively. The output of the Sobel filters may be combined pixel-wise through a l2-norm filter to produce the gradient magnitude image.
FIG. 7 is an illustration of an example gradient magnitude image generated from the thermal image ofFIG. 6. Thegradient magnitude image700 is a local estimate of contrast per pixel. Brighter pixel levels in thegradient magnitude image700 indicate a change from a dark value to a bright value in the thermal image, and darker pixel levels in thegradient magnitude image700 indicate a change from a bright value to a dark value in the thermal image. More specifically, white pixels indicate a high change in luminosity in the initial thermal image, while black pixels indicate no change at all. Thegradient magnitude image700 has a significant amount of pixel noise from the TIC and discontinuities due to the 8-bit space.
Referring again toFIG. 5, thegradient magnitude image700 is an estimate of contrast per pixel location, but an edge or line of an object in the image is not defined by a single pixel, but rather a region of space. Therefore, referring again toFIG. 5, theedge enhancement engine220 continues by partitioning thegradient magnitude image700 into a set of subregions to analyze luminosity and contrast in regions of space in the gradient magnitude image700 (block502). In one embodiment, the gradient magnitude image is partitioned into a set of non-overlapping subregions.
FIG. 8 is a diagram graphically illustrating thegradient magnitude image700 partitioned into a set ofnon-overlapping subregions800. In an example embodiment where the thermal image comprises 640×480 pixels or 307,200 total pixels, thegradient magnitude image700 may be partitioned into 300subregions800, each 32×32 pixels is size for a total of 1024 pixels.
Referring again toFIG. 5, theedge enhancement engine220 calculates gradient magnitude value statistics for each of the subregions (block504). In one embodiment, calculating the gradient magnitude statistics comprises for each of the subregions, generating a subregion histogram of gradient magnitude values that counts the number of pixels having each gradient magnitude value (block504A); generating cumulative gradient magnitude value counts (block504B); and generating upper and lower gradient magnitude percentile values (block504C). In some embodiments, the gradient magnitude value statistics for the subregions may be computed in parallel.
FIG. 9 is a diagram illustrating anexample subregion histogram900. Onesubregion histogram900 is generated for each of thesubregions800. The x-axis of thesubregion histogram900 represents possible gradient magnitude values, and the y-axis is the number of pixels at each different gradient magnitude value found in the subregion. InFIG. 9, the peak in the graph in thesubregion histogram900 may indicate, for example, that a number “x” of pixels in the subregion have a gradient magnitude value of “y”. For an 8-bit image, each pixel has a gradient magnitude value ranging from 0 to 255, which is reflected on the x-axis. As another example, in a 16-bit image, each pixel has a gradient magnitude value ranging between 0 and 65,535.
Referring again toFIG. 5, generating the cumulative histogram of the gradient magnitude value counts for each of the subregions (block504B) may comprise performing a cumulative sum operation on each subregion gradient magnitude value histogram that returns a cumulative sum of the gradient magnitude value histogram of each subregion. The cumulative sum operation is a sequence of partial sums of a given sequence. For example, the cumulative sums of the sequence {a, b, c . . . }, are a, a+b, a+b+c, . . . .
FIG. 10 is a diagram illustrating an example cumulative histogram generated from the gradient magnitude value histogram of one particular subregion. In one embodiment, thecumulative histogram1000 of the gradient magnitude value histogram is represented as ϕ|g|. The x-axis represents the cumulative sums of the gradient magnitude value counts, which range between 0 and the number of pixels within the subregion. The y-axis shows the proportion of counts less than or equal to “x”.
According to one aspect of the disclosed embodiments, generating upper and lower gradient magnitude percentile values (FIG. 5, block504C) may comprise applying a predetermined lower bound percentage1002 and upper boundpercentage1004 to the cumulative histogram ϕ|g| for each of the subregions. The goal is to determine the gradient magnitude value at the lower bound percentage1002 and upper boundpercentage1004, which are parameters that remain the same across all the subregions, and therefore are referred to as invariants. In one embodiment, the lower bound percentile1002 is set between a 5thpercentile and a 20thpercentile, or a 10thpercentile in a particular embodiment. In one embodiment, the upper boundpercentile1004 is set between an 80thpercentile and a 95thpercentile, or a 90thpercentile in a particular embodiment.
In a further embodiment, to compensate for background noise from the TIC, an absoluteminimum percentile1006 may be placed upon the gradient magnitude values to censor noise. Basically, while theedge enhancement engine220 analyzes how statistics for the gradient magnitude values in the different subregions differ, the absoluteminimum percentile1006 is used to minimize the effect of uniform noise in the scene. The predetermined upper and lower boundgradient percentiles1002 and1004 and the absoluteminimum percentile1006 are applied to the gradient magnitude values across all the subregions.
The gradient magnitude value at apoint1008 where the cumulative histogram crosses the lower bound percentile1002 and apoint1010 where the cumulative histogram crosses the upper boundpercentile1004 may then be extracted to determine the gradient magnitude value at the 10thpercentile and the 90thpercentile. In one embodiment, “L” is set equal to thegradient magnitude value1008 at the lower bound percentile1002 (also represented as IV, and “U” is set equal to thegradient magnitude value1010 at the upper bound percentile1004 (also represented as |g|u). The output of this process are gradient magnitude value control points (L, M) for each of thesubregions800.
Referring again toFIG. 5, according to the disclosed embodiments, theedge enhancement engine220 calculates mapping parameters for each of the subregions to equalize and smooth the dynamic range of the corresponding gradient magnitude statistics across the subregions (block506). By equalizing and smoothing the dynamic range of the great magnitude statistics across the subregions, minimum and maximum gradient magnitude values in each subregion are made substantially similar or uniform. As used herein the gradient magnitude values reflect luminosity and contrast.
In one embodiment, the process of calculating the mapping parameters may comprise determining mapping parameters necessary to linearly map the control points (L, U) of eachsubregion800 to a predetermined reference gradient magnitude representing a desired dynamic range (block506A). Using this process, a spatially non-local description of contrast within each subregion is equalized in that region to match the desired dynamic range across all the subregions.
FIG. 11 is a diagram graphically illustrating an example oflinear mapping1100 of subregion gradient magnitude values represented by line1104 formed between control points (L, U) to predetermined reference gradient magnitude values represented by1102. As inFIG. 10, the x-axis represent the cumulative histogram of the gradient magnitude values, which range between 0 and 255 for the 8-bit pixels. The y-axis shows percentiles of gradient magnitude values, including the invariant predetermined lower bound percentile1002 and the upper boundpercentile1004. The reference gradient magnitude1104, the lower bound percentile1002 and the upper boundpercentile1004 remain the same across allsubregions800.
The reference gradient magnitude is represented asdiagonal line1102 formed with positive slope between the intersection of the lower bound percentile1002 (10th percentile) and gradient magnitude value |g|l, and the intersection of the upper bound percentile1004 (90thpercentile) and gradient magnitude value |g|u.
A subregion gradient magnitude is represented as a line1104 formed between control points (L, M) of the current subregion. The subregion gradient magnitude line1104 shows that the subregion is in an area of the gradient magnitude image that is dim and has low contrast (shown by the steep slope). Thelinear mapping process1100 finds mapping parameters that moves the upper value control point M of the subregion gradient magnitude line1104 to the right to align with the intersection of the upper boundpercentile1004 and |g|uof thegradient magnitude line1102; and moves the lower value control point L to the left toward the intersection of lower bound percentile1002 and |g|lof thegradient magnitude line1102. The mapping parameters will now match the subregion gradient magnitude with the reference gradient magnitude to provide improved luminosity and contrast.
In one embodiment, the linear mapping process is performed by determining an intercept and slope of the referencegradient magnitude line1102 and using the intercept and slope to create linear mapping parameters to apply to each pixel in the subregion. In one embodiment, the linear mapping parameters are calculated as:
x′=f(α,β,x); andf(α,β,x)=βx+α,
where x=a current pixel gradient magnitude value, α=intercept, and β=slope. The result of this linear mapping is the equalized pixel value x′.
Referring again toFIG. 5, the linear mapping parameters (intercept α and slope β) to be applied to each pixel of the subregion are interpolated as a function of each pixel's computed distance to neighboring subregions to generate smoothed mapping parameters to compensate for arbitrary subregion boundaries (block506B). In one embodiment, a 2D nearest neighbor interpolation scheme may be used. In another embodiment, a convolution process using a sliding window may be performed, but with the penalty of computational cost. In one embodiment, the computation of the distances between pixels in a subregion to neighboring subregions may be performed prior to generating the mapping parameters.
In one embodiment, the 2D nearest neighbor interpolation uses a weight function when performing the mapping parameter interpolation to give gradient magnitude values from closest subregions more “weight” or influence on the gradient magnitude values of pixels in the current subregion over other subregions.
FIG. 12 is a diagram graphically illustrating an example of one embodiment of the weight function used to interpolate the subregion mapping parameters before applying the mapping parameters to pixels (p) in acurrent subregion1200. First, a set of anchor points (wi) are defined that are to be applied to all of the subregions. In one embodiment, a set of i=9 anchor points (w0-w8) are defined, which are located at a center, four corners, and the four midpoints of each side of thesubregion1200. Next, for each pixel (p) in each respective subregion, weights are calculated for each anchor point by calculating the distance (d0-d8) between the current pixel and each of the anchor points representing neighboring subregions. In one embodiment, the weights are inversely proportional to the calculated distances.
Finally, the smoothed mapping parameters (intercept α and slope β) are calculated for each pixel (p) in the respective subregion as a weighted sum of α and β for the anchor points wi:
∝p=Ei=08f(wi)*∝iand βp=Ei=08f(wi)*βi.
Referring again toFIG. 5, theedge enhancement engine220 applies the mapping parameters calculated for each of the subregions to pixels in the corresponding subregions to generate enhanced gradient magnitude values having equalized luminosity in contrast, and using the enhanced gradient magnitude values to form a wireframe image having enhanced edges of objects (block508). In one embodiment, the smoothed mapping parameters generated for each pixel (p) are applied to the respective pixels in the gradient magnitude image to create the wireframe image (See examples shown inFIGS. 3A and 3B).
The wireframe image is then displayed on a display device to the user, wherein the wireframe image appears as a decluttered line drawing where the enhanced edges have equalized luminosity and contrast compared to the thermal image to reduce the cognitive load of the user (block510). Preferably, the wireframe image is displayed as a video stream in real time on the display device, such asdisplay unit222, and in a line of sight of the user's eye.
In one embodiment, displaying the wireframe image on a display device, further comprises first translating the enhanced gradient magnitude values from a perspective of the TIC that captured the thermal image to a viewing perspective of an eye of the user (block510A). In one embodiment, a transformation operator may perform an affine transformation that is user configurable through a set of controls (e.g., a keyboard). The affine transformation (T) may be implemented as follows:
x′=Tt(x);Tt(x)=Tt-1(x)+δt(x),
where x=a current pixel enhanced gradient magnitude value, and Ttis the current transformation, which is incrementally adjusted from Tt-1(x) by δt(x) until appropriate for the user. Tt(x) is initialized to identity. The transformed gradient magnitude values may then be displayed as the wireframe image.
A method and system for a cognitive load reducing platform having image edge enhancement has been disclosed. The present invention has been described in accordance with the embodiments shown, and there could be variations to the embodiments, and any variations would be within the spirit and scope of the present invention. For example, the exemplary embodiment can be implemented using hardware, software, a computer readable medium containing program instructions, or a combination thereof. Accordingly, many modifications may be made by one of ordinary skill in the art without departing from the spirit and scope of the appended claims.