Mimicking the easy instantaneous image processing power of the human eye

via Phys.org

On a cold, sunny day, you’re driving on a rural road, surrounded by snow-covered fields. In an instant, your eyes process the scene, picking out individual objects to focus on — a stop sign, a barn — while the rest of the scene blurs in the periphery. Your brain stores the focused and blurred images as a memory that can be pictured in your mind later, while sitting at your desk.

Mimicking this easy, instantaneous image processing power of the human eye, Penn State electrical engineering researchers created a metasurface: an optical element akin to a glass slide that uses tiny nanostructures placed at different angles to control light. Led by corresponding author Xingjie Ni, associate professor of electrical engineering and computer science (EECS) at Penn State, the team published their invention in Nature Communications.

Artificial intelligence (AI) systems require significant computing power and energy and can be slow to process images and identify objects, according to the researchers. By contrast, the metasurface can be used to preprocess and transform images before they are captured by a camera, allowing a computer — and AI — to process them with minimal power and data bandwidth.

The metasurface works by converting an image from the Cartesian coordinate system, where image pixels are arranged in straight rows and columns along the x and y axes, to the log-polar system, which uses a bullseye-like pixel distribution.
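The metasurface performs this conversion optically, but the same coordinate change can be emulated in software to see what it does. The following is a minimal sketch under stated assumptions, not the researchers' method: the function name `to_log_polar`, the grid sizes, the `r_min` cutoff and the nearest-neighbor sampling are all illustrative choices that do not come from the article.

```python
import numpy as np

def to_log_polar(image, n_rho=64, n_theta=64, r_min=1.0):
    """Resample a 2-D grayscale image onto a log-polar grid.

    Hypothetical helper for illustration; uses nearest-neighbor sampling.
    Rows of the result index log-radius, columns index angle.
    """
    h, w = image.shape
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0   # image center
    r_max = min(cx, cy)                      # largest ring that fits

    # Rings are evenly spaced in log(r), so they crowd together near the
    # center and spread out toward the edge (the "bullseye" layout).
    rho = np.linspace(np.log(r_min), np.log(r_max), n_rho)
    theta = np.linspace(0.0, 2.0 * np.pi, n_theta, endpoint=False)

    r = np.exp(rho)[:, None]                 # shape (n_rho, 1)
    x = cx + r * np.cos(theta)[None, :]      # shape (n_rho, n_theta)
    y = cy + r * np.sin(theta)[None, :]

    # Nearest Cartesian pixel for every log-polar sample point.
    xi = np.clip(np.rint(x).astype(int), 0, w - 1)
    yi = np.clip(np.rint(y).astype(int), 0, h - 1)
    return image[yi, xi]

# Example: a 101 x 101 test image whose value is the distance from the center.
yy, xx = np.mgrid[0:101, 0:101]
radial = np.hypot(yy - 50.0, xx - 50.0)
log_polar_image = to_log_polar(radial)
```

In this sketch, half of the 64 rings fall within about 7 pixels of the center of a 101-pixel-wide image, which is the software analogue of the dense central sampling described above.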

“Like the arrangement of light receptors inside the human eye, the metasurface takes images and arranges them in a log-polar coordinate system — with denser pixels for the central, focused features and sparser pixels for the peripheral regions,” Ni said. “This allows for the more important aspects of a photo to come through clearly while others remain less in focus, thereby saving data bandwidth.”

The metasurface is placed in front of a camera so that light passes through it first, transforming the image from Cartesian into log-polar coordinates before it is digitized by the camera and transferred to a computer. Because it relies on nanostructures that bend light, the metasurface needs no power and works at the speed of light.

“As an image of an object can vary in size or orientation, it is desirable to preprocess images to make them resistant to scale and rotation changes,” Ni said. “This preprocessing helps AI applications more easily recognize them as the same object.”
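The reason log-polar coordinates help with scale and rotation is a standard property of the transform: scaling an object by a factor s adds ln(s) to the log-radius of every point, and rotating it by an angle adds that angle to every point's polar angle. Both changes become plain shifts, which are easier for downstream software to handle. A small numerical check, with arbitrarily chosen points and an assumed helper name `log_polar`, might look like this:

```python
import numpy as np

def log_polar(x, y):
    """Return (log-radius, angle) for Cartesian points; illustrative helper."""
    return np.log(np.hypot(x, y)), np.arctan2(y, x)

# Three arbitrary points standing in for features of an object.
points = np.array([[3.0, 1.0], [-2.0, 4.0], [0.5, -1.5]])

# Scale the object by 2 and rotate it by 30 degrees about the origin.
s, phi = 2.0, np.pi / 6
rotation = np.array([[np.cos(phi), -np.sin(phi)],
                     [np.sin(phi),  np.cos(phi)]])
moved = s * points @ rotation.T

rho0, theta0 = log_polar(points[:, 0], points[:, 1])
rho1, theta1 = log_polar(moved[:, 0], moved[:, 1])

# Every point shifts by the same amount along each log-polar axis.
d_rho = rho1 - rho0                          # each entry is ln(2)
d_theta = np.mod(theta1 - theta0, 2 * np.pi) # each entry is pi/6
```

Because every point moves by the same offset, the transformed object keeps its shape in the log-polar image and only slides along the two axes.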

By placing a different metasurface in front of a camera, researchers also can transform the log-polar image back into the original image with Cartesian coordinates.
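In software, the reverse mapping can be emulated by looking up, for each Cartesian pixel, the log-polar sample nearest to it. This is again only a sketch under assumptions: `from_log_polar`, its parameters and the nearest-neighbor lookup are hypothetical, and the article does not describe how the second metasurface is designed.

```python
import numpy as np

def from_log_polar(lp, out_shape, r_min=1.0):
    """Map a log-polar image (rows: log-radius, columns: angle) back to a
    Cartesian grid. Hypothetical helper using nearest-neighbor lookup."""
    n_rho, n_theta = lp.shape
    h, w = out_shape
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    r_max = min(cx, cy)

    yy, xx = np.mgrid[0:h, 0:w]
    dx, dy = xx - cx, yy - cy
    r = np.hypot(dx, dy)
    theta = np.mod(np.arctan2(dy, dx), 2.0 * np.pi)

    # Fractional ring index: where log(r) falls between log(r_min) and log(r_max).
    r_safe = np.clip(r, r_min, r_max)
    span = np.log(r_max) - np.log(r_min)
    ring = (np.log(r_safe) - np.log(r_min)) / span * (n_rho - 1)
    spoke = theta / (2.0 * np.pi) * n_theta

    ii = np.rint(ring).astype(int)
    jj = np.rint(spoke).astype(int) % n_theta   # wrap around at 2*pi

    out = lp[ii, jj]
    out[r > r_max] = 0   # corners outside the largest ring carry no data
    return out

# Example: a log-polar image whose value equals its ring index.
rings = np.tile(np.arange(32.0)[:, None], (1, 48))
restored = from_log_polar(rings, (65, 65))
```

Pixels far from the center are reconstructed from sparser samples, so the restored image is sharpest in the middle and coarser toward the edges.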

The invention has many potential applications, the researchers said, including target tracking and surveillance, for example to map how a car moves across a city.

“A metasurface can be used in tandem with AI systems as a preprocessor, making it easier to recognize the same car from multiple street view cameras,” Ni said. “Or if it is applied to a satellite, it could potentially track planes from takeoff to landing.”


Original Article: Optical invention mirrors the image processing power of a human eye
