The 'YoTube' detector helps make AI more human-centered. Photo: iStock
Image analysis technology will need to become better at understanding human intentions if it is to be employed in a wide range of applications, says Hongyuan Zhu, a computer scientist at A*STAR's Institute for Infocomm Research, who led the study. To drive safely, driverless cars must be able to detect police officers and interpret their actions quickly and accurately, he explains. Autonomous systems could also be trained to identify suspicious activities such as fighting, theft, or dropping dangerous items, and alert security officers.
Computers are already extremely good at detecting objects in static images, thanks to deep learning techniques, which use artificial neural networks to process complex image information. But videos with moving objects are more challenging. "Understanding human actions in videos is a necessary step to build smarter and friendlier machines," says Zhu.
Additional resources
Hongyuan Zhu et al. YoTube: Searching Action Proposal Via Recurrent and Static Regression Networks, IEEE Transactions on Image Processing (2018).
DOI: 10.1109/TIP.2018.2806279
Detecting 'deepfake' videos in the blink of an eye by Siwei Lyu, Associate Professor of Computer Science; Director, Computer Vision and Machine Learning Lab, University at Albany, State University of New York
"The new technology behind machine learning-enhanced fake videos has a crucial flaw: Computer-generated faces don't blink as often as real people do."
Source: Phys.Org