thirdwave

New Machine Vision Software Describing Images

We present a model that generates free-form natural language descriptions of image regions. Our model leverages datasets of images and their sentence descriptions to learn about the inter-modal correspondences between text and visual data.

The text in blue is generated by the new model. Very cool.