Show and Tell

E899056

Show and Tell is a neural network-based image captioning model developed by Google that automatically generates natural language descriptions for images.

Observed surface forms (1)

Surface form Occurrences
Show & Tell 1

Statements (47)

Predicate Object
instanceOf deep learning model
image captioning model
neural network
achieves state-of-the-art performance on MSCOCO (2015)
approach end-to-end training
maximum likelihood training
author Alexander Toshev
Dumitru Erhan
Oriol Vinyals
Samy Bengio
basedOn convolutional neural network
encoder-decoder architecture
recurrent neural network
sequence-to-sequence model
captionStyle descriptive sentences
developer Google
Google Research
domain computer vision
multimodal learning
natural language processing
evaluationMetric BLEU
CIDEr
METEOR
featureExtractor Inception CNN
implementedIn TensorFlow (research implementation)
influenced Neural image captioning research
Show, Attend and Tell
input image
language English
learningType supervised learning
modality vision-to-language
optimizationAlgorithm stochastic gradient descent
organization Google Brain
output natural language caption
paperTitle Show and Tell: A Neural Image Caption Generator
publicationVenue CVPR
publicationYear 2015
relatedTo encoder-decoder neural networks for machine translation
task automatic image captioning
natural language description generation
trainedOn Flickr30k dataset
Flickr8k dataset
MSCOCO dataset
uses CNN encoder
LSTM
RNN decoder
word embeddings
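
The "maximum likelihood training" and "RNN decoder" statements above can be illustrated with a minimal sketch. It assumes the decoder emits a probability distribution over the vocabulary at every step and that the training loss for one caption is the sum of the negative log-probabilities of its reference words. The function name, the dict-based distributions, and the toy vocabulary are hypothetical and are not taken from the research implementation.

```python
import math


def caption_negative_log_likelihood(step_distributions, caption_tokens):
    """Return the sum of -log p(word_t | image, earlier words) over a caption.

    step_distributions[t] is a hypothetical dict mapping each vocabulary word
    to the probability the decoder assigns to it at step t.
    """
    if len(step_distributions) != len(caption_tokens):
        raise ValueError("need exactly one distribution per caption token")
    total = 0.0
    for dist, token in zip(step_distributions, caption_tokens):
        prob = dist.get(token, 0.0)
        if prob <= 0.0:
            # The reference word received zero probability: infinite loss.
            return math.inf
        total -= math.log(prob)
    return total


# Toy example (assumed values): three-word vocabulary, two-token caption.
toy_distributions = [
    {"dog": 0.5, "cat": 0.25, "<end>": 0.25},
    {"dog": 0.25, "cat": 0.25, "<end>": 0.5},
]
toy_loss = caption_negative_log_likelihood(toy_distributions, ["dog", "<end>"])
```

Minimizing this quantity over a training set is equivalent to maximizing the likelihood of the reference captions. In a real system the per-step distributions would come from the LSTM decoder conditioned on the CNN image features rather than from hand-written dicts.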

Referenced by (2)

Full triples — surface form annotated when it differs from this entity's canonical label.

African Giant hasPart Show and Tell
this entity surface form: Show & Tell