Show and Tell
E899056
Show and Tell is a neural network-based image captioning model developed by Google that automatically generates natural language descriptions for images.
Observed surface forms (1)
| Surface form | Occurrences |
|---|---|
| Show & Tell | 1 |
Statements (47)
| Predicate | Object |
|---|---|
| instanceOf |
deep learning model
ⓘ
image captioning model ⓘ neural network ⓘ |
| achieves | state-of-the-art performance on MSCOCO (2015) ⓘ |
| approach |
end-to-end training
ⓘ
maximum likelihood training ⓘ |
| author |
Alexander Toshev
NERFINISHED
ⓘ
Dumitru Erhan NERFINISHED ⓘ Oriol Vinyals NERFINISHED ⓘ Samy Bengio NERFINISHED ⓘ |
| basedOn |
convolutional neural network
ⓘ
encoder-decoder architecture ⓘ recurrent neural network ⓘ sequence-to-sequence model ⓘ |
| captionStyle | descriptive sentences ⓘ |
| developer |
Google
ⓘ
Google Research NERFINISHED ⓘ |
| domain |
computer vision
ⓘ
multimodal learning ⓘ natural language processing ⓘ |
| evaluationMetric |
BLEU
NERFINISHED
ⓘ
CIDEr NERFINISHED ⓘ METEOR ⓘ |
| featureExtractor | Inception CNN GENERATED ⓘ |
| implementedIn | TensorFlow (research implementation) NERFINISHED ⓘ |
| influenced |
Neural image captioning research
ⓘ
Show, Attend and Tell NERFINISHED ⓘ |
| input | image ⓘ |
| language | English ⓘ |
| learningType | supervised learning ⓘ |
| modality | vision-to-language ⓘ |
| optimizationAlgorithm | stochastic gradient descent ⓘ |
| organization | Google Brain NERFINISHED ⓘ |
| output | natural language caption ⓘ |
| paperTitle | Show and Tell: A Neural Image Caption Generator NERFINISHED ⓘ |
| publicationVenue | CVPR NERFINISHED ⓘ |
| publicationYear | 2015 ⓘ |
| relatedTo | encoder-decoder neural networks for machine translation ⓘ |
| task |
automatic image captioning
ⓘ
natural language description generation ⓘ |
| trainedOn |
Flickr30k dataset
NERFINISHED
ⓘ
Flickr8k dataset NERFINISHED ⓘ MSCOCO dataset ⓘ |
| uses |
CNN encoder
ⓘ
LSTM NERFINISHED ⓘ RNN decoder ⓘ word embeddings ⓘ |
Referenced by (2)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form:
Show & Tell