LRCN

E899022

LRCN is a deep learning architecture that combines convolutional neural networks with recurrent neural networks to model and interpret visual sequences such as video and image descriptions.

All labels observed (1)

Label Occurrences
LRCN canonical 1

How this entity was disambiguated

Statements (45)

Predicate Object
instanceOf deep learning architecture
appliedTo activity recognition
image captioning
image sequences
video data
video description
visual sequences
architecturePattern CNN followed by RNN
canUsePretrained CNN backbone
captures long-term temporal dependencies in visual data
combines spatial feature learning and temporal modeling
domain computer vision
multimodal learning
sequence modeling
featureExtractionBy convolutional neural network GENERATED
feeds frame-level CNN features into RNN
fullName Long-term Recurrent Convolutional Network NERFINISHED
handles variable-length input sequences
variable-length output sequences
inputType sequence of images
sequence of video frames
introducedInField deep learning for video understanding
learningType supervised learning
models spatiotemporal data
temporal dynamics of visual features
outputType class label sequence
natural language description
relatedTo RNN-based sequence models
encoder-decoder architectures
image captioning models
video captioning models
represents each frame with CNN features
sequenceModelingBy LSTM network NERFINISHED
recurrent neural network
supports one-to-sequence learning
sequence-to-one learning
sequence-to-sequence learning
trainingObjective minimize prediction loss over sequences
usedFor end-to-end training on image captioning
end-to-end training on video tasks
usesComponent CNN NERFINISHED
LSTM NERFINISHED
RNN NERFINISHED
convolutional neural network
recurrent neural network

How these facts were elicited

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.