Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks
Whereas deep neural networks were at first used mostly for classification tasks, they are rapidly expanding into the realm of structured output problems, where the observed target is composed of multiple random variables that have a rich joint distribution given the input. We focus in this paper on the case where …