论文交流 >  机器视觉

Image Question Answering Using Convolutional Neural Network With Dynamic Parameter Prediction

使用CNN和动态参数预测实现图像问答

1年前 1048 0  点赞 (0)  收藏 (0)

研究领域: 机器视觉   CVPR2016

应用方向: 图像问答

原理方法:

软件实现:

论文摘要:

We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions. For the adaptive parameter prediction, we employ a separate parameter prediction network, which consists of gated recurrent unit (GRU) taking a question as its input and a fully-connected layer generating a set of candidate weights as its output. However, it is challenging to construct a parameter prediction network for a large number of parameters in the fully-connected dynamic parameter layer of the CNN. We reduce the complexity of this problem by incorporating a hashing technique, where the candidate weights given by the parameter prediction network are selected using a predefined hash function to determine individual weights in the dynamic parameter layer. The proposed network—joint network with the CNN for ImageQA and the parameter prediction network— is trained end-to-end through back-propagation, where its weights are initialized using a pre-trained CNN and GRU. The proposed algorithm illustrates the state-of-the-art performance on all available public ImageQA benchmarks.

论文精要:

论文点评 

您可以在评论中对论文进行“摘要翻译” “标签备注” “精要点评” “疑难提问”,我们会及时更新到数据库中!

任何论文都是 "在特定领域内"、"基于某种学术原理"、"研究某个应用问题", 因此分 领域标签 / 应用标签 / 原理标签 / 补充标签