Яндекс Метрика
Языковая модель

genCNN + dyn eval

Chinese Academy of Sciences,Huawei Noah's Ark Lab,Dublin City University
Языковое моделирование

Модель genCNN предлагает альтернативный взгляд на языковое моделирование, используя сверточные сети вместо привычных RNN. Благодаря механизму динамической оценки (dyn eval), этот ИИ эффективнее предсказывает следующее слово, анализируя историю текста переменной длины.

We propose a convolutional neural network, named genCNN, for word sequence prediction. Different from previous work on neural networkbased language modeling and generation (e.g., RNN or LSTM), we choose not to greedily summarize the history of words as a fixed length vector. Instead, we use a convolutional neural network to predict the next word with the history of words of variable length. Also different from the existing feedforward networks for language modeling, our model can effectively fuse the local correlation and global correlation in the word sequence, with a convolution-gating strategy specifically designed for the task. We argue that our model can give adequate representation of the history, and therefore can naturally exploit both the short and long range dependencies. Our model is fast, easy to train, and readily parallelized. Our extensive experiments on text generation and n-best re-ranking in machine translation show that genCNN outperforms the state-ofthe-arts with big margins.

Что такое genCNN + dyn eval?+
Кто разработал genCNN + dyn eval?+
Какие задачи решает genCNN + dyn eval?+