CMU 11-731(MT&Seq2Seq) Feed-Forward Language Models

NN is a good Universal Function Approximator. 😸

Potential and Problems with Combination Features

  • separate feauture set
    • can not rule out unnatural phase
  • combining feature set
    • expand the parameters
  • corresponding algorithms
    • kernalized svm
    • neural networks

Overview

  • MLP
  • Space mapping

Train NN

  • Non-linear function

    • ReLU
    • Tanh (saturated)
  • How to do complicated derivation on computer ?

  • Toolbox

    • Chainer
    • DyNet
    • MxNet
    • PyTorch
    • TensorFlow
    • Theano
  • Why good?

    • Better generalization of contexts
    • More generalizable combination of words into contexts
    • Ability to skip previous words

Further Reading

Share