Bengio et al., 2003

Study / Research

Influential paper that, in the context where it is cited, introduces a multi-layer perceptron approach to predicting the next character or token in a sequence. It discusses learned embedding representations of tokens and a neural network that uses them for sequence modeling.
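
As a rough illustration of the idea described above, the following is a minimal sketch of a forward pass only: each token in a fixed-size context is looked up in an embedding table, the embeddings are concatenated, passed through one tanh hidden layer, and turned into a probability distribution over the next token. The class name, the layer sizes, the four-symbol vocabulary, and the random initialization are all assumptions made for illustration; they are not taken from the paper, and training is omitted entirely.

```python
import math
import random


def make_matrix(rows, cols, rng, scale=0.1):
    """Return a rows x cols matrix of small random weights."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def vecmat(vec, mat):
    """Multiply a length-n row vector by an n x m matrix, giving a length-m list."""
    return [sum(v * row[j] for v, row in zip(vec, mat)) for j in range(len(mat[0]))]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class TinyMLPLanguageModel:
    """Hypothetical embedding + MLP next-token model (untrained, forward pass only)."""

    def __init__(self, vocab_size, context_len=3, embed_dim=2, hidden_dim=8, seed=0):
        rng = random.Random(seed)
        self.context_len = context_len
        # Embedding table: one learned vector per token in the vocabulary.
        self.C = make_matrix(vocab_size, embed_dim, rng)
        # Hidden layer takes the concatenated context embeddings as input.
        self.W1 = make_matrix(context_len * embed_dim, hidden_dim, rng)
        self.b1 = [0.0] * hidden_dim
        # Output layer produces one logit per vocabulary entry.
        self.W2 = make_matrix(hidden_dim, vocab_size, rng)
        self.b2 = [0.0] * vocab_size

    def next_token_probs(self, context):
        """Return P(next token | context) for a list of token indices."""
        if len(context) != self.context_len:
            raise ValueError("context must have exactly context_len tokens")
        # Look up and concatenate the embeddings of the context tokens.
        x = [value for token in context for value in self.C[token]]
        h = [math.tanh(a + b) for a, b in zip(vecmat(x, self.W1), self.b1)]
        logits = [a + b for a, b in zip(vecmat(h, self.W2), self.b2)]
        return softmax(logits)


# Assumed toy vocabulary: "." as a boundary marker plus three characters.
vocab = ["."] + list("abc")
model = TinyMLPLanguageModel(vocab_size=len(vocab))
probs = model.next_token_probs([0, 1, 2])  # context ".", "a", "b"
```

Because the weights here are random, `probs` is close to uniform; in a trained model the embedding table and both layers would be fit jointly so that the distribution concentrates on likely continuations.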

Mentioned in 1 video