Surgery on an Attentional Neural Network
Customising an LSTM model to better understand Attention in sequence-to-sequence text prediction
Explaining the concept of ‘Attention’ in natural language processing models by removing part of the memory function of a recurrent neural network encoder-decoder