Design my own syllabus for self-study during the program.
Themes: techniques in generative text; learning song representations; language/sequence modeling enhanced by reinforcement learning or GANs
[OpenAI Scholars intro blog post] [Designing a syllabus blog post]
Get and understand data; set up the environment; build an n-gram model as a ‘shallow’ baseline (a quick sketch follows this week’s reading list); read LSTM, seq2seq, and PyTorch tutorials
Resources:
- Deep Learning book, chapter 5 (machine learning basics)
- Speech and Language Processing book, chapter 4 (language modeling with n-grams)
- "The unreasonable effectiveness of Character-level Language Models (and why RNNs are still cool)" by Yoav Goldberg [blog]
- Deep Learning with PyTorch: A 60 Minute Blitz [tutorial]
Optional:
- "The Unreasonable Effectiveness of Recurrent Neural Networks" by Andrej Karpathy [blog]
- Goodman, J. (2001). A Bit of Progress in Language Modeling. [paper]
- Practical PyTorch Series 1: RNNs for NLP [tutorial]
- Also has a good recommended reading section
- Speech and Language Processing book, chapter 8 (neural networks and neural language models)
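As a concrete warm-up, here is a minimal sketch of the kind of character-level n-gram baseline I have in mind: count (n-1)-character histories, then sample from the resulting conditional distributions. The corpus file and the order n are placeholders.

```python
from collections import Counter, defaultdict
import random

def train_char_ngram(text, n=4):
    """Count (n-1)-character histories -> next-character frequencies."""
    counts = defaultdict(Counter)
    padded = "~" * (n - 1) + text  # simple start-of-text padding
    for i in range(len(padded) - n + 1):
        history, nxt = padded[i:i + n - 1], padded[i + n - 1]
        counts[history][nxt] += 1
    return counts

def generate(counts, n=4, length=300):
    """Sample text from the conditional distributions, starting from padding."""
    history, out = "~" * (n - 1), []
    for _ in range(length):
        dist = counts.get(history)
        if not dist:  # unseen history: stop rather than back off
            break
        chars, weights = zip(*dist.items())
        nxt = random.choices(chars, weights=weights)[0]
        out.append(nxt)
        history = (history + nxt)[-(n - 1):]
    return "".join(out)

corpus = open("lyrics.txt").read()   # placeholder corpus file
model = train_char_ngram(corpus, n=4)
print(generate(model, n=4))
```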
Define metrics for evaluating language model (LM) output; train an RNN (LSTM) on some sequential text data (a minimal training-and-perplexity sketch follows this week’s reading list)
Resources:
- Deep Learning book, chapters 10 (sequence modeling) and 11 (practical methodology)
- Graves, A. (2013). Generating sequences with recurrent neural networks. [paper]
- Merity, S., Keskar, N. S., Socher, R. (2017). Regularizing and Optimizing LSTM Language Models. [paper] [code]
- PyTorch: Generating Names with a Character-Level RNN [tutorial]
- Deep Learning for NLP with PyTorch [tutorial]
- Natural Language Processing book (draft), chapter 6 (language models)
Optional:
- course.fast.ai lessons 6 (RNNs) and 7 (GRUs, LSTMs)
- Karpathy, A. (2015). Visualizing and Understanding Recurrent Networks. [video] [paper]
- Olah, C. (2014). Deep Learning, NLP, and Representations. [blog]
- Bengio, Y. (2003). A Neural Probabilistic Language Model. [blog] [paper]
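To make the metrics part concrete, a minimal sketch (nowhere near the regularized AWD-LSTM of Merity et al.) of one training step for a character-level LSTM in PyTorch, reporting perplexity as exp of the mean cross-entropy. The vocabulary size and the random batch of character ids are placeholders.

```python
import torch
import torch.nn as nn

class CharLSTM(nn.Module):
    """Character-level LSTM language model."""
    def __init__(self, vocab_size, emb_dim=64, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, x, state=None):
        h, state = self.lstm(self.embed(x), state)
        return self.out(h), state

vocab_size = 100                                   # placeholder
model = CharLSTM(vocab_size)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

# One training step on a random batch of character ids (placeholder data);
# targets are the inputs shifted left by one position.
batch = torch.randint(0, vocab_size, (32, 101))
inputs, targets = batch[:, :-1], batch[:, 1:]

optimizer.zero_grad()
logits, _ = model(inputs)
loss = criterion(logits.reshape(-1, vocab_size), targets.reshape(-1))
loss.backward()
optimizer.step()

# Perplexity = exp(mean cross-entropy per character)
print(f"loss {loss.item():.3f}  perplexity {torch.exp(loss).item():.1f}")
```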
Learn song representations from structured song data:
- Genres from Genius.com
- Audio features from Spotify
Resources:
- Learning song representations with deep learning for structured data
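A rough sketch of what “deep learning for structured data” could look like here: embed a categorical field (e.g., a Genius genre id) and concatenate it with Spotify’s numeric audio features, then map through a small MLP to a single song vector. All field names and sizes below are made up for illustration.

```python
import torch
import torch.nn as nn

class SongEncoder(nn.Module):
    """Map (genre id, audio-feature vector) to a dense song representation."""
    def __init__(self, n_genres=50, n_audio_features=11, out_dim=64):
        super().__init__()
        self.genre_emb = nn.Embedding(n_genres, 16)
        self.mlp = nn.Sequential(
            nn.Linear(16 + n_audio_features, 128),
            nn.ReLU(),
            nn.Linear(128, out_dim),
        )

    def forward(self, genre_id, audio_features):
        x = torch.cat([self.genre_emb(genre_id), audio_features], dim=-1)
        return self.mlp(x)

encoder = SongEncoder()
genre_id = torch.tensor([3])          # hypothetical genre index from Genius tags
audio = torch.rand(1, 11)             # e.g., Spotify danceability, energy, valence, ...
song_vec = encoder(genre_id, audio)   # shape (1, 64)
```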
Fun extra blog post #1: Train an LSTM on song titles and something stodgy, like deep learning paper titles
- Inspired by ‘AI scream for ice cream’ (aiweirdness generating metal band ice cream flavors)
- Related posts: AIWeirdness: 'Generated ice cream flavors: now it’s my turn'; Kottke: ‘Ask an Ice Cream Professional’; Janelle’s Twitter thread on it
Resources:
- Deep Learning book, chapter 14 (autoencoders)
- Sutskever, I., Vinyals, O., and Le, Q. V. (2014). Sequence to sequence learning with neural networks. [paper]
- Bowman, S. R., Vilnis, L., Vinyals, O., Dai, A. M., Jozefowicz, R., Bengio, S. (2016). Generating Sentences from a Continuous Space. [paper]
- Introduction to Variational Autoencoders [video]
- course.fast.ai lesson 11 (seq2seq, attention)
Optional:
- OpenAI: Generative Models [blog]
- Introducing Variational Autoencoders (in Prose and Code) [blog]
- Under the Hood of the Variational Autoencoder (in Prose and Code) [blog]
- Kingma, D. P., Welling, M. (2014). Auto-Encoding Variational Bayes. [paper]
- Practical PyTorch: Translation with a Sequence to Sequence Network and Attention [tutorial]
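The core mechanic from the Kingma & Welling paper above, in a few lines: the reparameterization trick keeps the sampling step differentiable, and the KL term regularizes the latent space toward a standard normal. Shapes are placeholders.

```python
import torch

def reparameterize(mu, logvar):
    """Draw z ~ N(mu, sigma^2) differentiably: z = mu + sigma * eps."""
    std = torch.exp(0.5 * logvar)
    eps = torch.randn_like(std)
    return mu + eps * std

def kl_divergence(mu, logvar):
    """KL(q(z|x) || N(0, I)), summed over latent dims, averaged over the batch."""
    return -0.5 * torch.mean(torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=1))

mu, logvar = torch.zeros(8, 32), torch.zeros(8, 32)   # encoder outputs (placeholders)
z = reparameterize(mu, logvar)                        # goes to the decoder
loss_kl = kl_divergence(mu, logvar)                   # added to the reconstruction loss
```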
Resources:
- Bahdanau, D., Cho, K., Bengio, Y. (2014). Neural Machine Translation by Jointly Learning to Align and Translate. [paper]
- Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., Polosukhin, I. (2017). Attention Is All You Need. [transformer paper] [blog]
- course.fast.ai lesson 11 (seq2seq, attention)
- Distill.pub: Attention and Augmented Recurrent Neural Networks [blog]
Optional:
- Howard, J., Ruder, S. (2018). Universal Language Model Fine-tuning for Text Classification. [paper] [blog] [code]
- How to Visualize Your Recurrent Neural Network with Attention in Keras [blog]
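A minimal version of the scaled dot-product attention at the heart of “Attention Is All You Need” (single head, no masking), mostly to keep the shapes straight.

```python
import math
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v):
    """softmax(QK^T / sqrt(d_k)) V, as in Vaswani et al. (2017)."""
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)   # (batch, q_len, k_len)
    weights = F.softmax(scores, dim=-1)
    return weights @ v, weights

q = torch.rand(2, 5, 64)    # queries (batch, q_len, d_k)
k = torch.rand(2, 7, 64)    # keys    (batch, k_len, d_k)
v = torch.rand(2, 7, 64)    # values  (batch, k_len, d_v)
out, attn = scaled_dot_product_attention(q, k, v)       # out: (2, 5, 64)
```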
- textgenrnn
- LIME/Anchor/SHAP
- Input gradients (a small saliency sketch follows this week’s reading list)
- "Seeing what a neuron sees" à la Karpathy's char-rnn [blog] [paper]
Resources:
- Distill.pub: The Building Blocks of Interpretability [blog]
- Ribeiro, M. T., Singh, S., Guestrin, C. (2016). "Why Should I Trust You?": Explaining the Predictions of Any Classifier. [LIME project] [paper]
- Ross, A. S., Hughes, M. C., Doshi-Velez, F. (2017). Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations. [input gradients paper]
- Ribeiro, M. T., Singh, S., Guestrin, C. (2018). Anchors: High-Precision Model-Agnostic Explanations. [Anchor project] [paper]
Optional:
- "Awesome Interpretable Machine Learning": an opinionated list of resources facilitating model interpretability (introspection, simplification, visualization, explanation) [repo]
- How neural networks learn - Part I: Feature Visualization [video]
- DeepMind: Understanding deep learning through neuron deletion [blog]
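For the input-gradients idea, a rough saliency sketch: take the gradient of the predicted character’s log-probability with respect to the input embeddings and report its norm per position. It assumes a model shaped like the CharLSTM sketch from the earlier week (an `embed` layer, an `lstm`, and an `out` projection); the usage line is illustrative only.

```python
import torch

def input_gradient_saliency(model, token_ids, position):
    """Norm of d log p(predicted char at `position`) / d input embeddings."""
    embeds = model.embed(token_ids).detach().requires_grad_(True)
    hidden, _ = model.lstm(embeds)
    logits = model.out(hidden)
    log_probs = torch.log_softmax(logits[0, position], dim=-1)
    log_probs[log_probs.argmax()].backward()
    return embeds.grad[0].norm(dim=-1)   # one saliency score per input position

# Illustrative usage with the CharLSTM sketch and random character ids:
# saliency = input_gradient_saliency(model, torch.randint(0, 100, (1, 20)), position=19)
```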
Consider bias in the data and inspect what the model has learned:
- Selection bias from source text
- Inspect embeddings and latent dimensions, à la David Ha's World Models
Resources:
- World Models: Can agents learn inside of their own dreams? [demo]
- "Many opportunities for discrimination in deploying machine learning systems" by Hal Daumé III [blog]
Optional:
- Hardt, M., Price, E., Srebro, N. (2016). Equality of Opportunity in Supervised Learning. [paper]
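For “inspect embeddings and latent dimensions”, a quick sketch using scikit-learn: project a learned embedding matrix to 2D with PCA and look at cosine nearest neighbors. The embedding matrix here is a random placeholder.

```python
import numpy as np
from sklearn.decomposition import PCA

embeddings = np.random.rand(100, 64)                      # placeholder learned embedding matrix
coords = PCA(n_components=2).fit_transform(embeddings)    # 2D points to plot

def nearest_neighbors(index, k=5):
    """Indices of the k most cosine-similar embeddings to `index`."""
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = unit @ unit[index]
    return np.argsort(-sims)[1:k + 1]

print(coords[:3])
print(nearest_neighbors(0))
```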
Resources:
- Engel, J., Hoffman, M., Roberts, A. (2017). Latent Constraints: Learning to Generate Conditionally from Unconditional Generative Models. [paper]
- Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014). Generative adversarial nets. [paper]
- Jaques, N., McCleary, J., Engel, J., Ha, D., Bertsch, F., Eck, D., Picard, R. (2018). Learning via social awareness: Improving a deep generative sketching model with facial feedback. [paper]
- course.fast.ai lesson 12 (GANs)
Optional:
- Deep Learning book, chapter 20 (deep generative models)
- Yoav Goldberg. "Adversarial training for discrete sequences (like RNN generators) is hard" [blog]
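A toy GAN loss on continuous vectors, mainly to fix the two-step training loop in my head; as Goldberg’s post above argues, the sampling step for discrete text is non-differentiable, so text GANs need RL-style rewards or continuous relaxations on top of this. The network sizes and the “real” batch are placeholders.

```python
import torch
import torch.nn as nn

latent_dim, data_dim = 16, 32
G = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, data_dim))
D = nn.Sequential(nn.Linear(data_dim, 64), nn.ReLU(), nn.Linear(64, 1))
bce = nn.BCEWithLogitsLoss()
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)

real = torch.rand(8, data_dim)               # placeholder "real" batch
fake = G(torch.randn(8, latent_dim))

# Discriminator step: push real toward 1, generated samples toward 0.
opt_d.zero_grad()
d_loss = bce(D(real), torch.ones(8, 1)) + bce(D(fake.detach()), torch.zeros(8, 1))
d_loss.backward()
opt_d.step()

# Generator step: try to make the discriminator call the fakes real.
opt_g.zero_grad()
g_loss = bce(D(fake), torch.ones(8, 1))
g_loss.backward()
opt_g.step()
```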
Note that the Week 8 project will lead directly into my final (4-week) project!
[Project notes 1: genre and inspiration] [Project notes 2: topic modeling]
- Structured, topical, specific to attributes of the song (similar to ‘essay writing’ goals).
- Conditioned on song representation, source blog (?), sentiment...
- Could choose a limited set of source blogs ("voices"): e.g., Pitchfork, most consistent blogger on HypeM Time Machine, etc.
- Datasets: blog scrape, Spotify API track audio analysis and audio features, Genius API’s song tags, description, lyrics
- Structured, topical, specific to attributes of the song. Genius annotate-ability as a measure of interesting-ness (?)
- Conditioned on song representation
- Datasets: https://www.kaggle.com/rakannimer/billboard-lyrics, Spotify API track audio analysis and audio features, Genius API’s song tags, description
What do I mean by song representation? I want to encode data from the Spotify/Genius knowledge graph for the song (e.g., title, genre, similarity to other songs, audio features, lyrics, description...) and use it to condition/seed the generated text.
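One simple way to condition/seed the generated text on such a representation (a sketch, not a settled design): map the song vector to the decoder’s initial hidden state; concatenating it to every input embedding would be the other obvious option. The dimensions and the SongEncoder-style vector are assumptions carried over from the earlier sketches.

```python
import torch
import torch.nn as nn

class ConditionedDecoder(nn.Module):
    """Character-level LSTM whose initial state is computed from a song vector."""
    def __init__(self, vocab_size, song_dim=64, emb_dim=64, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.init_h = nn.Linear(song_dim, hidden_dim)
        self.init_c = nn.Linear(song_dim, hidden_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens, song_vec):
        h0 = torch.tanh(self.init_h(song_vec)).unsqueeze(0)   # (1, batch, hidden)
        c0 = torch.tanh(self.init_c(song_vec)).unsqueeze(0)
        hidden, _ = self.lstm(self.embed(tokens), (h0, c0))
        return self.out(hidden)

decoder = ConditionedDecoder(vocab_size=100)
song_vec = torch.rand(1, 64)              # e.g., output of the SongEncoder sketch
seed = torch.randint(0, 100, (1, 20))     # seed text as character ids (placeholder)
logits = decoder(seed, song_vec)          # (1, 20, 100): next-character predictions
```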