Deep Learning for Computer Vision, Speech, and Language

Tentative Course Schedule
Date Lectures Other
class 1 (09/04) Liangliang, Course overview 1st homework released
class 2 (09/11) Xiaodong, Kapil: Neural Network Fundamentals and Optimization
class 3 (09/18) Kapil: Language Representation and Recurrent Nets
class 4 (09/25) Xiaodong: Deep Learning for Automatic Speech Recognition -- Part I 1st homework due
2nd homework released
class 5 (10/02) Liangliang: Evolvement of Convolutional Networks.
Guest lecturer Lei Zhang: Large Scale Face Recongition with MS Celeb 1M
class 6 (10/09) Xiaodong: Deep Learning for Automatic Speech Recognition -- Part II Student paper presentation
zx2214,mg3825,yz3170, Towards End-to-End Speech Recognition with Recurrent Neural Networks, [Slide]
ss5410,cc4181,ml4025, Deep Speech: Scaling up end-to-end speech recognition, [Slides]
class 7 (10/16) Xiaodong: Deep Learning for Automatic Speech Recognition -- Part III Student paper presentation
zj2242,zq2154,hz2482, Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition[Slides]
ac4218,bj2376,ys3031, Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition, [Slides]
jl4924,cz2465,cy2468, Listen, Attend and Spell, [Slides]
hl3100,ls3439,jl4930, Toward Human Parity in Conversational Speech Recognition and English Conversational Telephone Speech Recognition by Humans and Machines, [Slides]
class 8 (10/23) Kapil: Sequence-to-Sequence Architectures 3rd homework

Student paper presentation
wc2608,jh3853, Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation (also mention Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation), [Slides]
jw3535,yg2520,cz2458, Generating Sentences from a Continuous Space, [Slides]
cb3331,mz2649,zc2393, Improving Language Understanding by Generative Pre-training (also mention Universal Language Model Fine-tuning for Text Classification), [Slides]
class 9 (10/30) Liangliang:
Fine-Tuning and Adversarial Attack
Visual Embedding and 
Visual Search

Student paper presentation
yl3747,yw2924,yh2950, Densely Connected Convolutional Networks, [Slides]
lz2576,jj2883, Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour,[Slides]
hc3040,jak2294, Matrix capsules with EM routing, [Slides]
zh2318,xw2501,rcj2118, One-Shot Face Recognition via Generative Learning, [Slides]
class 10 (11/06) University Holiday (no class)
class 11 (11/13) Kapil: Reinforcement learning and NLP Student paper presentation
xl2680,yg2523,js5334, Mastering the game of Go without human knowledge, [Slides]
hs2991,sw3196,ty2362, Learning to Compose Neural Networks for Question Answering and Learning to Reason: End-to-End Module Networks for Visual Question Answering, [Slides]
jz2883,jc4805,hd2377, Neural Architecture Search with Reinforcement Learning, [Slides]
class 12 (11/20) Guest lecturer Noel Codella: Medical image understanding Student paper presentation
xz2631,yz3169,xz2663, Dermatologist-level classification of skin cancer with deep neural networks, [Slides]
xl2699,hx2224,hz2489, Adversarial examples for evaluating reading comprehension systems, [Slides]
hh2699,th2713,jm4743, Visual Rhythm and Beat, [Slides]
class 13 (11/27) Guest lecturer Honghui Shi: Object Detection
Guest lecturer Andrew Kae: Generative Adversarial Networks
class 14 (12/04) No class
class 15 (12/11, at 750 CEPSR) Final presentations and demos The poster session will be at 750 CEPSR (no longer in Mudd)