Deep Learning for Computer Vision, Speech, and Language

Tentative Course Schedule

Date	Lectures	Other
class 1 (09/04)	Liangliang, Course overview	1st homework released
class 2 (09/11)	Xiaodong, Kapil: Neural Network Fundamentals and Optimization
class 3 (09/18)	Kapil: Language Representation and Recurrent Nets
class 4 (09/25)	Xiaodong: Deep Learning for Automatic Speech Recognition -- Part I	1st homework due 2nd homework released
class 5 (10/02)	Liangliang: Evolvement of Convolutional Networks. Guest lecturer Lei Zhang: Large Scale Face Recongition with MS Celeb 1M
class 6 (10/09)	Xiaodong: Deep Learning for Automatic Speech Recognition -- Part II	Student paper presentation zx2214,mg3825,yz3170, Towards End-to-End Speech Recognition with Recurrent Neural Networks, [Slide] ss5410,cc4181,ml4025, Deep Speech: Scaling up end-to-end speech recognition, [Slides]
class 7 (10/16)	Xiaodong: Deep Learning for Automatic Speech Recognition -- Part III	Student paper presentation zj2242,zq2154,hz2482, Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition [Slides] ac4218,bj2376,ys3031, Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition, [Slides] jl4924,cz2465,cy2468, Listen, Attend and Spell, [Slides] hl3100,ls3439,jl4930, Toward Human Parity in Conversational Speech Recognition and English Conversational Telephone Speech Recognition by Humans and Machines, [Slides]
class 8 (10/23)	Kapil: Sequence-to-Sequence Architectures	3rd homework Student paper presentation wc2608,jh3853, Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation (also mention Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation), [Slides] jw3535,yg2520,cz2458, Generating Sentences from a Continuous Space, [Slides] cb3331,mz2649,zc2393, Improving Language Understanding by Generative Pre-training (also mention Universal Language Model Fine-tuning for Text Classification), [Slides]
class 9 (10/30)	Liangliang: Fine-Tuning and Adversarial Attack Visual Embedding and  Visual Search	Student paper presentation yl3747,yw2924,yh2950, Densely Connected Convolutional Networks, [Slides] lz2576,jj2883, Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour,[Slides] hc3040,jak2294, Matrix capsules with EM routing, [Slides] zh2318,xw2501,rcj2118, One-Shot Face Recognition via Generative Learning, [Slides]
class 10 (11/06)	University Holiday (no class)
class 11 (11/13)	Kapil: Reinforcement learning and NLP	Student paper presentation xl2680,yg2523,js5334, Mastering the game of Go without human knowledge, [Slides] hs2991,sw3196,ty2362, Learning to Compose Neural Networks for Question Answering and Learning to Reason: End-to-End Module Networks for Visual Question Answering, [Slides] jz2883,jc4805,hd2377, Neural Architecture Search with Reinforcement Learning, [Slides]
class 12 (11/20)	Guest lecturer Noel Codella: Medical image understanding	Student paper presentation xz2631,yz3169,xz2663, Dermatologist-level classification of skin cancer with deep neural networks, [Slides] xl2699,hx2224,hz2489, Adversarial examples for evaluating reading comprehension systems, [Slides] hh2699,th2713,jm4743, Visual Rhythm and Beat, [Slides]
class 13 (11/27)	Guest lecturer Honghui Shi: Object Detection Guest lecturer Andrew Kae: Generative Adversarial Networks
class 14 (12/04)	No class
class 15 (12/11, at 750 CEPSR)	Final presentations and demos	The poster session will be at 750 CEPSR (no longer in Mudd)