Economics 420/706

This page is for the course on Machine Learning

Class Notices:

My office is Leacock 321C.
The method of evaluation is that the course grade will be based on two assignments, plus the final project.

The class outline is here as a PDF file, and here as HTML. Some of the information it contains is also given below.

The first text for the course is a book by Aurélien Géron, which I have found to be very useful for learning how to program machine-learning algorithms. The book was originally called Hands-on Machine Learning with SciKit-Learn and TensorFlow , and it is available from the O'Reilly website, to which I believe McGill people can get free access. However, the book has been updated, with the new title Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition . Keras is a software layer that makes programming an algorithm even easier than with TensorFlow, the leading platform until very recently.

The second text for the course is Deep Learning, by Ian Goodfellow, Yoshua Bengio, and Aaron Courville. It is now quite old. I used to use it as the main text, and it is still useful for many theoretical considerations.

This link takes you to the website for the book. Its contents are completely available online. The book is also available in hardcover . It is published by the MIT Press.

The third text is one that I came across just recently. It is GANs in Action, by Jakub Langr and Vladimir Bok, Manning. I acquired it as an ebook, and I don't know if it is available in a hardcopy version. Although GANs are a quite advanced topic, the first part of this book is a rather good, non-mathematical introduction to many of the topics we will consider.

Provisionally, here is a summary of the topics I covered over the last few years, and hope to be able to cover this year as well, plus one addition, the last in the list.

Random forests
Artificial neural networks (ANN)
Deep feedforward networks and back-propagation
Regularisation and optimisation of deep networks
Convolutional networks
Recurrent networks
Gated units
Auto-encoders
Generative Adversarial Networks (GANs)

Software:

In the last few years, I have been working with Python, an interpreted language that has, for the most part, a straightforward syntax, and can be learnt swiftly by anyone with even just a little experience of programming. (Prefer Python 3 to Python 2. The two versions are not completely inter-compatible, and Python 3 provides better functionality.)

The relative simplicity of programming in Python is probably the main reason for which Python has quite the best set of libraries for machine learning, and not just for deep learning. Although deep learning will be the main focus of the course, I plan to look at some other machine-learning techniques, for which the Python libraries are equally useful. The study of some of these other techniques reveals how much all modern machine-learning approaches have in common, despite the fact that some are much better adapted than others for specific applications.

Resources

There is a super-abundance of resources available online for studying machine learning, and for implementing it. Machine learning is often coupled with the buzzword Big Data, and this is simply because machines usually learn better if they have a lot of data available to train their algorithms. Many big datasets are available online, the best known, and probably the most comprehensive, being

https://www.kaggle.com/datasets

Here are some of the available resources which I found useful. I will add to the list as the term proceeds.

Deep Learning, Goodfellow, Bengio, and Courville, MIT Press (2016) -- our main text.

Hands-on Machine Learning with SciKit-Learn, Keras, and TensorFlow, Aurélien Géron, O'Reilly 2022. This is the latest edition, the existence of which some of you pointed out to me - thanks!
This is an excellent reference for people wishing to learn how to use the Python libraries named in the title of the book. It really is hands-on, with detailed instructions for the examples given.

GANs in Action, Jakub Langr and Vladimir Bok, Manning
This book covers some pretty advanced topics, but using only the most basic mathematics. Myself, I think that some topics would be clearer with just a little more sophisticated math. The idea, as the title suggests, is to exposit the Generative Adversarial Networks, introduced mainly by Ian Goodfellow in his PhD thesis at the Université de Montréal. In order to do so, it has to provide a rather extensive overview of many of the topics we cover at an earlier stage, and so it can serve as a valuable additional reference.

Computer Age Statistical Inference, Bradley Efron and Trevor Hastie, Cambridge University Press, 2016
This expensive book by two distinguished statisticians covers a great deal of ground, from elementary statistical ideas through methods developed in the twentieth century up to 2016, the date of publication of the book. The third and final Part of the book treats many of the topics we are studying in this course, including Deep Learning.

By following this link to the file SVD.pdf, you will find a treatment of the singular value decomposition (SVD), the generalised inverse of a matrix, and of Principal Components Analysis. I think it is clearer than the treatment in the Deep Learning book.

Follow this link for the file backprop.pdf, in which the algorithm for a forward pass followed by backpropagation is laid out in detail. A picture may follow, but it will take me a while to do it to my satisfaction.

A couple of years ago, Alireza Alavi drew my attention to a couple of videos on Youtube, which give a reasonably decent explanation of backpropagation. Here are the links:
- What is backpropagation really doing?
- Backpropagation calculus

Log of material covered:

Assignments:

To send me email, click here or write directly to russell.davidson@mcgill.ca.

Back to the main page of this site

URL: https://russell-davidson.research.mcgill.ca/e706/