Data Science for Internet Of Things
Seven books to grasp the mathematical foundations of Data Science
Most people learn Data Science with an emphasis on Programming. However, to be truly proficient with Data Science, you cannot ignore the mathematical foundations behind Data Science. In this post, I present seven books that I enjoyed in learning the mathematical foundations of Data Science. ‘Enjoy’ is perhaps not the best of words since this effort is hard going!
So, why should you undertake the efforts of learning the Maths foundations of Data Science?
Here are some reasons which motivated me:
AI is rapidly changing. Geoffrey Hinton already believes we should rethink backpropagation. Understanding the Maths will help you understand the evolution of AI better. It will help you distinguish from others who approach AI from a superficial level. It will also help you to see the Intellectual Property(IP) potential of AI better. Finally, understanding the Maths behind Data Science could also lead you to the higher end jobs in AI and Data Science.
So, here is the list of books with my comments:
You cannot create a list about Maths books and not include the great Russian mathematicians! So, the first in my list is The Nature of Statistical Learning Theory by Vladimir Vapnik. Of all the books in this list, Vapnik is the hardest to find. I have an older Indian edition. Vladimir Vapnik is the creator of SVM. His Wikipedia pagegives a lot more about his work
Stephen Marsland’s book is now in its second edition. Marsland was one of the earliest books I have read (I only have the first edition). Both are very good. The second edition I believe has lot more code in Python. Like the first two books, this book also places a heavy emphasis on Algorithms.
Hastie is another classic. The version I have is very well printed with colours. This is another reference book.
Pattern Recognition and Machine Learning (Information Science and Statistics) by Christopher M. Bishop is also an in-depth and well-presented reference book.
Finally, my most recommended book
7) Deep Learning – by Goodfellow, Bengio and Corville
If there is one book you should read end to end – it’s this one. Both detailed but also modern covering everything you can think of
Two more worthy additions
- A First Course in Machine Learning, Second Edition (Machine Learning & Pattern Recognition) by Simon Rogers, Mark Girolami This was the first book I got (first edition). I would not recommend it as a beginner, but it is still a great book (esp the second edition)
- Machine Learning: A Probabilistic Perspective by Kevin Murphy is highly rated – but I have not read it personally (and hence was not on the list).
If you can recommend any I have missed, please let me know
Concluding comments:
- Except for possibly the Goodfellow – Bengio book, I would not recommend reading the books cover to cover. I prefer to read the books by topic as needed i.e. as a reference book. I also like examples from different authors ex Duda for fish sorting; - Hastie - with advertising data sales TV and radio; Flach concept of hypothesis space with sea animals example etc
- I find that these books taught me a sense of humility i.e. How little we know and how vast and complex the field is
- These books are timeless. Vladimir Vapnik is now aged 81. Duda was published first in 1973. I expect 50 years from now, the industry would still be reading them. Like old friends who have stood the test of time. That’s a comforting thought. It shows the longevity of the maths based approach.
Comments
Post a Comment