There are lots of github repositories and blogs that have more exhaustive resources than what I’ve put down here that covers a whole lot. But these would be good to start off, I’ll put together an intermediate list in the next post.

Understanding Anaconda distribution & using it for Machine Learning(ML)

As someone starting off with ML & using Python to do it, you will have to understand that ML or Data Science is more of sharing, collaboration and involves you having to share all your findings/research/outcomes to the stake-holders or engineering folks that would use it. Anaconda distribution is one such means through which you will keep yourself sane as well as keep others sane.

Right from the start, try using Jupyter notebooks rather than an IDE, this will help you to explore more and document your findings/issues/quirks without leaving the notebook.

Basics of Python:

Extra Resources:

Statistics & Machine Learning basics

Python for Data Science

You have to learn about a few libraries that are core pillars of Python:

NOTE: Click here for an umbrella source for Data Science using Python

A few blogs that regularly you will need to refer regularly

Generic Reading

Regular Reading

Books that would help

Do let me know if this a good list for a beginner or would have to add/remove anything. Enjoy & learn!