Cosine Similarity
Cosine Similarity - Understanding the math and how it works? (with python)
https://www.machinelearningplus.com/nlp/cosine-similarity/
Probability
Regularization: Simple Definition, L1 & L2 Penalties - Statistics How To
https://www.statisticshowto.datasciencecentral.com/regularization/The Normal Distribution
https://web.sonoma.edu/users/w/wilsonst/papers/Normal/default.htmlTroeger Teaching
https://warwick.ac.uk/fac/soc/economics/staff/vetroeger/teaching/1.4 - Likelihood & LogLikelihood | STAT 504
https://newonlinecourses.science.psu.edu/stat504/node/27/
General
Difference Between Machine Learning And Deep Learning
http://www.iamwire.com/2017/11/difference-between-machine-learning-and-deep-learning/169100
Prior Knowledge
OLS Derivation
https://stats.seandolinar.com/ols-derivation/What You Must Know Before You Dive Into Machine Learning - DZone AI
https://dzone.com/articles/what-you-must-know-before-you-dive-into-machine-le
Terminology
1.4 - Likelihood & LogLikelihood | STAT 504
https://newonlinecourses.science.psu.edu/stat504/node/27/What is the difference between "likelihood" and "probability"?
https://stats.stackexchange.com/questions/2641/what-is-the-difference-between-likelihood-and-probabilityWhat does fitting a model mean in data science? - Quora
https://www.quora.com/What-does-fitting-a-model-mean-in-data-science
Background Theories
- http://cs229.stanford.edu/notes/cs229-notes1.pdf
Loss Functions and Optimization Algorithms. Demystified.
https://medium.com/data-science-group-iitr/loss-functions-and-optimization-algorithms-demystified-bb92daff331cA Friendly Introduction to Cross-Entropy Loss
https://rdipietro.github.io/friendly-intro-to-cross-entropy-loss/A Short Introduction to Entropy, Cross-Entropy and KL-Divergence
https://www.youtube.com/watch?v=ErfnhcEV1O8&t=274sVisual Information Theory -- colah's blog
http://colah.github.io/posts/2015-09-Visual-Information/
Loss function
Logistic Regression — Gradient Descent Optimization — Part 1
https://medium.com/technology-nineleaps/logistic-regression-gradient-descent-optimization-part-1-ed320325a67e- http://www.ccs.neu.edu/home/vip/teach/MLcourse/2_GD_REG_pton_NN/lecture_notes/lecture_regression_GD.pdf
- https://cmci.colorado.edu/classes/INFO-4604/files/slides-4_optimization.pdf
Understanding binary cross-entropy / log loss: a visual explanation
https://towardsdatascience.com/understanding-binary-cross-entropy-log-loss-a-visual-explanation-a3ac6025181aLoss Functions — ML Cheatsheet documentation
https://ml-cheatsheet.readthedocs.io/en/latest/loss_functions.html
Lecture Notes
Index of /files
https://cs230.stanford.edu/files/Index of /notes/cs229-notes-all
http://cs229.stanford.edu/notes/cs229-notes-all/