• Review of the fundamentals of probability theory, statistics and basic linear algebra notions. • Installation and introduction to common ML software, which includes the introduction on the use of R and Python as needed for this course. • Introduction to linear, multiple and logistic regression. • Model selection, regularization and cross-validation: • Applications I: Introduction to NLP and classifying text data using logistic regression and naive Bayes. • Introduction to support vector machines (SVM): Applications II: Classifying text data using SVM classifiers. • Introduction to neural networks (NN): Applications III: Classifying text data and image data using recurrent and convolutional NN. • Unsupervised learning: K-Means Clustering and Hierarchical Clustering: Applications IV: Google?fs PageRank algorithm. |