• Review of the fundamentals of probability theory, statistics and basic linear algebra notions.
• Installation and introduction to common ML software, which includes the introduction on the use of R and Python as needed for this course.
• Introduction to linear, multiple and logistic regression.
• Model selection, regularization and cross-validation:
• Applications I: Introduction to NLP and classifying text data using logistic regression and naive Bayes.
• Introduction to support vector machines (SVM): Applications II: Classifying text data using SVM classifiers.
• Introduction to neural networks (NN): Applications III: Classifying text data and image data using recurrent and convolutional NN.
• Unsupervised learning: K-Means Clustering and Hierarchical Clustering: Applications IV: Google?fs PageRank algorithm. |