GET /api/v2/video/2157
HTTP 200 OK
Vary: Accept
Content-Type: text/html; charset=utf-8
Allow: GET, PUT, PATCH, HEAD, OPTIONS
{ "category": "SciPy 2013", "language": "English", "slug": "intro-to-scikit-learn-i-scipy2013-tutorial-pa-7", "speakers": [], "tags": [ "Tech" ], "id": 2157, "state": 1, "title": "Intro to scikit-learn (I), SciPy2013 Tutorial, Part 1 of 3", "summary": "Presenters: Ga\u00ebl Varoquaux, Jake Vanderplas, Olivier Grisel\n\nDescription\n\nMachine Learning is the branch of computer science concerned with the development of algorithms which can learn from previously-seen data in order to make predictions about future data, and has become an important part of research in many scientific fields. This set of tutorials will introduce the basics of machine learning, and how these learning tasks can be accomplished using Scikit-Learn, a machine learning library written in Python and built on NumPy, SciPy, and Matplotlib. By the end of the tutorials, participants will be poised to take advantage of Scikit-learn's wide variety of machine learning algorithms to explore their own data sets. The tutorial will comprise two sessions, Session I in the morning (intermediate track), and Session II in the afternoon (advanced track). Participants are free to attend either one or both, but to get the most out of the material, we encourage those attending in the afternoon to attend in the morning as well.\n\nSession I will assume participants already have a basic knowledge of using numpy and matplotlib for manipulating and visualizing data. It will require no prior knowledge of machine learning or scikit-learn. The goals of Session I are to introduce participants to the basic concepts of machine learning, to give a hands-on introduction to using Scikit-learn for machine learning in Python, and give participants experience with several practical examples and applications of applying supervised learning to a variety of data. 
It will cover basic classification and regression problems, regularization of learning models, basic cross-validation, and some examples from text mining and image processing, all using the tools available in scikit-learn.\n\nOutline\n\nTutorial 1 (intermediate track)\n\n0:00 - 0:15 -- Setup and Introduction\n0:15 - 0:30 -- Quick review of data visualization with matplotlib and numpy\n0:30 - 1:00 -- Representation of data in machine learning\nDownloading data within scikit-learn\nCategorical & Image data\nExercise: vectorization of text documents\n1:00 - 2:00 -- Basic principles of Machine Learning & the scikit-learn interface\nSupervised Learning: Classification & Regression\nUnsupervised Learning: Clustering & Dimensionality Reduction\nExample of PCA for data visualization\nFlow chart: how do I choose what to do with my data set?\nExercise: Interactive Demo on linearly separable data\nRegularization: what it is and why it is necessary\n2:00 - 2:15 -- Break (possibly in the middle of the previous section)\n2:15 - 3:00 -- Supervised Learning\nExample of Classification: hand-written digits\nCross-validation: measuring prediction accuracy\nExample of Regression: boston house prices\n3:00 - 4:15 -- Applications\nExamples from text mining\nExamples from image processing\n\n\n\nRequired Packages\n\nThis tutorial will use Python 2.6 / 2.7, and require recent versions of numpy (version 1.5+), scipy (version 0.10+), matplotlib (version 1.1+), scikit-learn (version 0.13.1+), and IPython (version 0.13.1+) with notebook support. The final requirement is particularly important: participants should be able to run IPython notebook and create & manipulate notebooks in their web browser. 
The easiest way to install these requirements is to use a packaged distribution: we recommend Anaconda CE, a free package provided by Continuum Analytics: or the Enthought Python Distribution:", "description": "", "quality_notes": "", "copyright_text": "", "embed": "<object width=\"640\" height=\"390\"><param name=\"movie\" value=\";hl=en_US\"></param><param name=\"allowFullScreen\" value=\"true\"></param><param name=\"allowscriptaccess\" value=\"always\"></param><embed src=\";hl=en_US\" type=\"application/x-shockwave-flash\" width=\"640\" height=\"390\" allowscriptaccess=\"always\" allowfullscreen=\"true\"></embed></object>", "thumbnail_url": "", "duration": null, "video_ogv_length": null, "video_ogv_url": null, "video_ogv_download_only": false, "video_mp4_length": null, "video_mp4_url": null, "video_mp4_download_only": false, "video_webm_length": null, "video_webm_url": null, "video_webm_download_only": false, "video_flv_length": null, "video_flv_url": null, "video_flv_download_only": false, "source_url": "", "whiteboard": "needs editing", "recorded": "2013-06-27", "added": "2013-07-04T10:09:04", "updated": "2014-04-08T20:28:26.481" }