GET /api/v2/video/2121
HTTP 200 OK Vary: Accept Content-Type: text/html; charset=utf-8 Allow: GET, PUT, PATCH, HEAD, OPTIONS
{ "category": "SciPy 2013", "language": "English", "slug": "processing-biggish-data-on-commodity-hardware-si-1", "speakers": [], "tags": [ "Tech" ], "id": 2121, "state": 1, "title": "Processing biggish data on commodity hardware: simple Python patterns; SciPy 2013 Presentation", "summary": "Authors: Author: Gael Varoquaux Institution: INRIA, Parietal team\n\nTrack: Machine Learning\n\nWhile big data spans many terabytes and requires distributed computing, most mere mortals deal with gigabytes. In this talk I will discuss our experience in applying efficiently machine learning to hundreds of gigabytes on commodity hardware. In particular, I will discuss patterns implemented in two Python libraries, joblib and scikit-learn, dissecting why they help addressing big data and how to implement them efficiently with simple tools.\n\nIn particular, I will cover:\n\nOn the fly data reduction\nOn-line algorithms and out-of-core computing\nParallel computing patterns: performance outside of a framework\nCaching of common operations, with efficient hashing of arbitrary\nPython objects and a robust datastore relying on Posix disk semantics\nThe talk will illustrate the high-level concepts introduced with detailed technical discussions on Python implementations, based both on examples using scikit-learn and joblib and on an analysis of how these libraries work. The goal here is less to sell the libraries themselves than to share the insights gained in using and developing them.", "description": "", "quality_notes": "", "copyright_text": "", "embed": "<object width=\"640\" height=\"390\"><param name=\"movie\" value=\";hl=en_US\"></param><param name=\"allowFullScreen\" value=\"true\"></param><param name=\"allowscriptaccess\" value=\"always\"></param><embed src=\";hl=en_US\" type=\"application/x-shockwave-flash\" width=\"640\" height=\"390\" allowscriptaccess=\"always\" allowfullscreen=\"true\"></embed></object>", "thumbnail_url": "", "duration": null, "video_ogv_length": null, "video_ogv_url": null, "video_ogv_download_only": false, "video_mp4_length": null, "video_mp4_url": null, "video_mp4_download_only": false, "video_webm_length": null, "video_webm_url": null, "video_webm_download_only": false, "video_flv_length": null, "video_flv_url": null, "video_flv_download_only": false, "source_url": "", "whiteboard": "needs editing", "recorded": "2013-07-01", "added": "2013-07-04T10:08:57", "updated": "2014-04-08T20:28:26.425" }