Word Count in big data is the equivalent of 'Hello World' in programming.Word Count on US State of the Union (SoU) Addresses The html source url of this databricks notebook and its recorded Uji : Databricks notebook source exported at Sat, 05:09:42 UTC Scalable Data Science prepared by Raazesh Sainudiin and Sivanand Sivaram Introduction to XML-parsing of Old Bailey Online Shakira Suwan, Change Detection in Random Graph Seriesĭatabricksified Spark SQL Programming Guide 1.6ĭatabricksified Data Types in MLLib Programming Guide 1.6 Xin Zhao, Higher Order Spectral CLustering Scalable Spatio-temporal Constraint Satisfactionįiltered_Tweets_Collector_Set-up_by_Keywords_and_Hashtagsįiltered_Tweets_Collector_Set-up_by_Classīinary_classification_with_Loop_TweetDataSet Introduction to Magellan for Scalable Geospatial Analyticsĭillon George, Scalable Geospatial Algorithms Week 10: Scalable Geospatial Analytics with Magellan Scalable Object Identification with Sparkling TensorFlow Week 9: Deep Learning, Convolutional Neural Nets, Sparkling Water and Tensor Flow HOMEWORK: On-Time Flight Performance with GraphFrames Week 8: Graph Querying in GraphFrames and Distributed Vertex Programming in GraphX HOMEWORK: Introduction to XML-parsing of Old Bailey Online Week 7: Probabilistic Topic Modelling via Latent Dirichlet Allocation and Intro to XML-parsing of Old Bailey Online
Elements of programming interviews in java ebook full#
Streaming Model-Prediction Server, the Full Powerplant Pipeline Week 6: Introduction to Spark Streaming, Twitter Collector, Top Hashtag Counter and Streaming Model-Prediction Server Power Plant Pipeline: Model, Tune, Evaluate HOMEWORK: Spark Data Types for Distributed Linear Algebra HOMEWORK: breeze linear algebra cheat sheetĭistributed Linear Algebra for Linear Regression Introduction Week 5: Introduction to Non-distributed and Distributed Linear Algebra and Applied Linear Regression Supervised Classification of Hand-written Digits via Decision Trees Unsupervised Clustering of 1 Million Songs via K-Means in 3 Stages Week 4: Introduction to Machine Learning - Unsupervised Clustering and Supervised Classification Week 3: Introduction to Spark SQL, ETL and EDA of Diamonds, Power Plant and Wiki CLick Streams Data HOMEWORK: RDDs, Transformations and ActionsĮXTRA_Word Count: ETL of US State of Union Addesses Week 2: Introduction to Spark RDDs, Transformations and Actions and Word Count of the US State of the Union Addresses Week 1: Introduction to Scalable Data Science