104971003 蔡詠捷 104971012 陳映孜 104971018 王登文
Data set: https://www.kaggle.com/census/2013-american-community-survey
The American Community Survey is an ongoing survey from the US Census Bureau. In this survey, approximately 3.5 million households per year are asked detailed questions about who they are and how they live. Many topics are covered, including ancestry, education, work, transportation, internet use, and residency.
https://www.kaggle.com/snap/amazon-fine-food-reviews
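https://www.kaggle.com/kaggle/college-scorecard
The dataset is about 570 MB. Using the College Scorecard data, tuition and student loan figures, and post-graduation income, recommend which schools are worth attending, or predict which schools' rankings will rise in the future.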
Run
> Edit
> Add -Dspark.master=local[2]
to the VM options field. This runs Spark locally with 2 worker threads.
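The VM option above works because -D sets a JVM system property, and a default SparkConf picks up any system property whose name starts with "spark.". The following is a minimal sketch of that mechanism using only the Java standard library, not Spark itself; the class name SparkMasterProperty and the helpers sparkProperties and localThreads are hypothetical names introduced for illustration.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper class for illustration; it is not part of Spark.
class SparkMasterProperty {

    // Collects every JVM system property whose key starts with "spark.",
    // which is the set of properties a default SparkConf would load.
    static Map<String, String> sparkProperties() {
        Map<String, String> result = new TreeMap<>();
        for (String key : System.getProperties().stringPropertyNames()) {
            if (key.startsWith("spark.")) {
                result.put(key, System.getProperty(key));
            }
        }
        return result;
    }

    // Returns the worker thread count implied by a local master string:
    // "local" -> 1, "local[N]" -> N, "local[*]" -> number of logical cores.
    // Returns -1 for anything else (for example a cluster URL).
    static int localThreads(String master) {
        if (master.equals("local")) {
            return 1;
        }
        if (master.startsWith("local[") && master.endsWith("]")) {
            String inner = master.substring("local[".length(), master.length() - 1);
            if (inner.equals("*")) {
                return Runtime.getRuntime().availableProcessors();
            }
            try {
                return Integer.parseInt(inner);
            } catch (NumberFormatException e) {
                return -1;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // Prints whatever spark.* properties were passed with -D on the command line.
        Map<String, String> props = sparkProperties();
        System.out.println("spark.* system properties: " + props);
        String master = props.get("spark.master");
        if (master != null) {
            System.out.println("local worker threads: " + localThreads(master));
        }
    }
}
```

Running this class with -Dspark.master=local[2] in the VM options should list spark.master among the collected properties; running it without the option leaves the map empty, which corresponds to the case where Spark has no master URL configured.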