ML - hw1

1. Machine Learning Problems

(a) 1. BF,2. C,3. AD,4. G,5. AE,6. A,7. BF,8. AE,9. BG

(b) False. Although a large number of data can train an excellent model working quite well in data resource, or training data, we focus more on the model performance on the test data or the model generalization ability.

  • Maximizing performance on the whole dataset may result in severe overfitting.
  • On the other hand, using all the data will consume more computation and time.

2. Bayes Decision Rule

maximum likelihood decision rule
optimal bayes decision rule

3. Gaussian Discriminant Analysis and MLE

c.

Gaussian Boundaries

4. Text Classification with Naive Bayes

a. 10 words

ooking  9453
voip    9494
computron   13613
nbsp    30033
meds    37568
pills   38176
cialis  45153
sex 56930
php 65398
viagra  75526

b. accuracy = 0.9857315598548972

c. False. When the ratio of spam and ham is 1:99, the spam filter can easily to find ham emails but may regard spam email as the ham email, too.

d.
precision = 0.9750223015165032
recall = 0.9724199288256228

e. Precision really matters. In this condition, it can find more spams.

To identify drugs and bombs at an airport, I think the recall is more important, because we must find all the bombs to make sure the safety.

最后編輯于
?著作權歸作者所有,轉載或內容合作請聯系作者
【社區(qū)內容提示】社區(qū)部分內容疑似由AI輔助生成,瀏覽時請結合常識與多方信息審慎甄別。
平臺聲明:文章內容(如有圖片或視頻亦包括在內)由作者上傳并發(fā)布,文章內容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務。

相關閱讀更多精彩內容

  • rljs by sennchi Timeline of History Part One The Cognitiv...
    sennchi閱讀 7,854評論 0 10
  • 從今天開始,寫一下我在刷 LeetCode 時的心得體會,包括自己的思路和別人的優(yōu)秀思路,歡迎各種監(jiān)督??! ...
    秋名山菜車手閱讀 982評論 0 1
  • “唔,疼?!蔽乙恍褋恚透械缴砩嫌兴盒牧逊蔚奶弁?。低頭一看,渾身血跡甚是嚇人,衣裙亦被刮的破洞四處。往上望...
    梨主閱讀 294評論 0 0
  • 冒泡排序法相信很多人剛開始接觸C語言的時候就已經很熟悉了,那么我們今天就來使用冒泡排序法進行簡單的算法練習。 結果...
    少寨主的互聯網洞察閱讀 170評論 0 0

友情鏈接更多精彩內容